Citibike Ridership Data Neighborhood Identifier

This is a Pandas script that matches the start/end coordinates of Citibike rides to NYC neighborhoods.

Why?

Lyft, the owner of Citibike, publishes anonymized ridership data at the start of every month. These datasets record where and when each ride starts and ends, and whether the rider held a Citibike subscription. While the data is used in Lyft's own reports, it is also available to the public.

One thing Lyft does not do is identify which neighborhoods the start/end coordinates fall in. I was working on a data story about the effects of congestion pricing in NYC and found that gap frustrating.

How?

I used the official shapefile for the 2020 NYC Tabulation Areas and Nominatim to match each pair of coordinates to the neighborhood it falls in. I added two columns (start_neighborhood and end_neighborhood) so the data would be easier to work with.
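To illustrate the matching step, here is a minimal sketch of a point-in-polygon spatial join with GeoPandas. It is not necessarily how the notebook does it (it leaves Nominatim out entirely). The helper add_neighborhood, the coordinate column names (start_lng, start_lat, end_lng, end_lat), the polygon column name ("name"), and the two toy rectangles are all assumptions made for the example.

```python
import geopandas as gpd
import pandas as pd
from shapely.geometry import box


def add_neighborhood(rides, areas, lng_col, lat_col, out_col, name_col="name"):
    """Return a copy of `rides` with `out_col` naming the area that contains each point.

    `add_neighborhood` and its parameters are hypothetical names for this sketch.
    """
    # Build point geometries from longitude/latitude (assumed to be WGS84).
    points = gpd.GeoDataFrame(
        rides[[lng_col, lat_col]],
        geometry=gpd.points_from_xy(rides[lng_col], rides[lat_col]),
        crs="EPSG:4326",
    )
    # Reproject the polygons to the points' CRS before joining.
    polygons = areas[[name_col, "geometry"]].to_crs(points.crs)
    # Left join: every ride is kept, unmatched points get a missing value.
    joined = gpd.sjoin(points, polygons, how="left", predicate="within")
    # Guard against a point matching more than one polygon; keep the first match.
    joined = joined[~joined.index.duplicated(keep="first")]
    result = rides.copy()
    result[out_col] = joined[name_col]
    return result


# Toy stand-in for the shapefile: two adjacent rectangles with made-up names.
areas = gpd.GeoDataFrame(
    {"name": ["West Side", "East Side"]},
    geometry=[box(-74.02, 40.70, -74.00, 40.72), box(-74.00, 40.70, -73.98, 40.72)],
    crs="EPSG:4326",
)

# Toy rides; the second ride ends outside both rectangles.
rides = pd.DataFrame(
    {
        "start_lng": [-74.01, -73.99],
        "start_lat": [40.71, 40.71],
        "end_lng": [-73.99, -73.90],
        "end_lat": [40.71, 40.71],
    }
)

demo = add_neighborhood(rides, areas, "start_lng", "start_lat", "start_neighborhood")
demo = add_neighborhood(demo, areas, "end_lng", "end_lat", "end_neighborhood")
print(demo[["start_neighborhood", "end_neighborhood"]])
```

With real data, areas would come from calling gpd.read_file on the shapefile in the shapefiles folder, and name_col would be whichever column in that file holds the neighborhood name.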

I worked with the March 2024 and March 2025 datasets, but any month will do: go to the Lyft website and look up the name of the file you wish to import. By following the steps in the notebook, you can learn to perform a spatial join from start to finish for your own projects.
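As a rough sketch of the import step, the snippet below loads one monthly file with Pandas and keeps only the columns needed for the neighborhood match. The helper load_rides and the column names in COLUMNS are assumptions; check them against the header of the file you downloaded.

```python
import pandas as pd

# Assumed column names; adjust to match the header of your file.
COLUMNS = [
    "started_at",
    "ended_at",
    "start_lat",
    "start_lng",
    "end_lat",
    "end_lng",
    "member_casual",
]
COORDS = ["start_lat", "start_lng", "end_lat", "end_lng"]


def load_rides(path):
    """Read one month of rides from a CSV path or file-like object (hypothetical helper)."""
    rides = pd.read_csv(
        path,
        usecols=COLUMNS,
        parse_dates=["started_at", "ended_at"],
    )
    # A ride with a missing coordinate cannot be matched to a neighborhood.
    return rides.dropna(subset=COORDS).reset_index(drop=True)
```

Calling load_rides with the name of the file you downloaded returns a DataFrame ready for the neighborhood match.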

Results

As for my personal project, here are some graphics (created with Datawrapper) that I was able to include in my data story.

Use, sharing, contributing

Feel free to use this code however you want. Perhaps in the future I can turn this into a library of useful Pandas scripts for data journalists.

Notes

Please do not remove or modify the shapefiles folder.
This notebook contains functions specific to my project (I was researching the rides between Brooklyn and Manhattan, specifically the congestion relief zone). Feel free to remove them before running the script on your machine.

vasilybels/bikeshare
