#Enhanced Divvy data
Divvy is hosting its second annual Data Challenge in 2015, with submissings due March 18, 2015 (prizes include an Xbox One and a Windows phone). Divvy Munging is a small project to provide modified & enhanced data so entrants can create a project faster.
- Divvy's data, a package of 2013 and 2014 trips (64 MB) or browse the unzipped datasets in the 2014 Divvy data folder.
- Ginormous CSV of all possible route shapes by Drew DePriest (using MapQuest Directions). Distances are in feet.
- Here is a MySQL version of the combined trip, station, and station capacity data from 2013 and 2014: divvy.tar.gz. Run
tar -xzf divvy.tar.gz
to unzip the file. The file was created withmysqldump
, see here for a description of how to load it into MySQL.
- Divvy occasionally moves stations (but station IDs persist). There isn't yet a list of station moves.
Contribute your stats by opening an issue or submitting a pull request.
This enhances the Divvy dataset by adding Google Maps-calculated distances between every station pair.
- CSV version: Divvy_Trips_and_Distances_2013.csv.zip
- SQL version: Divvy_Trips_and_Distances_2013.sql.zip.
- Hourly weather from Weather Underground via GT Wrobel
This "master file" contains a table, ready for import into a MySQL database, that combines Divvy’s year 2013 trip data (released February 11, 2014) with corrected station names, station IDs that match Divvy’s stations availability API, launch times, geographic coordinates, precalculated member age, and estimated bicycling distance between stations (see notes).
- Distances are in meters. The distance represents the Google Maps Bicycle directions distance for that station pair and no route was inspected before the algorithm’s route was accepted into the dataset. More about this distance data.
- Launch dates were corrected (there used to be a message here that said don't trust them)
- From Divvy's readme: "Trips that did not include a start or end date were removed from original table" is standard language Alta Bicycle Share uses across all data releases. In the case of Divvy's 2013 data, there were no trips without a start or end date removed from the original table. In general, these kinds of trips would occur from missing/stolen bikes.
Looking for more stats? Check here, here, and here.
69% of Divvy members are men, 31% are women, as of the end of 2013, and February 14, 2014. As of this writing, survey responses were not available. From Divvy.
1.8% of male workers 16 years and older primarily used a bicycle to get to work, while 0.7% of female workers 16 years and older did. Both have a margin of error of ±0.1%. Considering there were 625,289 male workers 16+ (MOE of ±4,264) and 588,612 female workers 16+ (MOE of ±4,113), that's a ratio of 73.20% to 26.80%. American Community Survey, 2008-2012, table S08010, via American FactFinder.
In 2013, 79.05% of member trips were by male members and 20.95% by female members. From my analysis of the trip data.
The pairwise distance was created by Nick Bennet (with assistance from Smart Chicago Collaborative), and the SQL query to merge them by Eric Van Zanten.
- Review pages on the Bike Sharing Data Hackpad: Divvy Data II and Chicago data & experiences
- Ask for it on Twitter with the #DivvyData hashtag.
- Post in the Divvy Data Google group.
- Come to Open Gov Hack Night on Tuesday nights at the Merchandise Mart
- Contact Steven Vance, repo creator if the above methods don't work out