Citi Bike, also known as NYC Bike Share, is releasing monthly data dumps…
Data Sources
Have fun and play with some numbers.
-
Bike share data in New York, animated
-
ProPublica opened a data store
One of the main challenges of any data project is getting the data.…
-
Texting data to save lives
Remember that TED talk from a couple of years ago on texting patterns…
-
Cancer data for the U.S. released
The Centers for Disease Control and Prevention released their most recent cancer data…
-
Government data shutdown
When you go to the United States Census site, Data.gov, or similar government-run…
-
Data.gov revamp
After budget cuts a couple of years ago, I assumed Data.gov was all…
-
Link
A big collection of sites and services for accessing data
Andy Kirk put together a big collection of sites and services for accessing data. It’s essentially a big ol’ data dump.
-
Link
ScraperWiki
Easily scrape tweets and download them as a spreadsheet with ScraperWiki.
-
Medicare provider charge data released
The Centers for Medicare and Medicaid Services released billing data for more than…
-
Link
Yelp Dataset Challenge
Yelp is putting up over 200,000 reviews and offering ten $5,000 awards for students who want to make use of their data.
-
Link
John Snow’s Cholera data in more formats
John Snow’s Cholera data in more formats. Includes death locations, pump locations, the original map, and Ordinance Survey maps. This could be useful for a class or if you want to kick the tires on some mapping software.
-
Link
Quandl
Quandl is a search engine for time series data. Similar to DataMarket, but probably with more straightforward download.
-
Link
General Social Survey
The General Social Survey has been running since 1972, and many questions have remain unchanged to make comparisons possible. The data from then to 2012 can now be downloaded in a variety of formats. [via]
-
Link
Three Decades of Decennial Data
The United States Census Bureau just made more data available via their API. You can now access decennial data for 1990, 2000, and 2010. The API isn’t especially advanced, but it’s a heck of a lot better than PDF tables.
-
Link
Cat Dataset →
2 gigs of cat data with images and eye, mouth, and ear positions. Yeah.
-
Archive of datasets bundled with R
R comes with a lot of datasets, some with the core distribution and…
-
Data on decades of Boy Scout expulsions released
The Los Angeles Times released nearly 5,000 records of allegations from the Boy…
-
Losing American Community Survey would be ‘disastrous’
Many want to get rid of the American Community Survey, a Census program…
-
A Future Without Key Social and Economic Statistics for the Country
Robert Groves, director of the U.S. Census Bureau, on the Appropriations Bill:
The… -
CNN transcript collection, 2000-2012
Thanks to the Internet Archive and CNN, thirteen years of transcripts, about a…