Taking over an old New York Times project, ProPublica re-launches Represent, which offers…
Statistics
More than mean, median, and mode.
-
Track what your government representatives are doing for you
-
The Guardian analyzes 70m comments, unearthing online abuse
Online comments are an odd entity that can get out of hand quickly,…
-
Treating visualization as a process
Many people think of visualization as a plug-in tool that spits out something…
-
Stephen Curry statistical dominance
Robert O’Connell for the Atlantic ponders basketball analytics and the rise of Stephen…
-
Link
What we’ve learned about sharing our data analysis →
“If an article includes our own calculations…then you should be able to see—and potentially criticize—how we did it.”
-
Moving to the “worst” place in America
In 1999, the Department of Agriculture published a Natural Amenities Scale that took…
-
Data scientists mostly just do arithmetic
Noah Lorang, a data scientist at Basecamp, explains the key for most companies…
-
Emergency room data in R
For my graphic on emergency room visits over time and the other on…
-
Math of crime and terrorism
Numberphile, from the Mathematical Sciences Research Institute, is one my new favorite YouTube…
-
Predictive policing
Crime and data have an old history together, but because there are new…
-
Campaign Finance API moves to ProPublica
Back in 2008, the New York Times rolled out a campaign finance API…
-
Catalog of criminal justice data
There’s a lot of data on criminal justice — prison populations, crime rates,…
-
Link
Yahoo News feed dataset for researchers →
A big ol’ dataset on interaction with the list of news items on the homepage.
-
Game: Guess the correlation
Guess the Correlation is a straightforward game where you do just that, and…
-
Playing with fonts using neural networks
Erik Bernhardsson downloaded 50,000 fonts and then threw them to the neural networks…
-
Missing 11th of the month
David Hagan looked closer at why the 11th of the month appeared to…
-
Kaggle Datasets for a place to converge on public data
Kaggle just opened up a Datasets section to download and analyze public data.…
-
Nerdy Powerball FAQ
The Powerball FAQ was most likely written by a slightly annoyed statistician. You’d…
-
Data on people who went to ER for wall-punching
Keith Collins for Quartz ran some quick numbers for people who visited the…
-
NYPL public domain data
The New York Public Library just made over 180,000 digital items in the…