Statistics
More than mean, median, and mode.
-
Myths of big data
Microsoft researcher Kate Crawford describes several myths of big data. Myth #4: It…
-
Link
ScraperWiki
Easily scrape tweets and download them as a spreadsheet with ScraperWiki.
-
Medicare provider charge data released
The Centers for Medicare and Medicaid Services released billing data for more than…
-
Convergence of Miss Korea faces
After seeing a Reddit post on the convergence of Miss Korea faces, supposedly…
-
Length of the average dissertation
On R is My Friend, as a way to procrastinate on his own…
-
Link
Path Social Networking App Settles FTC Charges
Path, a social networking app that lets you track and share personal information about yourself, settled with the FTC for $800k. The company allegedly collected children’s personal information without their parents’ consent. The news is from February, but it is still important to know.
-
The Numbers Game on National Geographic
Jake Porway, the founder of DataKind, has a new show on the National…
-
Link
What your zip code reveals about you
What your zip code reveals about you. More on data brokers and how advertisers and businesses use your information to sell you stuff.
-
Link
500K degree
A prediction of the cost of a college degree in 2030. Interesting, although the model is simplistic: it assumes tuition will never level off, even for a little while, and once no one can afford tuition, something is going to change. Plus the Internet is changing things.
-
Link
Looking for a data scientist?
Looking for a data scientist? Here’s what you should look for skills-wise.
-
Problematic databases used to track employee theft
Employee theft accounts for billions of dollars of lost merchandise per year, so…
-
Link
Yelp Dataset Challenge
Yelp is putting up over 200,000 reviews and offering ten $5,000 awards for students who want to make use of their data.
-
How to become a password cracker in a day
Deputy editor at Ars Technica Nate Anderson was curious if he could learn…
-
Odds of a perfect NCAA March Madness bracket
Math professor Jeff Bergen explains the odds of picking a perfect bracket.…
-
Declining songwriter ratings with age
Do singer-songwriters age well like a fine wine, or does quality decline with…
-
Link
John Snow’s Cholera data in more formats
John Snow’s Cholera data in more formats. Includes death locations, pump locations, the original map, and Ordnance Survey maps. This could be useful for a class or if you want to kick the tires on some mapping software.
-
Link
Nate Silver Discusses Data Bias, Strangeness of Fame
-
Data hackathon challenges and why questions are important
Jake Porway, executive director of DataKind, on data hackathons and why they require…
-
What data brokers know about you
Lois Beckett for ProPublica has a thorough piece on data brokers — companies…
-
Link
Quandl
Quandl is a search engine for time series data. It is similar to DataMarket, but probably with a more straightforward download process.