Posted to

Statistics

How well we don’t understand probability

All Things Considered on NPR ran a fine series on how we interpret probability and uncertainty. It came in five bits (plus one follow-up), each…

A more visual world data portal

One of the most annoying parts of downloading data from large portals is that you never quite know what you're gonna get. It's a box…

Large-ish data packages in R

If you've played around with R enough, there comes a time when you just need some data to mess around with. Maybe it's to learn…

Polling for stress

NPR, the Robert Wood Johnson Foundation and the Harvard School of Public Health conducted a survey about peoples' stress levels and factors contributing to the…

Data science, big data, and statistics – all together now

Terry Speed, a emeritus professor in statistics at University of California at Berkeley, gave an excellent talk on how statisticians can play nice with…

Subpar Captain America

Animation Domination High-Def has a Captain America video of things that America is not so good at, relative to other countries. And they even cited…

Test your statistical wits about stuff in the world

Many of us aren't aware of how one country compares to others or public policy that has been around for decades. How Wrong You Are…

Lessons from improperly anonymized taxi logs

Through a Freedom of Information request Chris Whong received and eventually released NYC taxi logs starting in 2013 (about 173 million trips). Vijay Pandurangan looked…

Data grab bag

— When you deal with data, you can think like a statistician, even if you don't know the math (although it will certainly help a…

How to Make Government Data Sites Better

Accessing government data from the source is frustrating. If you've done it, or at least tried to, you know the pain that is oddly formatted…

What a few thousand fake followers gets you

There are a lot of fake, spammy accounts on Twitter that come in a variety of forms. Some tweet links to junk, some serve as…

GDP rises in the UK after spending on illegal activities counted

The gross domestic product for the United Kingdom rose by 5% seemingly overnight, after spending on cocaine and prostitution was (roughly) accounted for. Naturally there's…

What pregnant women want

In another take on the game of what Google suggests while searching, Seth Stephens-Davidowitz for The New York Times looked at queries related to pregnant…

Strava Metro aims to help cities improve biking routes

Last month, Strava, which allows users to track their bike rides and runs, launched an interactive map that shows where people move worldwide. That seems…

Machine learning a cappella on overfitting

From the machine learning course on Udacity, an a cappella group sings a Thriller parody on overfitting. At first you're like, "Is this real? Am…

A majority of your email in Gmail, even if you don’t use it

For reasons of autonomy, control, and privacy, Benjamin Mako Hill runs his own email server. After a closer look though, he realized that much of…

Newborn false positives

Shutterfly sent promotional emails that congratulate new parents and encourage them to send thank you cards. The problem: a lot of people on that list…

Random things that correlate

This is fun. Tyler Vigen wrote a program that attempts to automatically find things that correlate. As of writing this, 4,000 correlations were found so…

Type I and II errors simplified

"Type I" and "Type II" errors, names first given by Jerzy Neyman and Egon Pearson to describe rejecting a null hypothesis when it's true and…

Naked Statistics

Naked Statistics by Charles Wheelan promises a fun, non-boring introduction to statistics that doesn't leave you drifting off into space, thinking about anything that is…

Most underrated films

Ben Moore was curious about overrated and underrated films. "Overrated" and "underrated" are slippery terms to try to quantify. An interesting way of looking at…

Hip hop vocabulary compared between artists

Matt Daniels compared rappers' vocabularies to find out who knows the most words. Literary elites love to rep Shakespeare's vocabulary: across his entire corpus, he…