Big data, same statistical challenges

April 4, 2014

Topic

Statistics / big data, Financial Times

Tim Harford, writing for the Financial Times, on big data and how the problems that affect small data still apply:

The multiple-comparisons problem arises when a researcher looks at many possible patterns. Consider a randomised trial in which vitamins are given to some primary schoolchildren and placebos are given to others. Do the vitamins work? That all depends on what we mean by “work”. The researchers could look at the children’s height, weight, prevalence of tooth decay, classroom behaviour, test scores, even (after waiting) prison record or earnings at the age of 25. Then there are combinations to check: do the vitamins have an effect on the poorer kids, the richer kids, the boys, the girls? Test enough different correlations and fluke results will drown out the real discoveries.

When ‘big data’ is in the title, you’re usually in for a fluffy article about drowning in data and social media. This one is worth the full read.
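To make the multiple-comparisons point concrete, here is a minimal sketch, not anything from Harford's article. It simulates a vitamin trial in which the vitamins do nothing at all, then tests many outcomes anyway. The 20 outcomes, 200 children per group, the 0.05 threshold, and every function name are assumptions chosen for illustration. The test is a simple normal approximation to a two-sample comparison of means.

```python
import math
import random
import statistics


def familywise_error_rate(n_tests, alpha=0.05):
    """Chance of at least one false positive across n_tests independent
    tests when no real effect exists anywhere."""
    return 1 - (1 - alpha) ** n_tests


def two_sided_p(z):
    """Two-sided p-value for a z statistic under the standard normal."""
    return math.erfc(abs(z) / math.sqrt(2))


def count_flukes(n_outcomes=20, n_children=200, alpha=0.05, seed=0):
    """Simulate a trial where the vitamin has NO effect on any outcome,
    and count how many outcomes still look 'significant'.

    Returns (uncorrected_hits, bonferroni_hits).
    """
    rng = random.Random(seed)
    uncorrected = 0
    corrected = 0
    for _ in range(n_outcomes):
        # Both groups are drawn from the same distribution: no real effect.
        vitamin = [rng.gauss(0, 1) for _ in range(n_children)]
        placebo = [rng.gauss(0, 1) for _ in range(n_children)]
        diff = statistics.mean(vitamin) - statistics.mean(placebo)
        se = math.sqrt(
            statistics.variance(vitamin) / n_children
            + statistics.variance(placebo) / n_children
        )
        p = two_sided_p(diff / se)
        if p < alpha:
            uncorrected += 1
        # Bonferroni: divide the threshold by the number of tests.
        if p < alpha / n_outcomes:
            corrected += 1
    return uncorrected, corrected


if __name__ == "__main__":
    print(f"P(at least one fluke in 20 tests): {familywise_error_rate(20):.2f}")
    print("flukes (uncorrected, Bonferroni):", count_flukes())
```

With 20 independent tests at the 0.05 level, the chance of at least one fluke is 1 − 0.95^20, or about 0.64, even though each individual test looks well behaved. The Bonferroni correction shown here is one common and conservative fix. It is not the only one, and the outcomes in a real trial are rarely independent.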

Projects by FlowingData See All →

Percentage of People Who Married, Given Your Age

Or, given your age, the percentage of fish left in the sea. Here’s a chart.

McDonald’s Locations vs. Golf Courses

There are thousands of McDonald’s locations, but there are still more golf courses in the United States. This seems surprising, but some maps make it clear.

Data Underload #8 – Unsolicited

A few months back, the Caltrans Performance Measurement System (PeMS) …

Daily Routine, 2020

After looking at how much time we spent on daily activities in 2020, let’s look at when we spent our time.
