Summary Statistics Tell You Little About the Big Picture

By Nathan Yau  /  Posted to Guides

Mean, median, and mode. These are the first things you learn about in an introductory statistics course. They’re often all you hear about when you see data in the news. People form policies for populations based on these generalized numbers.

However, these summary statistics can only tell you so much about a dataset, which means you can only learn a limited amount about what the data represents — the people, places, and things.

If you’re the one who consumes the data, you should wonder what the means and medians actually represent. If you’re the one who analyzes the data, spend time with the most granular data that time and resources allow. Something more interesting will almost always come out of it.
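To make that concrete, here is a minimal sketch in Python. The two lists are made-up numbers, not data from any real source, and `summarize` is a hypothetical helper named for this example. Both lists share the same mean and median, yet one is tightly clustered and the other is split into two far-apart groups.

```python
from statistics import mean, median

# Hypothetical values, invented purely for illustration.
clustered = [48, 49, 50, 50, 50, 51, 52]   # everything sits near the middle
split = [10, 10, 10, 50, 90, 90, 90]       # two groups far from the middle


def summarize(values):
    """Return (mean, median) for a list of numbers."""
    return mean(values), median(values)


for name, values in [("clustered", clustered), ("split", split)]:
    avg, mid = summarize(values)
    spread = max(values) - min(values)
    print(f"{name}: mean={avg}, median={mid}, range={spread}")
```

The mean and median match across the two lists, so on their own they cannot tell you which kind of dataset you are looking at. Only the individual values, or a chart of them, show the difference.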

Chart Type Used

Histogram

It looks like a bar chart, but it reads differently. The baseline is a continuous scale rather than a set of discrete categories, which lets you see how the values are distributed.
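As a rough sketch of what a histogram does under the hood, the Python below counts values into equal-width intervals along a continuous baseline and prints a text version of the chart. The values are made up for illustration, and `histogram_counts` is a hypothetical helper, not the code behind any chart on this site. It assumes whole-number values and a whole-number bin width.

```python
def histogram_counts(values, bin_width):
    """Count integer values into equal-width bins keyed by left edge.

    Empty bins between the smallest and largest value are kept, because
    the baseline of a histogram is continuous and gaps carry information.
    """
    lowest_edge = (min(values) // bin_width) * bin_width
    highest_edge = (max(values) // bin_width) * bin_width
    counts = {edge: 0 for edge in range(lowest_edge, highest_edge + bin_width, bin_width)}
    for value in values:
        counts[(value // bin_width) * bin_width] += 1
    return counts


# Hypothetical values, invented purely for illustration.
values = [12, 14, 15, 17, 18, 19, 52, 55, 81, 83, 84, 86, 88]

for edge, count in histogram_counts(values, 10).items():
    # Each row covers the interval from edge up to, but not including, edge + 10.
    print(f"{edge:>3} to {edge + 10:<3} {'#' * count}")
```

The empty rows in the middle are the point. A bar chart of categories would simply leave them out, while a histogram keeps them, so the gap between the two clusters stays visible.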
