Useless Data Comparisons
It’s in our nature to compare things. What’s better? What’s worse? What falls in the middle? However, make sure you don’t end up in an apples and oranges situation where the comparisons don’t even make sense or are completely useless.
How weird that the states with large populations have a lot of cancer deaths. There must be something weird about living in areas with a lot of people that make cancer more prevalent. Therefore, I should live in North Dakota.
Okay, but you’re looking at absolute counts again. California has a large population, so there are more deaths there. When you compare the rates for each state, there’s a lot less variation. This is not to say that absolute counts are pointless. But if the data is directly related to group size, then you should account for that.
White people must be really fertile and have a lot of kids! Look how many more white women had a child than all the other groups. It’s not even close.
Okay, but the chart shows absolute counts. There are many more white people than the others, so there are inevitably more women who gave birth in total. But, if you look at rates — say, number of women who gave birth per 1,000 women in any given group — then white is the lowest and native Hawaiian is the highest.
I was a tiny baby when I was born. Now look at me! Extrapolate that. I’m going to be a giant, because when I’m double my age, my height and weight will also be double. Look out world. Here I come.
Okay, but. Well. Probably not. You can’t just extrapolate linearly like that. Height and weight levels off. You know that, right? Have you grown in height at all during the past decade? Maybe look at more data points in between birth and your current age.
The cost of everything is going up! Ground beef costs twice as much as it did in 1980. How am I going to make my mom’s famous omelettes when eggs prices are going through the roof? The world is crumbling to pieces. I want to cry.
Okay, but these average prices don’t account for inflation. You can’t compare monetary values over time without thinking about the changing value of a dollar. One dollar in 1980 is the value of about three dollars in 2017. So the ground beef price of $1.82 in 1980 dollars is $5.73 in 2017 dollars, meaning the price decreased.
I can jump higher than Michael Jordan in his prime, I am taller than the average professional basketball player, and I shoot better than most. SHOW ME THE MONEY.
Okay, but players don’t get ladders or stools during basketball games. I mean, seeing as you’re interested in becoming an NBA player, you’ve seen at least one game, right?
Apples are often red. Oranges are often orange. Different!
Okay, but you’re comparing apples and oranges. Oh wait. This comparison actually works in this case. Never mind.
Become a member. Learn to visualize data. From beginner to advanced.Join Today
This is for people interested in the process of creating, designing, and exploring data graphics. Your support goes directly to FlowingData, an independently run site.
What You Get
- Instant access to tutorials on how to make and design data graphics
- Source code and files to use with your own data
- In-depth courses on visualization in R
- Hand-picked links and resources from around the web
Projects by Nathan Yau See All →
Think Like a Statistician – Without the Math
I call myself a statistician, because, well, I’m a statistics graduate student. However, the most important things I’ve learned are less formal, but have proven extremely useful when working/playing with data.
Life expectancy changes
The data goes back to 1960 and up to the most current estimates for 2009. Each line represents a country.
Unemployment in America, Mapped Over Time
Watch the regional changes across the country from 1990 to 2016.