Data bias at every step

Posted to Design  |  Nathan Yau

Lena Groeger for ProPublica describes how the designer shows up in the design, not just in the visualization but also in collection, selection, and aggregation. Our perspective always comes into play.

The effects may be subtle, but if we pour so much of ourselves into the stories we tell, the data we gather, the visuals we design, the webpages we build, then we should take responsibility for them. And that means not just accepting the limits of our own perspective, but actively seeking out people who can bring in new ones.

It’s common to think of data and analysis as unbiased fact. Concrete. You can’t argue with numbers. However, that’s rarely the case. We analyze and visualize with preconceptions, and that drives many aspects of whatever comes next.

Analysis is a process driven by experience. Technically, this means learning new methods as you look at various data types and situations. Contextually, this means forming conclusions based on what you know about the subject matter. If you have gaps in either technical or contextual knowledge, you run into issues.
