Data bias at every step

Posted to Design  |  Nathan Yau

Lena Groeger, writing for ProPublica, describes how the designer shows up in the design, not just in the visualization but also in the collection, selection, and aggregation of data. Our perspective always comes into play.

The effects may be subtle, but if we pour so much of ourselves into the stories we tell, the data we gather, the visuals we design, the webpages we build, then we should take responsibility for them. And that means not just accepting the limits of our own perspective, but actively seeking out people who can bring in new ones.

It’s common to think of data and analysis as unbiased fact. Concrete. You can’t argue with numbers. However, that’s rarely the case. We analyze and visualize with preconceptions, and that drives many aspects of whatever comes next.

Analysis is a process driven by experience. Technically, this means learning new methods as you look at various data types and situations. Contextually, this means forming conclusions based on what you know about the subject matter. If there are knowledge gaps technically or contextually, you run into issues.

