When data gets creepy

Posted to Data Sharing  |  Tags: ,  |  Nathan Yau

You make and publish bits of data about yourself, intentionally and unintentionally, and it goes to the indexed public web or to companies’ private black boxes. Ben Goldacre explains why it’s worth caring about these traces. It’s less hoorah and more example-driven than these sort of articles tend to be, and there’s isn’t a single mention of being awash in data.

At the simplest level, even the act of putting lots of data in one place — and making it searchable — can change its accessibility. As a doctor, I have been to the house of a newspaper hoarder; as a researcher, I have been to the British Library newspaper archive. The difference between the two is not the amount of information, but rather the index. I recently found myself in the quiet coach on a train, near a stranger shouting into her phone. Between London and York she shared her (unusual) name, her plan to move jobs, her plan to steal a client list, and her wish that she’d snogged her boss. Her entire sense of privacy was predicated on an outdated model: none of what she said had any special interest to the people in coach H. One tweet with her name in would have changed that, and been searchable for ever.

Before you say you’re not the woman on the phone and that you have nothing to hide, also read this.

Favorites

Unemployment in America, Mapped Over Time

Watch the regional changes across the country from 1990 to 2016.

Most popular porn searches, by state

We’ve seen that we can learn from what people search for, through the eyes of Google suggestions: state stereotypes, national …

Marrying Age

People get married at various ages, but there are definite trends that vary across demographic groups. What do these trends look like?

Life expectancy changes

The data goes back to 1960 and up to the most current estimates for 2009. Each line represents a country.