Metadata surveillance investigation

Posted to Statistics  |  Nathan Yau

Metadata can tell you a lot, and most of us agree that it’s not “just metadata” at this point. The Share Lab shows what one can find using just everyday tools and relatively straightforward analysis.

Although our investigation primarily discovered relations, patterns and anomalies of someone’s work life, it still gave us an insight into that person’s habits that border with private life. In the end, metadata scans someone’s behaviors on a much deeper level than traditional surveillance practice related to content could ever do.

The graphic above shows how people in the sample dataset emailed one another. There’s no email content, but the headers provide enough information to sniff out connections.
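To make the headers-only idea concrete, here is a minimal sketch of how one might count who emails whom from the From, To, and Cc fields alone. It is not the Share Lab's actual method: the function name `connection_counts`, the `SAMPLE` messages, and the example.com addresses are all made up for illustration, and it uses only Python's standard `email` module.

```python
from collections import Counter
from email import message_from_string
from email.utils import getaddresses

def connection_counts(raw_messages):
    """Count (sender, recipient) pairs using only the From, To and Cc headers."""
    counts = Counter()
    for raw in raw_messages:
        msg = message_from_string(raw)
        # getaddresses() turns header strings into (display name, address) pairs.
        senders = [addr.lower() for _, addr in getaddresses(msg.get_all("From", []))]
        recipients = [
            addr.lower()
            for _, addr in getaddresses(msg.get_all("To", []) + msg.get_all("Cc", []))
        ]
        for sender in senders:
            for recipient in recipients:
                if sender and recipient:
                    counts[(sender, recipient)] += 1
    return counts

# Hypothetical messages; the bodies are never inspected.
SAMPLE = [
    "From: Alice <alice@example.com>\nTo: bob@example.com\nSubject: a\n\nignored",
    "From: alice@example.com\nTo: bob@example.com\nCc: carol@example.com\n\nignored",
    "From: Bob <bob@example.com>\nTo: alice@example.com\n\nignored",
]

edges = connection_counts(SAMPLE)
for (sender, recipient), n in sorted(edges.items()):
    print(f"{sender} -> {recipient}: {n}")
```

The resulting pair counts are the raw material for a network graph: each address becomes a node, and each count becomes the weight of a directed edge.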

See also: the search for Paul Revere with network analysis.
