Deep fried data

Oct 11, 2016

Maciej Ceglowski, the creator of bookmarking site Pinboard, spoke at the Library of Congress a couple of weeks ago about deep-fried data.

In our case, the deep-fryer is a toolbox of statistical techniques. The names keep changing—it used to be unsupervised learning, now it’s called big data or deep learning or AI. Next year it will be called something else. But the core ideas don’t change. You train a computer on lots of data, and it learns to recognize structure.

These techniques are effective, but the fact that the same generic approach works across a wide range of domains should make you suspicious about how much insight it’s adding.

And in any deep frying situation, a good question to ask is: what is this stuff being fried in?

Deep fried food is often delicious, regardless of what’s inside the golden breading. But deep inside — or really right there in the front of our minds — we know it’s not good for us. Moderation. Or at least make sure you go to the good food stand at the carnival that uses the clean oil.

See the talk and many others here.

Favorites

Divorce Rates for Different Groups

We know when people usually get married. We know who never marries. Finally, it’s time to look at the other side: divorce and remarriage.

How You Will Die

So far we’ve seen when you will die and how other people tend to die. Now let’s put the two together to see how and when you will die, given your sex, race, and age.

Life expectancy changes

The data goes back to 1960 and up to the most current estimates for 2009. Each line represents a country.

The Most Unisex Names in US History

Moving on from the most trendy names in US history, …