Neural Network for selfie analysis

Posted to Statistics  |  Tags: ,  |  Nathan Yau

To introduce Convolutional Neural Networks, Andrej Karpathy looked at millions of selfies, left the computer to its own devices, and tried to find what makes a good selfie.

Okay, so we collected 2 million selfies, decided which ones are probably good or bad based on the number of likes they received (controlling for the number of followers), fed all of it to Caffe and trained a ConvNet. The ConvNet “looked” at every one of the 2 million selfies several tens of times, and tuned its filters in a way that best allows it to separate good selfies from bad ones. We can’t very easily inspect exactly what it found (it’s all jumbled up in 140 million numbers that together define the filters). However, we can set it loose on selfies that it has never seen before and try to understand what it’s doing by looking at which images it likes and which ones it does not.

Key tips: Use a filter and border, crop off your forehead, and most importantly, be a female. If male, broader selfies appear to be more ideal.

Naturally, there are questions to ask about cause-and-effect here. For example, in the neural network training, a selfie with a lot of likes qualifies as a good one. It might be that people with more followers tend more to a certain type of photo. Maybe they’re following a trend. Or maybe females take more selfies than men, which is why there are far more women that appear in the top.

In any case, that sort of misses the point. The point here is that neural networks can be fun and can output some interesting stuff. Be sure to scroll to the end of the article for resources and tools that you can play with.

Favorites

Most popular porn searches, by state

We’ve seen that we can learn from what people search for, through the eyes of Google suggestions: state stereotypes, national …

Reviving the Statistical Atlas of the United States with New Data

Due to budget cuts, there is no plan for an updated atlas. So I recreated the original 1870 Atlas using today’s publicly available data.

Top Brewery Road Trip, Routed Algorithmically

There are a lot of great craft breweries in the United States, but there is only so much time. This is the computed best way to get to the top rated breweries and how to maximize the beer tasting experience. Every journey begins with a single sip.

Shifting Incomes for American Jobs

For various occupations, the difference between the person who makes the most and the one who makes the least can be significant.