Lies people tell in online dating

Posted to Statistics  |  Nathan Yau

Online dating site OkCupid continues with amusing yet thorough analysis of their 1.51 million users. This time around, they cover the lies people tell:

People do everything they can in their OkCupid profiles to make themselves seem awesome, and surely many of our users genuinely are. But it’s very hard for the casual browser to tell truth from fiction. With our behind-the-scenes perspective, we’re able to shed some light on some typical claims and the likely realities behind them.

Among the findings:

  • People exaggerate their height by about two inches.
  • If someone says they make $100k per year, they probably mean $80k.
  • The more attractive a picture, the older it is.
  • Most self-identified bisexuals (80%) only like one gender.

Buyer beware.


  • John Morrow August 5, 2010 at 7:02 pm

    While I don’t doubt the exaggeration of height and salary on dating sites, I honestly dont know what the “average” distribution looks like for height in the US. Where did they get the source of this distribution?

  • Also, it is not reasonable to assume that OkCupid men are a random sample of U.S. men.

    – They probably are younger than the total population. Later generations are taller on average than older ones. Thus people on dating sites probably actually are taller.
    – Poor people use the internet less or not at all. So you can expect the users of a dating site to have higher incomes.

    How many more differences can you think of. It’s easy to find a dozen.

    • True, but with a sample size of 1.5 million, there’s some credibility to the argument.

      • Unfortunately not. If the sampling procedure is systematically biased as it is here (in the form of self selection and thus oversampling of specific groups), the sample size does not change anything at all.

        It //would// decreases the size of your confidence interval if it was a random sample, sure. But it is not and you simply cannot compare the two data sets unless you use some form of matching procedure.

    • Smack: right. And there could be basic differences between the general population and single people, too. E.g.: Are wealthy people more likely to be single in their 50’s? I wouldn’t expect that to be a big effect, but I wouldn’t be surprised if it’s there.

  • Pingback: dating


Top Brewery Road Trip, Routed Algorithmically

There are a lot of great craft breweries in the United States, but there is only so much time. This is the computed best way to get to the top rated breweries and how to maximize the beer tasting experience. Every journey begins with a single sip.

The Changing American Diet

See what we ate on an average day, for the past several decades.

Watching the growth of Walmart – now with 100% more Sam’s Club

The ever so popular Walmart growth map gets an update, and yes, it still looks like a wildfire. Sam’s Club follows soon after, although not nearly as vigorously.

One Dataset, Visualized 25 Ways

“Let the data speak” they say. But what happens when the data rambles on and on?