Visualizing Yahoo email in real-time

Posted to Visualization  |  Nathan Yau

Hundreds of thousands of emails are sent every second, and yet you wouldn’t really know it, because there aren’t public-facing streams like Twitter’s. Outside your own inbox, how much email is there exactly? Yahoo, in collaboration with information visualization firm Periscopic, shows how much email it processes in real time with this interactive feature.

The initial view is a world map, where scaled bubbles represent how many emails are currently being sent. Hover over continents for the geographic distribution of users and gigabytes sent.

There are also trending topics from anonymized subject headers, shown as a streamgraph. The view is interesting because you can click on a section to split the surrounding streams, which gives you a sense of distribution along with details per keyword. The keyword data, however, isn’t all that interesting for the most part. You’ll see keywords such as online, free, and nights. Not too meaningful. There are a few exceptions though, like Oprah and wars.

There is also an option to include spam keywords, which turn out to be equally generic.

Finally, if you go back to the map and keep on clicking, you eventually get to some fun facts about email, such as the claim that there are over a sextillion ways to spell Viagra.

All in all, it’s a comprehensive view of how much email Yahoo handles that’s fun to poke around. Turn on your speakers for playful sound effects.

[Visualizing Yahoo! Mail]

3 Comments

  • There can’t possibly be that many (10^21) misspellings of “viagra”. There are 26^6 6-letter “words” that can be made from a 26-letter alphabet, which is less than 10^9. It’d take a 15-letter word (26^15) or a roughly 3200-character alphabet (3200^6) to be able to produce 10^21 different “words.” Somewhere in the middle, a 150-character alphabet with 10-character words could do it.

    And even then, the vast majority of those strings won’t resemble “viagra” at all…
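The counting argument in this comment is easy to check with exact integer arithmetic. The following is only a sketch: the function names are made up for illustration, and it assumes, as the comment does, that a “spelling” is any fixed-length string over a given alphabet.

```python
# Sketch: check the comment's counting argument with exact integers.
# All names here are illustrative, not from the Yahoo/Periscopic feature.

TARGET = 10**21  # one sextillion


def count_strings(alphabet_size: int, length: int) -> int:
    """Number of distinct strings of exactly `length` symbols."""
    return alphabet_size ** length


def min_length(alphabet_size: int, target: int = TARGET) -> int:
    """Smallest string length whose count reaches `target`."""
    length = 1
    while count_strings(alphabet_size, length) < target:
        length += 1
    return length


def min_alphabet(length: int, target: int = TARGET) -> int:
    """Smallest alphabet size whose count reaches `target` at this length."""
    size = 1
    while count_strings(size, length) < target:
        size += 1
    return size


if __name__ == "__main__":
    print(count_strings(26, 6))              # six letters, 26-letter alphabet
    print(min_length(26))                    # letters needed with 26 symbols
    print(min_alphabet(6))                   # symbols needed with six letters
    print(count_strings(150, 10) >= TARGET)  # the "middle" case
```

Run as a script, this confirms the order of magnitude: six letters over a 26-letter alphabet fall far short of a sextillion, 15 letters are enough, and the six-letter case needs an alphabet of a little over 3,000 symbols.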
