Why Does Data Matter to Google?

Posted to Miscellaneous  |  Nathan Yau

Data is absolutely vital to Google’s success; without data, Google is pretty much useless when it comes to search. Hal Varian explains on the official Google blog:

Over the years, Google has continued to invest in making search better. Our information retrieval experts have added more than 200 additional signals to the algorithms that determine the relevance of websites to a user’s query.

So where did those other 200 signals come from? What’s the next stage of search, and what do we need to do to find even more relevant information online?

What an interesting question. I wonder what the answer is. Oh, here it is:

Storing and analyzing logs of user searches is how Google’s algorithm learns to give you more useful results. Just as data availability has driven progress of search in the past, the data in our search logs will certainly be a critical component of future breakthroughs.

Cashing In On Data

That’s right. Without data, who knows where search could be now. AOL might still be prosperous. There’s also this funny bit about how Larry and Sergey initially tried to license their algorithm to new, already existing search engines, but no one bit, and so they made their own. You gotta respect the data!

For more on the importance of data, you might also be interested in the ever-going series on FlowingData on why data matters.

Favorites

Divorce Rates for Different Groups

We know when people usually get married. We know who never marries. Finally, it’s time to look at the other side: divorce and remarriage.

Real Chart Rules to Follow

There are rules—usually for specific chart types meant to be read in a specific way—that you shouldn’t break. When they are, everyone loses. This is that small handful.

Life expectancy changes

The data goes back to 1960 and up to the most current estimates for 2009. Each line represents a country.

Years You Have Left to Live, Probably

The individual data points of life are much less predictable than the average. Here’s a simulation that shows you how much time is left on the clock.