Twitter data buffet is back in business

Posted to Data Sources  |  Nathan Yau

Almost a year and a half ago, Infochimps, the data repository slash marketplace, released a giant scrape of Twitter data representing 2.7 million users, 10 million tweets, and 58 million connections. Twitter soon requested that they take it down while they figured out how they wanted to handle licensing, privacy, etc.

That was in 2008, before Twitter really started booming. Fast forward to now. Twitter and Infochimps have figured out what they want to do, and the Twitter census data is back up. It’s no longer a measly 2.7 million users anymore though. The population has grown to 35 million.

This time around, instead of one big data dump, Infochimps provides large datasets for several metrics. Some are free. Some are not. Since there’s no easy way to split up free from non-free or sort by price on Infochimps, I’ve saved you the trouble and separated it for you.

Here are the free ones:

These will cost you, ranging from $20 to all the way up to $800. Generally speaking, the free data is a subset of the paid data.

So there you go. No more wasting time trying to get crafty with the Twitter API limits. It’s all there at your disposal. Now what are you going to do with it?

1 Comment

  • Very interesting stuff! The one thing I’m really interested in learning about Twitter users is browser stats: what browser they’re using, what their screen resolutions are, etc.. Is this data out there anywhere?

Favorites

How You Will Die

So far we’ve seen when you will die and how other people tend to die. Now let’s put the two together to see how and when you will die, given your sex, race, and age.

Real Chart Rules to Follow

There are rules—usually for specific chart types meant to be read in a specific way—that you shouldn’t break. When they are, everyone loses. This is that small handful.

Top Brewery Road Trip, Routed Algorithmically

There are a lot of great craft breweries in the United States, but there is only so much time. This is the computed best way to get to the top rated breweries and how to maximize the beer tasting experience. Every journey begins with a single sip.

Think Like a Statistician – Without the Math

I call myself a statistician, because, well, I’m a statistics graduate student. However, the most important things I’ve learned are less formal, but have proven extremely useful when working/playing with data.