Graphwise: Crawling the Web for Tabulated Data

Posted to Apps  |  Nathan Yau

Graphwise LogoGraphwise launched a few weeks ago, but I’m just hearing about it now, so I guess there hasn’t been a whole lot of buzz about this new application.

The Graphwise group has got a spider crawling the Web for data in HTML data tables and as a result, has accumulated a pretty big data warehouse. There’s currently 2,766,560 extracted tables in the Graphwise database. That’s pretty good, and I think they’re building on a pretty good idea. However, Graphwise advertises itself as three pieces of a three-piece puzzle — get data, visualize, and share.

To say the least, the visualize and share portions need work. Here’s a visualization from the front page:

Graphwise Graph Example

I…I…don’t know what to say. Why the 3-d bars with the gradient background and the giant, semi-transparent Earth in the foreground blocking everything? It makes me want to throw up. It seriously looks like someone threw up data on the screen — data vomit. The javascript-enabled graphs seem to be making the browsing experience pretty sluggish too.

Am I being too harsh? My conscious is yelling at me for calling the graphs regurgitated food.

OK, OK. So to sum things up — the data warehousing and Web crawling are great. The spiders are clearly doing their job, so thumbs up for that. As for the visualizations, I, well, uh, it needs work (along with all the other junk that comes with running these types of data-centric applications).

[via Swivel]

2 Comments

Favorites

Life expectancy changes

The data goes back to 1960 and up to the most current estimates for 2009. Each line represents a country.

Think Like a Statistician – Without the Math

I call myself a statistician, because, well, I’m a statistics graduate student. However, the most important things I’ve learned are less formal, but have proven extremely useful when working/playing with data.

Years You Have Left to Live, Probably

The individual data points of life are much less predictable than the average. Here’s a simulation that shows you how much time is left on the clock.

Reviving the Statistical Atlas of the United States with New Data

Due to budget cuts, there is no plan for an updated atlas. So I recreated the original 1870 Atlas using today’s publicly available data.