Data.gov.uk versus Data.gov – Which wins?

Posted to Data Sources  |  Nathan Yau

Back in May last year, the US government launched Data.gov as a statement of transparency, and the Internet rejoiced. After the launch, excitement about the actual site kind of fizzled, but big cities like San Francisco, New York, and Toronto got in on the open data party.

Then just a couple of weeks ago, Data.gov.uk launched, which brought me back to the US counterpart. How do the two compare? Here’s my take.

Behind the Application

The two applications are very similar on the surface. They catalog government data. Look a little closer though, and you’ll see that they’re actually really different in purpose, design, and end results. These key differences stem directly from who was involved in the creation of each. According to the Data.gov FAQ, it was developed by the Federal CIO Council, as in Chief Information Officers Council. Here’s their main role in government, which sounds a lot like information science:

“The CIO Council serves as the principal interagency forum for improving practices in the design, modernization, use, sharing, and performance of Federal Government agency information resources.”

It was a much more tech-oriented operation on the UK side, with Tim Berners-Lee and Nigel Shadbolt advising. Shadbolt is a professor of artificial intelligence at the University of Southampton, and Berners-Lee is credited with inventing the World Wide Web.

Winner: Toss up. While the computer science backgrounds can lead to good implementation, the information science crew took on organizing the many, many US agencies. However, the UK had the Web inventor. Yeah, gotta give the edge to the UK.


Now let’s get into the actual data. When Data.gov first launched there were only forty-something datasets, but the collection has since grown to several hundred. Data.gov.uk, on the other hand, started with a few hundred. It links directly to data files in various formats including CSV, XML, Excel, and KML. A lot seems to be lacking though. For example, there’s no basic demographic data like population from the Census Bureau. You’d think that’s where they’d start. Maybe the Open Government Directive, which instructs Executive departments to publish three “high-value” data sets, might help this along. Instead of hosting the data, the UK took a link catalog approach. To the end user this doesn’t make a huge difference. As long as you get the data, you’re good, but from a developer standpoint, it’s a lot easier to catalog links than files.

Winner: Data.gov.uk. A quick browse through the available data sets on both sites will show you the wider range of topics on Data.gov.uk.

Design and Usability

One looks like a government website. The other sports a more modern design. At launch, Data.gov didn’t look all that bad to me, but then I tried to use the site and find some data. The main flaw I saw was in the data browser. I wasn’t looking for a particular data set. I just wanted to browse, so I’d select an agency and search. The problem is that a lot of the listed agencies don’t have any data cataloged. Navigation through Data.gov.uk was very familiar, using common netspeak like tags and apps. I felt more engaged.

Winner: Data.gov.uk. Both could use a data preview though.

Projects

Other than the “join the dialogue” section on a separate domain and the pitch on the homepage, Data.gov makes little effort to highlight or promote any projects that use the data from the site. The focus is on a repository. What you do with the data doesn’t seem to matter much. At least Sunlight Labs is making an effort. Data.gov.uk lists recent apps and has an idea submission section. They clearly want you to use the data, with a developer-centric approach.

Winner: Data.gov.uk. It’s all about engaging with data over creating a catalog.

Bottom Line

While Data.gov.uk was just recently launched publicly, it has many advantages over Data.gov. It’s easier to use and geared towards developers, who, let’s face it, are the only ones who are going to do more with the data than open it up in Excel. Data.gov has some catching up to do. Both still have a long way to go. Both are good steps in the right direction.


