Mapping GitHub – a network of collaborative coders

Posted to Network Visualization  |  Tags:  |  Nathan Yau

GitHub is a large community where coders can collaborate on software development projects. People check code in and out, make edits, etc. Franck Cuny maps this community (with Gephi), based on information in thousands of user profiles.

The above is a map colored and sorted by the main language of each person (PHP, Python, Perl, Javascript, or Ruby).

Cuny then looks at the structure within the coding networks, which is the most interesting part of the project. The Python map, for example, shows several projects, with Django in the dominant center.

In contrast, the PHP map is a lot more segregated.

I do wish there were some labels for the clusters so that I knew what exactly I was looking at, but if you like, you can download the the files (bottom of post) and explore them in Gephi yourself. See the rest of the graphs over on Flickr.

[Thanks, Steven]

13 Comments

Favorites

The Best Data Visualization Projects of 2011

I almost didn’t make a best-of list this year, but as I clicked through the year’s post, it was hard …

Years You Have Left to Live, Probably

The individual data points of life are much less predictable than the average. Here’s a simulation that shows you how much time is left on the clock.

Top Brewery Road Trip, Routed Algorithmically

There are a lot of great craft breweries in the United States, but there is only so much time. This is the computed best way to get to the top rated breweries and how to maximize the beer tasting experience. Every journey begins with a single sip.

The Best Data Visualization Projects of 2014

It’s always tough to pick my favorite visualization projects. Nevertheless, I gave it a go.