Mapping GitHub – a network of collaborative coders

Posted to Network Visualization  |  Tags:  |  Nathan Yau

GitHub is a large community where coders can collaborate on software development projects. People check code in and out, make edits, etc. Franck Cuny maps this community (with Gephi), based on information in thousands of user profiles.

The above is a map colored and sorted by the main language of each person (PHP, Python, Perl, Javascript, or Ruby).

Cuny then looks at the structure within the coding networks, which is the most interesting part of the project. The Python map, for example, shows several projects, with Django in the dominant center.

In contrast, the PHP map is a lot more segregated.

I do wish there were some labels for the clusters so that I knew what exactly I was looking at, but if you like, you can download the the files (bottom of post) and explore them in Gephi yourself. See the rest of the graphs over on Flickr.

[Thanks, Steven]

13 Comments

Favorites

Reviving the Statistical Atlas of the United States with New Data

Due to budget cuts, there is no plan for an updated atlas. So I recreated the original 1870 Atlas using today’s publicly available data.

Most popular porn searches, by state

We’ve seen that we can learn from what people search for, through the eyes of Google suggestions: state stereotypes, national …

Top Brewery Road Trip, Routed Algorithmically

There are a lot of great craft breweries in the United States, but there is only so much time. This is the computed best way to get to the top rated breweries and how to maximize the beer tasting experience. Every journey begins with a single sip.

Real Chart Rules to Follow

There are rules—usually for specific chart types meant to be read in a specific way—that you shouldn’t break. When they are, everyone loses. This is that small handful.