Mapping GitHub – a network of collaborative coders

Posted to Network Visualization  |  Nathan Yau

GitHub is a large community where coders can collaborate on software development projects. People check code in and out, make edits, etc. Franck Cuny maps this community (with Gephi), based on information in thousands of user profiles.

The above is a map colored and sorted by the main language of each person (PHP, Python, Perl, JavaScript, or Ruby).

Cuny then looks at the structure within the coding networks, which is the most interesting part of the project. The Python map, for example, shows several projects, with Django in the dominant center.

In contrast, the PHP map is a lot more segregated.

I do wish there were some labels for the clusters so that I knew exactly what I was looking at, but if you like, you can download the files (bottom of post) and explore them in Gephi yourself. See the rest of the graphs over on Flickr.

[Thanks, Steven]
