The Wikimedia Foundation’s Analytics team is releasing a monthly clickstream dataset. The dataset represents—in aggregate—how readers reach a Wikipedia article and navigate to the next. Previously published as a static release, this dataset is now available as a series of monthly data dumps for English, Russian, German, Spanish, and Japanese Wikipedias.
Data to identify Wikipedia rabbit holes
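To make the "rabbit hole" idea concrete, here is a minimal Python sketch of how one might trace a path through this kind of data. It assumes each monthly dump is a tab-separated file with four columns (previous page, current page, link type, and count) and that internal article-to-article clicks are marked with the type `link`; check the dataset's own documentation before relying on that layout. The sample rows are made up for illustration, and the function names `outgoing_links` and `follow_top_links` are hypothetical.

```python
import csv
import io
from collections import defaultdict

# Made-up sample rows in the ASSUMED layout: prev, curr, type, n (tab-separated).
SAMPLE = (
    "other-search\tRabbit\texternal\t120\n"
    "Rabbit\tRabbit_hole\tlink\t40\n"
    "Rabbit_hole\tAlice's_Adventures_in_Wonderland\tlink\t25\n"
    "Rabbit\tHare\tlink\t15\n"
)


def outgoing_links(tsv_file):
    """Map each source article to its (target, count) pairs, most clicked first."""
    edges = defaultdict(list)
    reader = csv.reader(tsv_file, delimiter="\t", quoting=csv.QUOTE_NONE)
    for prev, curr, link_type, n in reader:
        # Keep only article-to-article clicks (assumed to be tagged "link").
        if link_type == "link":
            edges[prev].append((curr, int(n)))
    for targets in edges.values():
        targets.sort(key=lambda pair: pair[1], reverse=True)
    return dict(edges)


def follow_top_links(edges, start, max_steps=5):
    """Greedily follow the most-clicked unvisited link from each article."""
    path = [start]
    seen = {start}
    current = start
    for _ in range(max_steps):
        nxt = next((t for t, _ in edges.get(current, []) if t not in seen), None)
        if nxt is None:
            break  # dead end: no unvisited outgoing links
        path.append(nxt)
        seen.add(nxt)
        current = nxt
    return path


edges = outgoing_links(io.StringIO(SAMPLE))
path = follow_top_links(edges, "Rabbit")
print(" -> ".join(path))
```

To try this on a real dump, you would pass an opened file (for example via `gzip.open(path, "rt", encoding="utf-8")` if the file is gzipped) in place of the in-memory sample. The greedy walk is only one simple way to define a rabbit hole; it ignores everything but the single most popular next click.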