Jeopardy! clues data

Posted to Data Sources  |  Tags:  |  Nathan Yau

Here’s some weekend project data for you. Reddit user trexmatt dumped a dataset for 216,930 Jeopardy! questions and answers in JSON and CSV formats, a scrape from the J! Archive. Each clue is represented by category, money value, the clue itself, the answer, round, show number, and air date.

I’m not sure what I’d do with the data, but the first thing that comes to mind is investigating the hunt for Daily Doubles. Where are those things usually placed, and how random is it? Oh wait, someone already did that.

Have fun poking.

Favorites

How to Spot Visualization Lies

Many charts don’t tell the truth. This is a simple guide to spotting them.

Shifting Incomes for American Jobs

For various occupations, the difference between the person who makes the most and the one who makes the least can be significant.

Years You Have Left to Live, Probably

The individual data points of life are much less predictable than the average. Here’s a simulation that shows you how much time is left on the clock.

Pizza Place Geography

Most of the major pizza chains are within a 5-mile radius of where I live, so I have my pick, …