Resources to Find the Data You Need, 2016 Edition

Before you get started on any data-related project, you need data. I know. It sounds crazy, but it’s the truth. It can be frustrating to sleuth for the data you need, so here are some tips on finding it (the openly available variety) and some topic-specific resources to begin your travels.

This is an update to the guide I wrote in 2009, which as it turns out, is now mostly outdated. So, 2016. Here we go.


I’ve been spending more time than usual in this area. The most reliable data comes from government organizations such as the Census Bureau and the Bureau of Labor Statistics. At least, this is the case in the United States. I can’t speak for other countries’ organizations.

The challenge is to sift through outdated government websites and portals with meaningless acronyms. was an effort to relieve some of the pain, but I’ve found it much more helpful to go through Google.


  • Census Bureau — They provide general population data and breakdowns, along with much more information via the American Community Survey. Their downloading portal, American FactFinder, does take some getting used to, but there’s a lot of data available there at your fingertips. Unless you want more historical data pre-2000. In this case, Google search is your friend, because even if it’s on the Census site somewhere, you probably won’t find it through the web navigation.
  • IPUMS — Maintained by the Minnesota Population Center, this makes Census microdata much easier to download.
  • Bureau of Labor Statistics
  • Organisation for Economic Co-operation and Development — For global indicators.
  • UNdata — Also for global indicators.


Health data was supposed to be easy to access years ago with various government initiatives, but that never quite caught on. There are sites, but they’re not especially usable. Instead, it’s typically easier to go to the source.



There’s been a growing number of places to download geographic data it seems, but there are a handful that have been around for a while and remain reliable.



If you’re interested in exploring the data that you see in a news graphic, reputable news sources provide their source either in the graphic or in the accompanying text. Usually it doesn’t lead you directly to the source, but it tells you where to look.



You’d think there would be a ton of sports data available with Sabermetrics taking off, but most of that data is private and cost a lot. You can still download historical box score type data though pretty easily.


General Purpose

There was a time when a whole bunch of data sites existed to share and upload data, but those are gone or no longer exist in their original format. Still, there are some.


Scraping and APIs

Then there are the less formatted routes to getting data. You can scrape data from websites or make use of openly available APIs. I use the Google APIs sometimes and still use the Python library BeautifulSoup every now and then.

This route requires some programming, but it can be worth a look.

General Tips in Your Search

This isn’t a comprehensive list. Not even close. But it should be enough to get you started and give you an idea for where to search. My searches typically start at Google, trying to be as specific as possible. If I don’t find anything at first, then I go more and more general.

Sometimes I’ll look for related data graphics that people made and then see if they provide their source. It’s customary these days. Side note: If there isn’t a source listed, you probably shouldn’t trust the graphic. But yeah, there is so much visualization out there that you can usually find something.

Finally, if you’re just looking for inspiration or for some data to poke at, there is always the Data Sources category here on FlowingData.

Become a member. Support an independent site. Make great charts.

See What You Get

Learn to Visualize Data See All →

A Course for Visualizing Time Series Data in R

Learn to visualize temporal patterns in a couple of days.

How to Make a Bump Chart in R, with ggplot

Visualize rankings over time instead of absolute values to focus on order instead of the magnitude of change.

How to map connections with great circles

There are various ways to visualize connections, but one of the most intuitive and straightforward ways is to actually connect entities or objects with lines. And when it comes to geographic connections, great circles are a nice way to do this.

Loading Data and Basic Formatting in R

It might not be sexy, but you have to load your data and get it in the right format before you can visualize it. Here are the basics, which might be all you need.


Finding the New Age, for Your Age

You’ve probably heard the lines about how “40 is the new 30” or “30 is the new 20.” What is this based on? I tried to solve the problem using life expectancy data. Your age is the new age.

Where Bars Outnumber Grocery Stores

A closer look at the age old question of where there are more bars than grocery stores, and vice versa.

Graphical perception – learn the fundamentals first

Before you dive into the advanced stuff – like just about everything in your life – you have to learn the fundamentals before you know when you can break the rules.

How Much Americans Make

Median income only tells you where the middle is. The distributions of income are a lot more interesting.