What is Data and Why Do We Care About it So Much?

Posted to Miscellaneous  |  Nathan Yau

What is Data and Why Should We Care About It?I’ve been fortunate to have worked with people from lots of different fields – statistics, ecology, computer science, engineering, design, etc. If I’ve learned anything, it’s that everyone has a different idea of what data is and why it matters.

I’ve found that until I’ve understood what my collaborators mean by data and what they (and me) are trying to get out of a dataset, it’s near impossible to get anything useful done.

To make things a bit more clear (and for my own enjoyment), I asked a select group of people a single question:

What is data and why should we care about it?

Those who responded are from different areas of expertise, ranging from statistics, to business, to computer science, to design. Some names you’ll recognize while others will be new to you. All are doing interesting things with data.

I’ve been looking forward to this series for a couple of weeks now, and my hope is that you will gain a better understanding about what data is and how people are putting it to use. Keep an eye out for posts with the black square image above.

Here is who has answered so far:

If you’d like to answer the question yourself, I’d love to see your response too, or if you write an answer on your own blog, please do post the link in the comments below.



Real Chart Rules to Follow

There are rules—usually for specific chart types meant to be read in a specific way—that you shouldn’t break. When they are, everyone loses. This is that small handful.

Pizza Place Geography

Most of the major pizza chains are within a 5-mile radius of where I live, so I have my pick, …

The Best Data Visualization Projects of 2011

I almost didn’t make a best-of list this year, but as I clicked through the year’s post, it was hard …

Reviving the Statistical Atlas of the United States with New Data

Due to budget cuts, there is no plan for an updated atlas. So I recreated the original 1870 Atlas using today’s publicly available data.