Category: Statistical Visualization

  • Statistical Atlas from the ninth Census in 1870

    Posted Mar 16, 2010 to Statistical Visualization / 9 comments

    Statistical Atlas from the ninth Census in 1870

    In 1870, Francis Walker oversaw publication of the United States' first Statistical Atlas, based on data from the ninth Census. It was a big moment for statistics in the United States as the atlas provided a way to compare data on a national level using maps and statistical graphics.

    What continues to amaze me about these old illustrations is the detail - all done by hand. That's ridiculous. The kicker is that a lot of this stuff looks way better than a lot of what we see nowadays. Here are some selections from the 1870 atlas.
    Continue Reading

  • Canada: the country that pees together stays together

    Posted Mar 9, 2010 to Statistical Visualization / 21 comments

    Canada: the country that pees together stays together

    EPCOR, the water utility company that runs the fountains up in Edmonton, Canada released this graph yesterday. It's water consumption during the Olympic gold medal hockey game, overlaying consumption of the previous day. How much do Canadians love their hockey? A lot.

    The first period ends. Time to pee. The second period ends. Time to pee. The third period ends. Time to pee. Consumption goes way down when Canada wins and during the medal ceremony.

    Finally, when it's all said and done, the rest of the country can relieve itself, figuratively and literally.

    [via contrarian | thanks, @statpumpkin]

  • Challenge: make this graph easier to read

    The Economist discusses the return of big government and includes this graph showing total government spending as a percentage of Gross Domestic Product. We see a dip in 2000 and a big jump this past year.

    The trouble is that the country labels are cluttered. If you read them left to right, you get mixed up initially. Keep your eyes left and move top to bottom, and you might be okay.

    The Challenge

    Can you think of a way to make this graph easier to read? Is there a better way to represent the time series?

    One catch: you have to work within the size limitation of 290 pixels wide and 300 pixels tall. It's an easy fix with unlimited space. But what can you do when space is scarce? Leave your thoughts in the comments below.

    P.S. I was looking for the data this graph uses but got tired of using the OECD stat browser, so we'll just have to use our imagination for this one.

    [Thanks, Justin]

    Update: Here's GDP (sans spending) by country from 1995 to 2008 if anyone would like to take a wack [thanks, Kim].

  • An Exploration of Biological Records

    Posted Feb 25, 2010 to Statistical Visualization / 1 comment

    An Exploration of Biological Records

    The Natural Science Museum of Barcelona has a growing database of 50,000 records of specimens collected over the past 150 years. Bestiario explores this data in their biodiversity treemap and geographical map.

    The cool thing about the treemap is that you can zoom in iteratively through the Phylums, Classes, Orders, Families, etc. The interaction is similar to NYT's treemap that showed Obama's budget proposal from earlier this month, except there are a lot more levels, so you zoom, and then you zoom some more.

    [Thanks, Jose]

  • Road to Recovery – Is the Recovery Act working?

    Road to Recovery – Is the Recovery Act working?

    The Obama administration just posted a graph showing monthly job loss from December 2007 (Bush in red) up to last month. Discuss.

    [via @nickbilton]

    Update: There's a video version now [via infosthetics].

  • Build Online Visualization for Free with Tableau Public

    Tableau Software, popular for making data more accessible, mainly in the business sector, just opened up with Tableau Public. The application is similar in spirit to other online data applications like Many Eyes and Swivel. It lets you share data and visualizations online. However, Tableau Public doesn't have a central portal or a place to browse data. Rather it's focused on letting you explore data and stitch modules together on your desktop and then embed your findings on a website or blog.
    Continue Reading

  • An Easy Way to Make a Treemap

    Posted Feb 11, 2010 to Statistical Visualization, Tutorials / 47 comments

    Back in 1990, Ben Shneiderman, of the University of Maryland, wanted to visualize what was going on in his always-full hard drive. He wanted to know what was taking up so much space. Given the hierarchical structure of directories and files, he first tried a tree diagram. It got too big too fast to be useful though. Too many nodes. Too many branches.

    The treemap was his solution. It's an area-based visualization where the size of each rectangle represents a metric since made popular by Martin Wattenberg's Map of the Market and Marcos Weskamp's newsmap.

    Here's a really easy way to make your own treemap in just a couple lines of code. We're looking to make something like this:


    Continue Reading

  • Obama’s Budget Proposal and Incorrect Forecasts

    Posted Feb 1, 2010 to Statistical Visualization / 5 comments

    Obama’s Budget Proposal and Incorrect Forecasts

    President Obama announced his 2011 budget proposal. How does it compare to last year's budget? Shan Carter and Amanda Cox of The New York Times compare the two plans. Red indicates a decrease in the percentage of the budget dedicated to the respective area, and green is for growth. Zoom in for a better view of the smaller areas.

    Big decreases for Medicaid grants, veteran benefits, and unemployment insurance. Major increase for education, Medicare, and administration of justice.

    This of course takes budget forecasting into account, which Ms. Cox shows isn't all that accurate sometimes.

    [via @nytgraphics]

  • How to Make a Heatmap – a Quick and Easy Solution

    How do you make a heatmap? This came from kerimcan in the FlowingData forums, and krees followed up with a couple of good links on how to do them in R. It really is super easy. Here's how to make a heatmap, with just a few lines of code, like this.


    Continue Reading

  • Build Statistical Graphics Online With ggplot2

    Build Statistical Graphics Online With ggplot2

    Statisticians are generally behind the times when it comes to online applications. There are a lot out-dated Java applets and really rough attempts at getting R, a statistical computing environment, in some useful form through a browser. So imagine my surprise when I tried this tool by Jeroen Ooms, a visiting scholar at UCLA Statistics.

    It actually works pretty well, and for a prototype, it isn't half bad.
    Continue Reading

  • How to Make an Interactive Area Graph with Flare

    You've seen the NameExplorer from the Baby Name Wizard by Martin Wattenberg. It's an interactive area chart that lets you explore the popularity of names over time. Search by clicking on names or typing in a name in the prompt. It's simple. It's sexy. Everybody loves it.

    This is a step-by-step guide on how to make a similar visualization in Actionscript/Flash with your own data and how to customize the design for whatever you need. We're after last week's graphic on consumer spending:

    consumer spending
    Continue Reading

  • Stat Charts Get a New York Times Redesign

    Posted Dec 3, 2009 to Statistical Visualization / 19 comments

    Stat Charts Get a New York Times Redesign

    Statistical graphics are often... kind of bland. But that's fine, because they're usually for analysis, and the wireframe does just fine. The time eventually comes though when you need to present your analytical visualization in a paper or some slides, and you're no longer the primary reader.

    In their NYT op-ed on health care calculations, Andrew Gelman, Nate Silver, and Daniel Lee had some graphics of their own that needed some NYT flavor and design treatment.
    Continue Reading

  • The Cost of Getting Sick

    Posted Nov 23, 2009 to Statistical Visualization / 18 comments

    The Cost of Getting Sick

    GE and Ben Fry (now the director of SEED visualization), show the cost of getting sick, from the individual's and insurer's perspective. The data: 500k records from the Medical Expenditure Panel Survey from GE's proprietary database. The visualization: a polar area pie chart.
    Continue Reading

  • Unemployment Rate For People Like You – NYT Interactive

    Posted Nov 9, 2009 to Statistical Visualization / 12 comments

    Unemployment Rate For People Like You – NYT Interactive

    Shan Carter, Amanda Cox, and Kevin Quealy of The New York Times explore 12-month average unemployment rates for just about any breakdown you can imagine. The main point: not everyone has been affected by the recession equally, and here's how each group has felt it.

    Start with the filters up top for race, gender, age, and education level. The corresponding time series highlights blue.

    Change the filters - and here's where the graphic gets a lot of mileage - the lit line moves up or down and the vertical axis updates, depending on what you were originally looking at. That up and down movement makes comparison between demographic groups much easier, especially because there are so many time series on a single plot.

    I'm impressed, NYT. Again.

  • Facebook Measures Happiness in Status Updates

    Posted Oct 5, 2009 to Statistical Visualization / 23 comments

    Facebook Measures Happiness in Status Updates

    As we all know, Facebook lets people update their friends with status updates, and with millions of users, that's a lot of data. Look at the aggregated data over time, and you could see some interesting trends.

    The Facebook Data Team recently measured happiness in the United States based on these updates with a metric they call United States Gross National Happiness.

    Measuring how well-off, happy or satisfied with life the citizens of a nation are is part of the Gross National Happiness movement. This graph represents how "happy" the nation is doing from day to day, by looking at how many positive and negative words people are using when they update their status: When people are using more positive words (or fewer negative words) in their status updates than usual, that day is happier than usual!

    Browse the trends over time, and there's nothing earth-shattering really. You've got dips on the Mondays and peaks on holidays. Although I'm not sure what happened January 22, 2008 to make people so sad. EDIT: It was the day Heath Ledger died [Thanks, Amanda].

    Big picture though, I'm sure governments, businesses, organizations, etc would be more than pleased to have something like this when they made a new policy, launched a new product, or started a new initiative.

    That's probably why so many are fascinated with the publicly available data coming out of Twitter.

    [via TechCrunch]