Finding Weirdness in Temperature Data

Posted to Mistaken Data  |  Nathan Yau

After parsing Weather Underground pages to grab temperature data, it’s time to look at the data. Can’t download all that data and not do anything with it!

First off, in my initial pass of my parsing script, I accidentally cut the month range short, so I didn’t get any data for December from 1980 to 2005. It should be noted that these plots don’t show this missing data. Um, there’s no axes or labels either. Sorry, I got a little lazy, but that’s not the point now anyways.

Notice anything weird about the above plot? There’s some unusually smooth data in the middle. Here’s a zoom in:

Wunder: Inconsistency Highlighted

If we look at the data between 1994 and 1997, there’s oddly a lot of smoothness… hmm… HMMM.

It looks like between that time, there was some interpolation going on. I mean, if that’s all you got, that’s all you got, but I wish WU would at least make note of it or provide some annotation.

Anyways, just another example of data posing to be something else. In my opinion, all data sucks until proven worthwhile.

Favorites

Life expectancy changes

The data goes back to 1960 and up to the most current estimates for 2009. Each line represents a country.

Think Like a Statistician – Without the Math

I call myself a statistician, because, well, I’m a statistics graduate student. However, the most important things I’ve learned are less formal, but have proven extremely useful when working/playing with data.

The Changing American Diet

See what we ate on an average day, for the past several decades.

Jobs Charted by State and Salary

Jobs and pay can vary a lot depending on where you live, based on 2013 data from the Bureau of Labor Statistics. Here’s an interactive to look.