<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>FlowingData &#187; Projects</title>
	<atom:link href="http://flowingdata.com/category/projects/feed/" rel="self" type="application/rss+xml" />
	<link>http://flowingdata.com</link>
	<description>Strength in Numbers</description>
	<lastBuildDate>Fri, 10 Feb 2012 20:28:38 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<atom:link rel="next" href="http://flowingdata.com/category/projects/feed/?page=2" />

		<item>
		<title>Where are the biggest box office movies (not) streaming?</title>
		<link>http://flowingdata.com/2012/01/20/where-are-the-biggest-box-office-movies-streaming/</link>
		<comments>http://flowingdata.com/2012/01/20/where-are-the-biggest-box-office-movies-streaming/#comments</comments>
		<pubDate>Fri, 20 Jan 2012 08:03:47 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Projects]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[movies]]></category>
		<category><![CDATA[netflix]]></category>
		<category><![CDATA[streaming]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=21000</guid>
		<description><![CDATA[<p><a href="http://flowingdata.com/2012/01/20/where-are-the-biggest-box-office-movies-streaming/"><img width="625" height="497" src="http://flowingdata.com/wp-content/uploads/2012/01/streaming3-625x497.png" class="attachment-medium wp-post-image" alt="Big movie streaming" title="Big movie streaming" /></a></p>After seeing Tristan Louis' list that tallied the streaming availability of 2011's top 100 box office hits, I was curious what it looked like graphically.]]></description>
			<content:encoded><![CDATA[<p><a href="http://flowingdata.com/2012/01/20/where-are-the-biggest-box-office-movies-streaming/"><img width="625" height="497" src="http://flowingdata.com/wp-content/uploads/2012/01/streaming3-625x497.png" class="attachment-medium wp-post-image" alt="Big movie streaming" title="Big movie streaming" /></a></p><p>After seeing Tristan Louis' list that tallied the <a href="http://www.tnl.net/blog/2012/01/14/internet-vod-2011-movies/">streaming availability of 2011's top 100 box office hits</a>, I was curious what it looked like graphically. So I put together this little number. Blue means available, yellow means not, and gray means it's only available for purchase. The last column for DVD simply means it's available (since DVDs are of course not streaming).</p>
<p>Netflix streaming still isn't a place to find the big movies (as any Netflix customer can tell you), with only five of the top 100 available. There is greater streaming availability from iTunes, Amazon, and Vudu, but those of course aren't fair comparisons to Netflix, given that the latter is subscription-only.</p>
<p>My main takeaway is that if you're deciding between the non-subscription services, it looks like price is the main thing to look at, since there doesn't seem to be much variability in availability (although it could be different for smaller movies). As for Netflix, subscribe for the television and for the movies less so.</p>
<p>[<a href="http://www.tnl.net/blog/2012/01/14/internet-vod-2011-movies/">Tristan Louis</a> via <a href="http://waxy.org/links">Waxy</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2012/01/20/where-are-the-biggest-box-office-movies-streaming/feed/</wfw:commentRss>
		<slash:comments>27</slash:comments>
		</item>
		<item>
		<title>Vehicles involved in fatal crashes</title>
		<link>http://flowingdata.com/2012/01/11/vehicles-involved-in-fatal-crashes/</link>
		<comments>http://flowingdata.com/2012/01/11/vehicles-involved-in-fatal-crashes/#comments</comments>
		<pubDate>Wed, 11 Jan 2012 10:44:20 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Projects]]></category>
		<category><![CDATA[calendar]]></category>
		<category><![CDATA[fatal crashes]]></category>
		<category><![CDATA[featured]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=20737</guid>
		<description><![CDATA[<p><a href="http://flowingdata.com/2012/01/11/vehicles-involved-in-fatal-crashes/"><img width="625" height="778" src="http://flowingdata.com/wp-content/uploads/2012/01/calendar5.png" class="attachment-medium wp-post-image" alt="Vehicles involved in fatal crashes" title="Vehicles involved in fatal crashes" /></a></p>After seeing a map in The Guardian, I was curious about what other data was available from the National Highway Traffic Safety Association. It turns out there's a lot.]]></description>
			<content:encoded><![CDATA[<p><a href="http://flowingdata.com/2012/01/11/vehicles-involved-in-fatal-crashes/"><img width="625" height="778" src="http://flowingdata.com/wp-content/uploads/2012/01/calendar5.png" class="attachment-medium wp-post-image" alt="Vehicles involved in fatal crashes" title="Vehicles involved in fatal crashes" /></a></p><p>After seeing <a href="http://flowingdata.com/2011/11/29/us-road-fatalities-mapped-9-years/">this map</a> on <em>The Guardian</em>, I was curious about what other data was available from the National Highway Traffic Safety Administration. It turns out there's a lot and it's relatively easy to access via FTP. What's most surprising is that it's detailed and fairly complete, with columns for weather, number of people involved, date and time of accidents, and a lot more.</p>
<p>The above shows vehicles involved in fatal crashes in 2010 (which is different from number of crashes or number of fatalities). This data was just released last month, at the end of 2011 oddly enough. It's a calendar view with months stacked on top of one another and darker days indicate more vehicles involved.</p>
<p>Nearly every single data point also has location attached to it, so I tried some mapping, but they look like population density more or less. Here's one that shows crashes that occurred on local roads (orange) and those on freeways, highways, etc (blue). Road patterns start to come out for the major interstates.</p>
<p><img src="http://flowingdata.com/wp-content/uploads/2012/01/roadtype.png" alt="" title="Crashes by road type" width="625" height="399" class="alignnone size-full wp-image-20739" /></p>
<p>I also mapped by weather (<a href="http://flowingdata.com/wp-content/uploads/2012/01/rain.png">rain</a>, <a href="http://flowingdata.com/wp-content/uploads/2012/01/fog.png">fog</a>, <a href="http://flowingdata.com/wp-content/uploads/2012/01/snow.png">snow</a>), time of day (<a href="http://flowingdata.com/wp-content/uploads/2012/01/hour1.png">midnight to 4am</a>, <a href="http://flowingdata.com/wp-content/uploads/2012/01/hour2.png">4-8</a>, <a href="http://flowingdata.com/wp-content/uploads/2012/01/hour3.png">8-noon</a>, <a href="http://flowingdata.com/wp-content/uploads/2012/01/hour4.png">noon to 4</a>, <a href="http://flowingdata.com/wp-content/uploads/2012/01/hour5.png">4 to 8</a>, <a href="http://flowingdata.com/wp-content/uploads/2012/01/hour6.png">and 8 to midnight</a>), and <a href="http://flowingdata.com/wp-content/uploads/2012/01/drunks.png">crashes due to drunk driving</a> but it wasn't as interesting as the day-by-day. </p>
<p>If you're a teacher looking for data to use with an assignment or just want to practice, this is a good set, despite the somber topic. You can find the data <a href="http://www-fars.nhtsa.dot.gov/Main/reportslinks.aspx">here</a>, and there's an FTP link in the footer of the page to download more detailed data. You'll also need <a href="http://www-nrd.nhtsa.dot.gov/Pubs/811529.pdf">this guide</a> [pdf] that defines all the variables.</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2012/01/11/vehicles-involved-in-fatal-crashes/feed/</wfw:commentRss>
		<slash:comments>28</slash:comments>
		</item>
		<item>
		<title>Life expectancy changes</title>
		<link>http://flowingdata.com/2011/10/13/life-expectancy-changes/</link>
		<comments>http://flowingdata.com/2011/10/13/life-expectancy-changes/#comments</comments>
		<pubDate>Thu, 13 Oct 2011 10:50:50 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Projects]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[interactive]]></category>
		<category><![CDATA[life expectancy]]></category>
		<category><![CDATA[time series]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=19290</guid>
		<description><![CDATA[<p><a href="http://flowingdata.com/2011/10/13/life-expectancy-changes/"><img width="625" height="452" src="http://flowingdata.com/wp-content/uploads/2011/10/Life-expectancy-625x452.png" class="attachment-medium wp-post-image" alt="Life expectancy" title="Life expectancy" /></a></p>I played around with D3 some more. This time I used data from The World Bank to look at life &#8230;]]></description>
			<content:encoded><![CDATA[<p><a href="http://flowingdata.com/2011/10/13/life-expectancy-changes/"><img width="625" height="452" src="http://flowingdata.com/wp-content/uploads/2011/10/Life-expectancy-625x452.png" class="attachment-medium wp-post-image" alt="Life expectancy" title="Life expectancy" /></a></p><p>I played around with <a href="http://mbostock.github.com/d3/">D3</a> some more. This time I used data from The World Bank to look at <a href="http://projects.flowingdata.com/life-expectancy/">life expectancy over time and by country</a>. The data goes back to 1960 and up to the most current estimates for 2009. Each line represents a country, and you can see a few more details by rolling over each one.</p>
<p>More usefully, you can select regions, as defined by The World Bank, and see how groups of countries have changed or compare with other regions. For example, the shot above shows East Asia and Pacific in pink and Sub-Saharan Africa in green.</p>
<p>The major dips for some of the countries jumped out at me immediately. A few quick searches suggest that they coincide with war, such as the Bangladesh Liberation War in the 1970s, the Iran-Iraq War in the 1980s, and the Rwanda Civil War in the 1990s. There's also a smaller dip for Iraq starting in early 2000.</p>
<p>Some countries, such as Seychelles, didn't have data that went back to 1960. In this case, the line starts where annual data became available.</p>
<p>There are a few interactions that I'd add to make the graphic more complete, but don't have time to get to right now. For example, as I clicked around, I wanted to keep a selected country highlighted to see how it compared to others. It'd be informative to point out periods of war on the graphic, too.</p>
<p>See the <a href="http://projects.flowingdata.com/life-expectancy/">full interactive here</a>, and as usual, comments are welcome.</p>
<p>[<a href="http://projects.flowingdata.com/life-expectancy/">Life Expectancy</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2011/10/13/life-expectancy-changes/feed/</wfw:commentRss>
		<slash:comments>24</slash:comments>
		</item>
		<item>
		<title>Who does all the text messaging? Young adults by far.</title>
		<link>http://flowingdata.com/2011/09/26/who-does-all-the-text-messaging-young-adults-by-far/</link>
		<comments>http://flowingdata.com/2011/09/26/who-does-all-the-text-messaging-young-adults-by-far/#comments</comments>
		<pubDate>Mon, 26 Sep 2011 07:29:39 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Projects]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[Pew Research]]></category>
		<category><![CDATA[texting]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=19041</guid>
		<description><![CDATA[<p><a href="http://flowingdata.com/2011/09/26/who-does-all-the-text-messaging-young-adults-by-far/"><img width="625" height="847" src="http://flowingdata.com/wp-content/uploads/2011/09/texting1.png" class="attachment-medium wp-post-image" alt="Who does all the texting?" title="Who does all the texting?" /></a></p>The Pew Internet and American Life Project published the results of their texting study, showing that young adults text more &#8230;]]></description>
			<content:encoded><![CDATA[<p><a href="http://flowingdata.com/2011/09/26/who-does-all-the-text-messaging-young-adults-by-far/"><img width="625" height="847" src="http://flowingdata.com/wp-content/uploads/2011/09/texting1.png" class="attachment-medium wp-post-image" alt="Who does all the texting?" title="Who does all the texting?" /></a></p><p>The Pew Internet and American Life Project published the results of their <a href="http://pewinternet.org/Reports/2011/Cell-Phone-Texting-2011.aspx">texting study</a>, showing that young adults text more than anyone else. The report refers to a lot of averages across demographics, but it seems that there were a lot of heavy texters driving up those averages. The medians are a lot lower. The chart above shows the latter.</p>
<p>Even the median for young adults is still high though relative to other groups.</p>
<p>At 29, I'm right at the edge of that young adult group, and I text maybe once or twice a month on average. Forty per day seems outrageously high. Kids these days. </p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2011/09/26/who-does-all-the-text-messaging-young-adults-by-far/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>How do Americans spend their days?</title>
		<link>http://flowingdata.com/2011/09/20/how-do-americans-spend-their-days/</link>
		<comments>http://flowingdata.com/2011/09/20/how-do-americans-spend-their-days/#comments</comments>
		<pubDate>Tue, 20 Sep 2011 07:23:31 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Projects]]></category>
		<category><![CDATA[D3]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[interactive]]></category>
		<category><![CDATA[time use]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=18898</guid>
		<description><![CDATA[<p><a href="http://flowingdata.com/2011/09/20/how-do-americans-spend-their-days/"><img width="625" height="497" src="http://flowingdata.com/wp-content/uploads/2011/09/How-Americans-spend-their-day-full1-625x497.png" class="attachment-medium wp-post-image" alt="How Americans spend their day -full" title="How Americans spend their day -full" /></a></p>One of my favorite data graphics is an interactive piece by The New York Times that shows how Americans spend &#8230;]]></description>
			<content:encoded><![CDATA[<p><a href="http://flowingdata.com/2011/09/20/how-do-americans-spend-their-days/"><img width="625" height="497" src="http://flowingdata.com/wp-content/uploads/2011/09/How-Americans-spend-their-day-full1-625x497.png" class="attachment-medium wp-post-image" alt="How Americans spend their day -full" title="How Americans spend their day -full" /></a></p><p>One of my favorite data graphics is an <a href="http://flowingdata.com/2009/08/10/how-people-in-america-spend-their-day/">interactive piece</a> by <em>The New York Times</em> that shows how Americans spend their day, based on the American Time Use Survey (ATUS). I've also been wanting to play with Mike Bostock's <a href="http://flowingdata.com/2011/03/09/data-driven-documents-for-visualization-in-the-browser/">Data-Driven Documents</a>, or D3 for short, for a while now. So put the two together, and <a href="http://projects.flowingdata.com/timeuse/">this is what I got</a>.</p>
<p>Main takeaway: we spend most of our time sleeping, eating, working, and watching television.</p>
<p>I followed the NYT aesthetic mostly for myself. I've found that a good way to learn is to try to copy something you like, and then use what you learn to do your own thing.</p>
<p>The NYT version was a stacked area chart with a number of interactions that made the data easier to read. I took a different route and split up some of the main activities, such as sleeping, eating, and working, into separate time series charts to see if it allowed you to see anything new. Showing separate charts at once places more focus on comparisons than on the distributions.</p>
<p>There was one challenge with the data that I didn't anticipate. ATUS provides a table of percentages by hour, but you'll notice with the NYT graphic the numbers are per ten minutes. You can actually calculate this with the ATUS microdata, which is basically the raw survey data from a few thousand respondents. I did this at first but lost patience, because I really just wanted to play with D3 rather than spend all my time building estimates. </p>
<p>I went with the hourly data which are averages for the sample population. The demographic breakdowns were only available in PDF, so I had some data entry fun, too. Luckily, I got my hands on Able2Extract (going off of Matthew Ericson's <a href="http://flowingdata.com/2011/09/14/pdf-data-woes/#comment-87162">advice</a>) to ease some of the pain.</p>
<p>Once the graphic was done, I noticed that the transitions are actually a nice way to show differences. For example, if you look at the time use for men and then women, the differences are subtle, but because the change is animated it's easier to spot. I think I knew this already, but probably never thought about it very deeply.</p>
<p>Anyways, it was fun playing around with D3. The beginning learning curve was kind of steep for me, but now that I know my way around a little better, I'm expecting more fun to come.</p>
<p>Mess around with the final graphic <a href="http://projects.flowingdata.com/timeuse/">here</a>. Let me know what you think.</p>
<p>[<a href="http://projects.flowingdata.com/timeuse/">How Americans Spend Their Day</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2011/09/20/how-do-americans-spend-their-days/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Brand sentiment showdown</title>
		<link>http://flowingdata.com/2011/07/25/brand-sentiment-showdown/</link>
		<comments>http://flowingdata.com/2011/07/25/brand-sentiment-showdown/#comments</comments>
		<pubDate>Mon, 25 Jul 2011 07:08:30 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Projects]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[sentiment]]></category>
		<category><![CDATA[Twitter]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=17921</guid>
		<description><![CDATA[<p><a href="http://flowingdata.com/2011/07/25/brand-sentiment-showdown/"><img width="450" height="443" src="http://flowingdata.com/wp-content/uploads/2011/07/movies-revised.png" class="attachment-medium wp-post-image" alt="movies-revised" title="movies-revised" /></a></p>There are many brands on Twitter that exist to uphold an image of the company they represent. As consumers, we &#8230;]]></description>
			<content:encoded><![CDATA[<p><a href="http://flowingdata.com/2011/07/25/brand-sentiment-showdown/"><img width="450" height="443" src="http://flowingdata.com/wp-content/uploads/2011/07/movies-revised.png" class="attachment-medium wp-post-image" alt="movies-revised" title="movies-revised" /></a></p><p>There are many brands on Twitter that exist to uphold an image of the company they represent. As consumers, we can communicate with these accounts, voicing praise or displeasure (usually the latter). Using a simple sentiment classifier<sup><a href="http://flowingdata.com/2011/07/25/brand-sentiment-showdown/#footnote1">1</a></sup>, I scored feelings towards major brands from 0 (horrible) to 100 (excellent) once a day for five days. </p>
<p>The above for example, shows scores for Netflix, Hulu, and Redbox. Netflix had the lowest scores, whereas Redbox had the highest. I suspect Netflix started low with people still upset over the price hike, but it got better the next couple of days. Then on Saturday, there was a score drop, which I'm guessing was from their downtime for most of Saturday. Hulu and Redbox, on the other hand, held more steady scores.</p>
<p>As for auto brands, Toyota clearly had the lowest scores. However, Lexus, which is actually a luxury vehicle division of Toyota had the highest scores in the high 90s to 100.</p>
<p><img src="http://flowingdata.com/wp-content/uploads/2011/07/cars-revised.png" alt="" title="Car showdown" width="450" height="765" class="alignnone size-full wp-image-17923" /></p>
<p>How about the major mobile phone companies, AT&T, Verizon, and Sprint? Verizon scored better initially, but had lower scores during the weekend. Not sure what was going on with Sprint.</p>
<p><img src="http://flowingdata.com/wp-content/uploads/2011/07/phone-revised.png" alt="" title="Phone showdown" width="450" height="443" class="alignnone size-full wp-image-17925" /></p>
<p>Between Twitter and Facebook, there was obviously some bias, but Twitter faired slightly better. Twitter scored lower than I expected, but it probably has to do with bug reports directed towards @twitter.</p>
<p><img src="http://flowingdata.com/wp-content/uploads/2011/07/social-revised.png" alt="" title="social-revised" width="450" height="356" class="alignnone size-full wp-image-17927" /></p>
<p>Is Domino's Pizza good now? Papa John's stayed fairly steady while Pizza Hut scores were sub-par.</p>
<p><img src="http://flowingdata.com/wp-content/uploads/2011/07/pizza-revised.png" alt="" title="Pizza" width="450" height="443" class="alignnone size-full wp-image-17926" /></p>
<p>Finally, as a sanity check, I compared airlines like Breen did in his tutorial. Results were similar with JetBlue and Southwest clearly in the positive and the others picking up the rear.</p>
<p><img src="http://flowingdata.com/wp-content/uploads/2011/07/airlines-revised.png" alt="" title="Airlines ranked" width="450" height="796" class="alignnone size-full wp-image-17922" /></p>
<p>Any of these scores seem surprising to you?</p>
<hr />
<ol class="footnotes">
<li id="footnote1">Jeffrey Breen provides an <a href="http://jeffreybreen.wordpress.com/2011/07/04/twitter-text-mining-r-slides/">easy-to-follow tutorial</a> on Twitter sentiment in R. The scoring system is pretty basic. All you do is load tweets with a given search phrase, and then find all the "good" words and "bad" words. Good words give +1, and bad words give -1. Then a tweet is classified good or bad based on the total. Then to get a final score, only tweets with total of +2 or more or -2 or less are counted. The final score is computed by dividing number of negative tweets divided by total number of "extreme" tweets. Obviously this won't pick up on sarcasm, but the scoring seems to still do a decent job. I wouldn't make any important business decisions based on these results though.</li>
</ol>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2011/07/25/brand-sentiment-showdown/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Where the aliens are flying their UFOs</title>
		<link>http://flowingdata.com/2011/07/07/where-the-aliens-are-flying-their-ufos/</link>
		<comments>http://flowingdata.com/2011/07/07/where-the-aliens-are-flying-their-ufos/#comments</comments>
		<pubDate>Thu, 07 Jul 2011 10:48:54 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Projects]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[sightings]]></category>
		<category><![CDATA[UFO]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=17652</guid>
		<description><![CDATA[<p><a href="http://flowingdata.com/2011/07/07/where-the-aliens-are-flying-their-ufos/"><img width="625" height="406" src="http://flowingdata.com/wp-content/uploads/2011/07/ufo_map_annotated1-625x406.png" class="attachment-medium wp-post-image" alt="UFO Sightings" title="UFO Sightings" /></a></p>I came across some UFO sightings data on Infochimps, from the National UFO Reporting Center, and it seemed like a &#8230;]]></description>
			<content:encoded><![CDATA[<p><a href="http://flowingdata.com/2011/07/07/where-the-aliens-are-flying-their-ufos/"><img width="625" height="406" src="http://flowingdata.com/wp-content/uploads/2011/07/ufo_map_annotated1-625x406.png" class="attachment-medium wp-post-image" alt="UFO Sightings" title="UFO Sightings" /></a></p><p>I came across some <a href="http://www.infochimps.com/datasets/60000-documented-ufo-sightings-with-text-descriptions-and-metada">UFO sightings data</a> on Infochimps, from the <a href="http://www.nuforc.org/">National UFO Reporting Center</a>, and it seemed like a good excuse to mess around in R. I was just playing around, but the globular result was kind of fun to look at, so here it is.</p>
<p>The dataset is 60,000 sightings, but the above shows about 45,000 locations that could be geocoded immediately. The whiter the region, the more sightings there were in the area from 1906 to 2007. </p>
<p>Is it just me, or does the map above match up with this other map of major US airports?</p>
<p><img src="http://flowingdata.com/wp-content/uploads/2011/07/Major-US-airports-625x359.jpg" alt="" title="Major US airports" width="625" height="359" class="alignnone size-medium wp-image-17661" /></p>
<p>Could some of the sightings actually be... airplanes? Nah. The aliens are coming. Luckily I have a big bat, several glasses of water, and asthma to keep me safe.</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2011/07/07/where-the-aliens-are-flying-their-ufos/feed/</wfw:commentRss>
		<slash:comments>35</slash:comments>
		</item>
		<item>
		<title>Largest data breaches of all time</title>
		<link>http://flowingdata.com/2011/06/13/largest-data-breaches-of-all-time/</link>
		<comments>http://flowingdata.com/2011/06/13/largest-data-breaches-of-all-time/#comments</comments>
		<pubDate>Mon, 13 Jun 2011 08:36:16 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Projects]]></category>
		<category><![CDATA[breaches]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[Sony]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=17249</guid>
		<description><![CDATA[<p><a href="http://flowingdata.com/2011/06/13/largest-data-breaches-of-all-time/"><img width="625" height="543" src="http://flowingdata.com/wp-content/uploads/2011/06/major-data-breaches-updated-625x543.png" class="attachment-medium wp-post-image" alt="Largest data breaches of all time" title="Largest data breaches of all time" /></a></p>As I'm sure you know, Sony has been having all sorts of data breach problems lately &#8212; namely a million &#8230;]]></description>
			<content:encoded><![CDATA[<p><a href="http://flowingdata.com/2011/06/13/largest-data-breaches-of-all-time/"><img width="625" height="543" src="http://flowingdata.com/wp-content/uploads/2011/06/major-data-breaches-updated-625x543.png" class="attachment-medium wp-post-image" alt="Largest data breaches of all time" title="Largest data breaches of all time" /></a></p><p>As I'm sure you know, Sony has been having all sorts of data breach problems lately &mdash; namely <a href="http://flowingdata.com/2011/06/13/analysis-of-passwords-in-sony-pictures-security-breach/">a million passwords</a> from the Sony Pictures site, <a href="http://www.cnn.com/2011/TECH/gaming.gadgets/04/26/playstation.network.hack/index.html">77 million accounts</a> from the PlayStation Network, and nearly <a href="http://mashable.com/2011/05/03/sony-another-hacker-attack/">25 million user accounts</a> from Online Entertainment. I was curious how these recent attacks compared to the largest known data loss incidents, so I headed over to <a href="http://datalossdb.org/">DataLossDB</a>. Sony now holds spots #4 and #10 for largest breaches of all time. That can't be good.</p>
<p>Below: a timeline of all known Sony data breaches so far this year, the biggest on April 26 and the second biggest soon after on May 2. More to come?</p>
<p><img src="http://flowingdata.com/wp-content/uploads/2011/06/sony-timeline.png" alt="" title="Sony Data Breaches - 2011" width="625" height="419" class="alignnone size-full wp-image-17267" /></p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2011/06/13/largest-data-breaches-of-all-time/feed/</wfw:commentRss>
		<slash:comments>35</slash:comments>
		</item>
		<item>
		<title>Seven year itch: When do people get married and divorced?</title>
		<link>http://flowingdata.com/2011/05/23/when-do-people-get-married-and-divorced/</link>
		<comments>http://flowingdata.com/2011/05/23/when-do-people-get-married-and-divorced/#comments</comments>
		<pubDate>Mon, 23 May 2011 07:57:06 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Projects]]></category>
		<category><![CDATA[census]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[marriage]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=16799</guid>
		<description><![CDATA[<p><a href="http://flowingdata.com/2011/05/23/when-do-people-get-married-and-divorced/"><img width="625" height="535" src="http://flowingdata.com/wp-content/uploads/2011/05/Getting-Married-Later-larger-625x535.png" class="attachment-medium wp-post-image" alt="Getting-Married-Later-larger" title="Getting-Married-Later-larger" /></a></p>The United States Census Bureau just released results from the Survey of Income and Program Participation (SIPP) on marriage and &#8230;]]></description>
			<content:encoded><![CDATA[<p><a href="http://flowingdata.com/2011/05/23/when-do-people-get-married-and-divorced/"><img width="625" height="535" src="http://flowingdata.com/wp-content/uploads/2011/05/Getting-Married-Later-larger-625x535.png" class="attachment-medium wp-post-image" alt="Getting-Married-Later-larger" title="Getting-Married-Later-larger" /></a></p><p>The United States Census Bureau just <a href="http://www.census.gov/hhes/socdemo/marriage/data/sipp/index.html">released results</a> from the Survey of Income and Program Participation (SIPP) on marriage and divorce, and my wife and I just celebrated an anniversary this past weekend, so naturally I had to take a look.</p>
<p>The survey of about 39,000 households was actually taken in 2009 (The government can be slow sometimes.), but it provides a glimpse of how marriage and divorce has changed when you compare it to surveys from previous years.</p>
<p>My main takeaway was that people appear to be getting married at an older age, and as you get into the older age groups, the percentages for people who have married at least once are in the high 90s. The former doesn't surprise me simply because it matches with personal observation. The second part though was slightly surprising. For some reason, I always thought there were more people who went their whole lives without getting married.</p>
<p>Can you find any other interesting tidbits in the data?</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2011/05/23/when-do-people-get-married-and-divorced/feed/</wfw:commentRss>
		<slash:comments>18</slash:comments>
		</item>
		<item>
		<title>Geographic breakdown: Where do major airlines fly?</title>
		<link>http://flowingdata.com/2011/05/05/where-do-major-airlines-fly-in-the-united-states/</link>
		<comments>http://flowingdata.com/2011/05/05/where-do-major-airlines-fly-in-the-united-states/#comments</comments>
		<pubDate>Thu, 05 May 2011 07:06:00 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Data Underload]]></category>
		<category><![CDATA[airlines]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[flights]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=16404</guid>
		<description><![CDATA[<p><a href="http://flowingdata.com/2011/05/05/where-do-major-airlines-fly-in-the-united-states/"><img width="625" height="418" src="http://flowingdata.com/wp-content/uploads/2011/05/all-625x418.png" class="attachment-medium wp-post-image" alt="Where America flies" title="Where America flies" /></a></p>Ever since seeing the Facebook friendship map and later, the map of scientific collaboration, I've been looking for an excuse &#8230;]]></description>
			<content:encoded><![CDATA[<p><a href="http://flowingdata.com/2011/05/05/where-do-major-airlines-fly-in-the-united-states/"><img width="625" height="418" src="http://flowingdata.com/wp-content/uploads/2011/05/all-625x418.png" class="attachment-medium wp-post-image" alt="Where America flies" title="Where America flies" /></a></p><p>Ever since seeing the <a href="http://flowingdata.com/2010/12/13/facebook-worldwide-friendships-mapped/">Facebook friendship map</a> and later, the <a href="http://flowingdata.com/2011/01/27/map-of-scientific-collaboration-between-researchers/">map of scientific collaboration</a>, I've been looking for an excuse to play with great circles. So I thought, why not come back to Aaron Koblin's classic <a href="http://flowingdata.com/2007/09/23/visualization-of-us-flight-patterns/">Flight Patterns</a>? But instead of just looking at all flights (above), I broke it down by airline to see where each one flies.</p>
<p>I grabbed the most recent flight data from the Bureau of Transportation Statistics, aggregated by airline, and counted arriving and departing flights between airport pairs. What follows are non-stop domestic flights by major air carriers during February 2011. </p>
<p>Brighter lines represent more arriving and departing flights between the two endpoints, and blue lines are the flights with heaviest traffic. Coloring is relative to within the airline as opposed to overall flight count. </p>
<p>On a quick glance you can spot where the hubs of each carrier are and flights most often flown. We start off with Southwest Airlines, which flies across the country. There's a focus obviously in the southwest.</p>
<p><img class="alignnone size-full wp-image-16419" title="WN" src="http://flowingdata.com/wp-content/uploads/2011/05/WN.png" alt="" width="575" height="385" /></p>
<p>Delta Air Lines flies just about everywhere, too, but also includes flights to Alaska and Hawaii. Their largest hub is in Atlanta, which explains the focus at Hartsfield–Jackson Atlanta International Airport.</p>
<p><img class="alignnone size-full wp-image-16410" title="DL" src="http://flowingdata.com/wp-content/uploads/2011/05/DL.png" alt="" width="575" height="385" /></p>
<p>United Airlines, on the other hand, has hubs more north and on the west coast including O'Hare International in Chicago and San Francisco International Airport. It appears they also have flights to all major Hawaiian islands.</p>
<p><img class="alignnone size-full wp-image-16417" title="UA" src="http://flowingdata.com/wp-content/uploads/2011/05/UA.png" alt="" width="575" height="385" /></p>
<p>Lots of American Airlines traffic in an out of Dallas/Fort Worth International Airport and JFK.</p>
<p><img class="alignnone size-full wp-image-16406" title="AA" src="http://flowingdata.com/wp-content/uploads/2011/05/AA.png" alt="" width="575" height="385" /></p>
<p>Continental Airlines looks similar to American Airlines, except Continental's headquarters are in Houston, Texas.</p>
<p><img class="alignnone size-full wp-image-16409" title="CO" src="http://flowingdata.com/wp-content/uploads/2011/05/CO.png" alt="" width="575" height="385" /></p>
<p>Pretty obvious where JetBlue goes. Despite some delays the past couple of times I've flown with them, they're still my favorite. One time Bill Murray was on the flight. If it's good enough for him, it must be good enough for me.</p>
<p><img class="alignnone size-full wp-image-16408" title="B6" src="http://flowingdata.com/wp-content/uploads/2011/05/B6.png" alt="" width="575" height="385" /></p>
<p>Mesa is a smaller airline that also operates United Express and US Airways Express.</p>
<p><img class="alignnone size-full wp-image-16421" title="YV" src="http://flowingdata.com/wp-content/uploads/2011/05/YV.png" alt="" width="575" height="385" /></p>
<p><img class="alignnone size-full wp-image-16415" title="MQ" src="http://flowingdata.com/wp-content/uploads/2011/05/MQ.png" alt="" width="575" height="385" /></p>
<p><img class="alignnone size-full wp-image-16416" title="OO" src="http://flowingdata.com/wp-content/uploads/2011/05/OO.png" alt="" width="575" height="385" /></p>
<p>US Airways' largest hub is at Charlotte/Douglas International Airport.</p>
<p><img class="alignnone size-full wp-image-16418" title="US" src="http://flowingdata.com/wp-content/uploads/2011/05/US.png" alt="" width="575" height="385" /></p>
<p><img class="alignnone size-full wp-image-16420" title="XE" src="http://flowingdata.com/wp-content/uploads/2011/05/XE.png" alt="" width="575" height="385" /></p>
<p>The Alaska Airlines connections look really interesting, streaming out of the the northwest. Most flights go through Seattle-Tacoma International, but there are also flights to and from Portland International. Oh, and of course to and from Alaska.</p>
<p><img class="alignnone size-full wp-image-16407" title="AS" src="http://flowingdata.com/wp-content/uploads/2011/05/AS.png" alt="" width="575" height="385" /></p>
<p>Atlantic Southeast lives up to its name.</p>
<p><img class="alignnone size-full wp-image-16411" title="EV" src="http://flowingdata.com/wp-content/uploads/2011/05/EV.png" alt="" width="575" height="385" /></p>
<p>As does Frontier Airlines.</p>
<p><img class="alignnone size-full wp-image-16412" title="F9" src="http://flowingdata.com/wp-content/uploads/2011/05/F9.png" alt="" width="575" height="385" /></p>
<p><img class="alignnone size-full wp-image-16413" title="FL" src="http://flowingdata.com/wp-content/uploads/2011/05/FL.png" alt="" width="575" height="385" /></p>
<p>Hawaiian Airlines looks exactly like you'd expect. They exclusively fly to Raleigh, North Carolina. Ah, I kid. I don't know where they fly.</p>
<p><img class="alignnone size-full wp-image-16414" title="HA" src="http://flowingdata.com/wp-content/uploads/2011/05/HA.png" alt="" width="575" height="385" /></p>
<p>See how these were made and how to apply this to your own data in this <a href="http://flowingdata.com/2011/05/11/how-to-map-connections-with-great-circles/">step-by-step tutorial</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2011/05/05/where-do-major-airlines-fly-in-the-united-states/feed/</wfw:commentRss>
		<slash:comments>27</slash:comments>
		</item>
	</channel>
</rss>

