<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>FlowingData &#187; Statistics</title>
	<atom:link href="http://flowingdata.com/category/statistics/feed/" rel="self" type="application/rss+xml" />
	<link>http://flowingdata.com</link>
	<description>Strength in Numbers</description>
	<lastBuildDate>Fri, 10 Feb 2012 20:28:38 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<atom:link rel="next" href="http://flowingdata.com/category/statistics/feed/?page=2" />

		<item>
		<title>Why I want to quit cable</title>
		<link>http://flowingdata.com/2012/02/09/why-i-want-to-quit-cable/</link>
		<comments>http://flowingdata.com/2012/02/09/why-i-want-to-quit-cable/#comments</comments>
		<pubDate>Thu, 09 Feb 2012 08:24:47 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Comcast]]></category>
		<category><![CDATA[entertainment]]></category>
		<category><![CDATA[featured]]></category>
		<category><![CDATA[Roku]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=21572</guid>
		<description><![CDATA[<p><a href="http://flowingdata.com/2012/02/09/why-i-want-to-quit-cable/"><img width="625" height="401" src="http://flowingdata.com/wp-content/uploads/2012/02/cutting-cable.png" class="attachment-medium wp-post-image" alt="cutting-cable" title="cutting-cable" /></a></p>There are good reasons to cancel cable, but there were a few channels and programs that kept me on. When you look at it in dollars though, it's hard to justify the value for the cost.]]></description>
			<content:encoded><![CDATA[<p><a href="http://flowingdata.com/2012/02/09/why-i-want-to-quit-cable/"><img width="625" height="401" src="http://flowingdata.com/wp-content/uploads/2012/02/cutting-cable.png" class="attachment-medium wp-post-image" alt="cutting-cable" title="cutting-cable" /></a></p><p>Growing up, most of my friends had cable television, but whenever I brought it up, my parents would always say that I watched enough TV already (which was true). So it was always a treat when we went somewhere like a hotel, where there were more than five channels. I didn't know what any of the shows were, but it sure was fun pressing buttons on the remote. Today, I still don't know what most of the shows are, but the novelty is gone.</p>
<p>Nowadays, I have different choices (and priorities). I can entertain myself online, and services like Netflix and Hulu make that easier. When I do turn on the TV, it's often just for background noise as I cook dinner or do something on the computer.</p>
<p>I almost never watch shows when they actually come on, and I only know the schedules of a few of them. And nowadays, the gift of choice feels more like a waste, as I flip through sixty something channels and see nothing that I want to watch. </p>
<p>The other day I thought to myself, "I'm paying forty bucks per month to watch Groundhog Day. Again." But then I looked at the cable bill that I had not looked at in a year, since it's on auto-pay. I'm paying $64.99 for digital cable from Comcast, plus $15.95 for HD and DVR, and then there's about $5 in taxes and fees. The introductory price ran out long ago. </p>
<p>I could buy an obscene number of tacos from Jack in the Box with that cash.</p>
<p>So I looked into cutting the cord completely. I want to save money, but more importantly, I want to get more of what I want for my money. Toss the channels and shows I don't watch.</p>
<p><img src="http://flowingdata.com/wp-content/uploads/2012/02/taco-comparison.png" alt="" title="taco comparison" width="364" height="283" class="alignright size-full wp-image-21644" />At $85.91 per month for the most basic HD plan from Comcast, that comes in at just over a grand per year. With Netflix and Hulu, it's $15.98 per month, or just under $200 per year. That's a big gap between Comcast and Hulu+Netflix. $839.16, to be exact, which is quite a buffer. </p>
<p>Of course you need a device if you don't already have one to play Netflix and Hulu on your television. A <a href="http://www.amazon.com/Roku-XD-Streaming-Player-1080p/dp/B005CLPP8E/?tag=flowingdata-20">Roku</a> costs between $50 and $100, and an <a href="http://www.apple.com/appletv/">Apple TV</a> is about $100. <strong>Current difference: $739.16.</strong>  </p>
<p>Also, you don't get all you want with just Hulu and Netflix. Personally, I watch basketball when good games are on. The NBA League Pass lets you watch (more) games over broadband though, on a Roku or Apple TV. That's $109 for the season. But, and it's a big one, in-market and nationally broadcast games aren't available via the League Pass. More on this to follow. <strong>Current difference: $630.16.</strong> </p>
<p>Then to get local channels, you can still use an antenna. The bestselling antenna on Amazon is $35.99. <strong>Current difference: $594.17.</strong></p>
<p>Finally, that leaves a healthy amount to buy and rent shows and movies not available on Hulu or Netflix, which you can get on iTunes and Amazon Instant. For example, the pass for this season's <em>How I Met Your Mother</em> is $26.47. You could buy (and own) 22 full seasons of your favorite shows with the available buffer. I'm pretty patient though and don't mind waiting for stuff to become available on Netflix. I just need to be able to watch sports live. My wife has been really into Downton Abbey, and the season pass is $17.04. It's free on PBS, but she usually can't watch it when it airs. <strong>Current difference: $577.13.</strong></p>
<p>After all the additional stuff, that's $577.13. Over 1,000 tacos. </p>
<p>But back to the basketball problem. Since League Pass doesn't get me nationally broadcast games, that means I wouldn't get most of the playoffs on ESPN and TNT. (I suspect the same for hockey, baseball, and football.) That's the most important part of the season, save the finals, which are broadcast on ABC. And my wife really likes HGTV and a handful of reality shows that aren't available on Roku, Apple TV, iTunes, or Amazon. Crud.</p>
<p>By the numbers and tacos, it makes sense to cut the cord. From a perspective of want though, it's harder to let go. It comes down to this: Is a year of a tiny subset of programming on cable and playoff games not available on ABC worth $577.13?</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2012/02/09/why-i-want-to-quit-cable/feed/</wfw:commentRss>
		<slash:comments>60</slash:comments>
		</item>
		<item>
		<title>An action plan for data science, a decade ago</title>
		<link>http://flowingdata.com/2012/02/03/an-action-plan-for-data-science-a-decade-ago/</link>
		<comments>http://flowingdata.com/2012/02/03/an-action-plan-for-data-science-a-decade-ago/#comments</comments>
		<pubDate>Fri, 03 Feb 2012 11:58:20 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Statistics]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[William Cleveland]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=21308</guid>
		<description><![CDATA[Data science has been covered at length during the past couple of years, and we tend to think of it &#8230;]]></description>
			<content:encoded><![CDATA[<p>Data science has been <a href="http://www.dataists.com/2010/09/a-taxonomy-of-data-science/">covered</a> <a href="http://www.drewconway.com/zia/?p=2378">at</a> <a href="http://radar.oreilly.com/2010/06/what-is-data-science.html">length</a> <a href="http://jobs.aol.com/articles/2011/08/10/data-scientist-the-hottest-job-you-havent-heard-of/">during</a> <a href="http://blog.revolutionanalytics.com/2011/09/data-science-a-literature-review.html">the</a> <a href="http://www.dataspora.com/2009/05/sexy-data-geeks/">past</a> <a href="http://www.quora.com/What-is-data-science">couple</a> of years, and we tend to think of it as a field of study just a couple of years older than that. Jeff Hammerbacher and DJ Patil have <a href="http://radar.oreilly.com/2011/09/building-data-science-teams.html">played roles</a> in further propagating the term as an actual profession in roughly the same timespan. So I was surprised to come across this rarely-cited 2001 paper by statistician William Cleveland, <a href="http://cm.bell-labs.com/cm/ms/departments/sia/doc/datascience.pdf">Data Science: An Action Plan for Expanding the Technical Areas of the Field of Statistics</a> [pdf].</p>
<blockquote><p>This document describes a plan to enlarge the major areas of technical work of the ﬁeld of statistics. Because the plan is ambitious and implies substantial change, the altered ﬁeld will be called "data science."</p></blockquote>
<p>For those unfamiliar, Cleveland's work on <a href="http://flowingdata.com/2010/03/20/graphical-perception-learn-the-fundamentals-first/">graphical perception</a> might ring a bell.</p>
<p>The first time I heard "data science" was in 2007 while reading a proposal that my adviser had passed along, outlining an academic program similar to what we think of as data science. Now that I think of it, the proposal probably had a lot of similarities to the program outlined by Cleveland (which I would have signed up for in a heartbeat).</p>
<p>Cleveland outlines six areas and the percentage of focus for each.</p>
<ul>
<li><strong>Multidisciplinary Investigation</strong> (25%) &mdash; collaboration with subject areas</li>
<li><strong>Models and Methods for Data</strong> (20%) &mdash; more traditional applied statistics</li>
<li><strong>Computing with Data</strong> (15%) &mdash; hardware, software, and algorithms</li>
<li><strong>Pedagogy</strong> (15%) &mdash; how to teach the subject</li>
<li><strong>Tool Evaluation</strong> (5%) &mdash; keeping track of new tech</li>
<li><strong>Theory</strong> (20%) &mdash; the math behind the data</li>
</ul>
<p>That sounds like what we associate data science with, but current practitioners focus more on tools and less on pedagogy and theory. Although that's not to say that data scientists today couldn't benefit for more traditional statistical knowledge under their belt. And the same goes for statisticians learning more about how to use the tools available (and how to make the tools themselves).</p>
<p>Cleveland's overall theme is a melting of various fields that obviously fit well together.</p>
<blockquote><p>Computer scientists, waking up to the value of the information stored, processed, and transmitted by today's computing environments, have attempted to ﬁll the void. One current of work is data mining. But the beneﬁt to the data analyst has been limited, because the knowledge among computer scientists about how to think of and approach the analysis of data is limited, just as the knowledge of computing environments by statisticians is limited. A merger of the knowledge bases would produce a powerful force for innovation.</p></blockquote>
<p>Sounds <a href="http://flowingdata.com/2009/06/04/rise-of-the-data-scientist/">familiar</a>. Of course, John Tukey seemed to have the <a href="http://flowingdata.com/2008/01/01/john-tukey-and-the-beginning-of-interactive-graphics/">right idea</a> in the 1970s. In any case, it was refreshing to find this from a statistician over the canned stat scoff that data science <em>is</em> statistics.</p>
<p>[<a href="http://www.stat.purdue.edu/~wsc/papers.html">William Cleveland</a> via @<a href="http://twitter.com/drewconway">drewconway</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2012/02/03/an-action-plan-for-data-science-a-decade-ago/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Challenges measuring crime worldwide</title>
		<link>http://flowingdata.com/2012/02/01/challenges-measuring-crime-worldwide/</link>
		<comments>http://flowingdata.com/2012/02/01/challenges-measuring-crime-worldwide/#comments</comments>
		<pubDate>Wed, 01 Feb 2012 08:52:43 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Statistics]]></category>
		<category><![CDATA[crime]]></category>
		<category><![CDATA[estimation]]></category>
		<category><![CDATA[measurement]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=21284</guid>
		<description><![CDATA[You would think that something so concrete, carefully recorded by authorities, wouldn't be too tough to tabulate, even if at &#8230;]]></description>
			<content:encoded><![CDATA[<p>You would think that something so concrete, carefully recorded by authorities, wouldn't be too tough to tabulate, even if at a large scale. <a href="http://online.wsj.com/article/SB10001424052970204661604577185122672243992.html">Not so</a>.</p>
<blockquote><p>Homicide is a "serious crime that many people are concerned with, it is well-measured, and it is to a large degree well-reported and -recorded," says Alfred Blumstein, a criminologist at Carnegie Mellon University. "That is not to say that there aren't a variety of ways for fudging the measurement."</p>
<p>Among the factors that cloud homicide numbers: gaps between police-reported numbers and counts by public-health organizations. The discrepancy is wide in many African countries and some Caribbean ones. The United Nations attributes the disparity to several factors, including definitional differences—whether honor killings should count—a lack of public-health infrastructure in some countries, and undercounting—possibly deliberate—by police.</p></blockquote>
<p>I think this is something the common public often doesn't understand about data. The numbers are entered and analyzed on a computer, so it's easy to mistake data for mechanical output. It must be accurate, right? That's usually not the case though, especially when it comes to data collection outside a controlled lab setting. </p>
<p>The game always changes when humans are involved. Not everyone responds to surveys, definitions of events vary across organizations, estimation methods change every year, and the list goes on.</p>
<p>For those who do stuff with data, you have to deal with that uncertainty, and as data consumers, you have to remember that numbers don't automatically mean fact.</p>
<p>[<a href="http://online.wsj.com/article/SB10001424052970204661604577185122672243992.html">Wall Street Journal</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2012/02/01/challenges-measuring-crime-worldwide/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Texting on the toilet</title>
		<link>http://flowingdata.com/2012/01/30/texting-on-the-toilet/</link>
		<comments>http://flowingdata.com/2012/01/30/texting-on-the-toilet/#comments</comments>
		<pubDate>Tue, 31 Jan 2012 07:04:56 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Data Sources]]></category>
		<category><![CDATA[texting]]></category>
		<category><![CDATA[toilet]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=21361</guid>
		<description><![CDATA[I thought this riveting post on the New York Times Bits blog about the rise of the toilet texter deserved &#8230;]]></description>
			<content:encoded><![CDATA[<p><img src="http://flowingdata.com/wp-content/uploads/2012/01/toilet-texting.png" alt="" title="toilet-texting" width="399" height="425" class="alignright size-full wp-image-21362" />I thought this riveting post on the New York Times Bits blog about the <a href="http://bits.blogs.nytimes.com/2012/01/30/the-rise-of-the-toilet-texter/">rise of the toilet texter</a> deserved a graphic. Since their graphics department is no doubt busy with elections, I took the liberty. I am &mdash; the 91 percent.</p>
<p>I got the numbers straight from the Bits post, but you can download the full report <a href="http://www.11mark.com/IT-in-the-Toilet">from 11mark</a> for all the demographics. You have to register though, and I didn't want to be the guy who creates an online account to just read a report on what people do while they make dooty. I have <a href="http://flowingdata.com/2010/05/23/data-underload-21-exit-strategy/">standards</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2012/01/30/texting-on-the-toilet/feed/</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>More people want to learn statistics</title>
		<link>http://flowingdata.com/2012/01/27/more-people-want-to-learn-statistics/</link>
		<comments>http://flowingdata.com/2012/01/27/more-people-want-to-learn-statistics/#comments</comments>
		<pubDate>Fri, 27 Jan 2012 10:16:34 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Statistics]]></category>
		<category><![CDATA[education]]></category>
		<category><![CDATA[nytimes]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=21229</guid>
		<description><![CDATA[Data is hot right now, so as you would expect, more people are signing up and applying to learn about &#8230;]]></description>
			<content:encoded><![CDATA[<p>Data is hot right now, so as you would expect, more people are <a href="http://bits.blogs.nytimes.com/2012/01/26/what-are-the-odds-that-stats-would-get-this-popular/">signing up and applying to learn about it</a>. Quentin Hardy for <em>The New York Times</em> reports.</p>
<blockquote><p>At North Carolina State, an advanced analytics program lasting 10 months has, since its founding in 2006, placed over 90 percent of its students annually. The average graduate’s starting salary for an entry-level job is $73,000. Its current class of 40 students had 185 applicants, and next year’s applications are already twice that. In 2009, Harvard awarded four undergraduate degrees in statistics. Two graduates went into finance, one to political polling and one became a substitute teacher. There were nine graduates in 2010, 13 last year. They headed into Google, biosciences and Wall Street, as well as Stanford's literature department.</p></blockquote>
<p>And in 2011, just about everywhere.</p>
<p>[<a href="http://bits.blogs.nytimes.com/2012/01/26/what-are-the-odds-that-stats-would-get-this-popular/">New York Times</a> via @<a href="http://twitter.com/jsteeleeditor">jsteeleeditor</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2012/01/27/more-people-want-to-learn-statistics/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>The Fixie Bike Index and hipsters</title>
		<link>http://flowingdata.com/2012/01/27/the-fixie-bike-index-and-hipsters/</link>
		<comments>http://flowingdata.com/2012/01/27/the-fixie-bike-index-and-hipsters/#comments</comments>
		<pubDate>Fri, 27 Jan 2012 08:50:39 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Statistics]]></category>
		<category><![CDATA[fixie]]></category>
		<category><![CDATA[hipster]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=21071</guid>
		<description><![CDATA[<p><a href="http://flowingdata.com/2012/01/27/the-fixie-bike-index-and-hipsters/"><img width="604" height="395" src="http://flowingdata.com/wp-content/uploads/2012/01/Hipster-places-in-America.png" class="attachment-medium wp-post-image" alt="Hipster places in America" title="Hipster places in America" /></a></p>Priceonomics takes the association of fixie bikes to hipsters, and creates the Fixie Bike Index. After starting with New York, &#8230;]]></description>
			<content:encoded><![CDATA[<p><a href="http://flowingdata.com/2012/01/27/the-fixie-bike-index-and-hipsters/"><img width="604" height="395" src="http://flowingdata.com/wp-content/uploads/2012/01/Hipster-places-in-America.png" class="attachment-medium wp-post-image" alt="Hipster places in America" title="Hipster places in America" /></a></p><p>Priceonomics takes the association of fixie bikes to hipsters, and creates the <a href="http://blog.priceonomics.com/post/16013457968/the-fixie-bike-index">Fixie Bike Index</a>. After starting with New York, they branch out to national numbers.</p>
<blockquote><p>In short, fixed gear bikes = hipsters, and New York boroughs that have more fixies per capita should have more hipsters per capita. We sampled our data to see the number of used bikes for sale per capita in each borough with the term "fixie" or "fixed gear" in the product title to create the Fixie Index.</p></blockquote>
<p>I don't know about these numbers. I lived in Modesto for a year and don't remember people riding bikes &mdash; or hipsters, and riding your bike in Los Angeles kind of sucks.</p>
<p>[<a href="http://blog.priceonomics.com/post/16013457968/the-fixie-bike-index">Priceonomics</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2012/01/27/the-fixie-bike-index-and-hipsters/feed/</wfw:commentRss>
		<slash:comments>18</slash:comments>
		</item>
		<item>
		<title>Social network analysis used to convict slumlords</title>
		<link>http://flowingdata.com/2012/01/19/social-network-analysis-used-to-convict-slumlords/</link>
		<comments>http://flowingdata.com/2012/01/19/social-network-analysis-used-to-convict-slumlords/#comments</comments>
		<pubDate>Thu, 19 Jan 2012 08:27:56 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Statistics]]></category>
		<category><![CDATA[crime]]></category>
		<category><![CDATA[law]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=20953</guid>
		<description><![CDATA[<p><a href="http://flowingdata.com/2012/01/19/social-network-analysis-used-to-convict-slumlords/"><img width="625" height="312" src="http://flowingdata.com/wp-content/uploads/2012/01/social-network-analysis-625x312.gif" class="attachment-medium wp-post-image" alt="social network analysis" title="social network analysis" /></a></p>In working with tenants to help their city attorney convict a group of slumlords, an economic justice organization collected public &#8230;]]></description>
			<content:encoded><![CDATA[<p><a href="http://flowingdata.com/2012/01/19/social-network-analysis-used-to-convict-slumlords/"><img width="625" height="312" src="http://flowingdata.com/wp-content/uploads/2012/01/social-network-analysis-625x312.gif" class="attachment-medium wp-post-image" alt="social network analysis" title="social network analysis" /></a></p><p>In working with tenants to help their city attorney convict a group of slumlords, an economic justice organization collected public data on housing violations that were going unfixed. They tried standard mind mapping and organization software, but the relationships were too complex to unearth anything useful. So <a href="http://www.orgnet.com/slumlords.html">they eventually used social network analysis</a>, revealing money exchanging hands in such a way that allowed owners to strip the value from buildings without actually fixing them.</p>
<p>The analysis results, combined with the city's investigation, allowed key convictions and court-awarded finances for tenants to move elsewhere.</p>
<p>Sounds like a good reason for <a href="http://datawithoutborders.cc/">Data Without Borders</a>. </p>
<p>[<a href="http://www.orgnet.com/slumlords.html">Valdis Krebs</a> via <a href="http://kottke.org/12/01/taking-down-slumlords-with-social-network-analysis">kottke</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2012/01/19/social-network-analysis-used-to-convict-slumlords/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Lego mathematics and growing complexity in networks</title>
		<link>http://flowingdata.com/2012/01/12/lego-mathematics-and-growing-complexity-in-networks/</link>
		<comments>http://flowingdata.com/2012/01/12/lego-mathematics-and-growing-complexity-in-networks/#comments</comments>
		<pubDate>Thu, 12 Jan 2012 08:03:24 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Lego]]></category>
		<category><![CDATA[networks]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=20786</guid>
		<description><![CDATA[<p><a href="http://flowingdata.com/2012/01/12/lego-mathematics-and-growing-complexity-in-networks/"><img width="539" height="378" src="http://flowingdata.com/wp-content/uploads/2012/01/lego-curve.png" class="attachment-medium wp-post-image" alt="lego curve" title="lego curve" /></a></p>Legos are the best toys ever invented. That's indisputable fact. So it's no surprise that Mark Changizi et al. at &#8230;]]></description>
			<content:encoded><![CDATA[<p><a href="http://flowingdata.com/2012/01/12/lego-mathematics-and-growing-complexity-in-networks/"><img width="539" height="378" src="http://flowingdata.com/wp-content/uploads/2012/01/lego-curve.png" class="attachment-medium wp-post-image" alt="lego curve" title="lego curve" /></a></p><p>Legos are the best toys ever invented. That's indisputable fact. So it's no surprise that Mark Changizi et al. at Duke University used the toys in their <a href="http://www.wired.com/wiredscience/2012/01/the-mathematics-of-lego/">study of growing complexity of systems and networks</a>. They looked at 389 Lego sets and compared the number of pieces in the set to the number of piece types, as shown above.</p>
<blockquote><p>This curve demonstrates that as the number of pieces in a set grows, so do the number of piece types. However, the number of piece types grows <em>sublinearly</em>: while a larger set uses more piece types, as sets becomes larger, they use progressively fewer additional piece types (so larger sets actually use fewer types per piece). This is similar to other sublinear curves, where larger animals use less energy per cell for metabolism or larger cities actually need fewer gas stations per capita. Essentially, larger sets become more efficient, using the same pieces that smaller sets do, but in a more complex and diverse way.</p></blockquote>
<p>Now to more important matters... look at this cool thing I made with the Legos my wife got me for Christmas. It's a spaceship airplane race car. Despite leaps in technology from now to the Lego man's current time, he still has to talk into a walkie talkie that is bigger than his head.</p>
<p><img src="http://flowingdata.com/wp-content/uploads/2012/01/2012-01-09_17-21-23_741-625x352.jpg" alt="" title="Lego racecar spaceship airplane" width="625" height="352" class="alignnone size-medium wp-image-20787" /></p>
<p>This just in: While the number of Lego piece types increases with larger sets, the maturity of the Lego users exponentially decreases.</p>
<p>[<a href="http://www.wired.com/wiredscience/2012/01/the-mathematics-of-lego/">Wired</a> via @<a href="https://twitter.com/#!/ajturner/status/156405587498909698">ajturner</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2012/01/12/lego-mathematics-and-growing-complexity-in-networks/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Predicting the future of prediction</title>
		<link>http://flowingdata.com/2012/01/09/predicting-the-future-of-prediction/</link>
		<comments>http://flowingdata.com/2012/01/09/predicting-the-future-of-prediction/#comments</comments>
		<pubDate>Mon, 09 Jan 2012 08:35:40 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Statistics]]></category>
		<category><![CDATA[Boston]]></category>
		<category><![CDATA[prediction]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=20726</guid>
		<description><![CDATA[Tarot cards don't cut it anymore as a predictors. We turn to data for a look to the future: "We're &#8230;]]></description>
			<content:encoded><![CDATA[<p>Tarot cards don't cut it anymore as a predictors. We turn to <a href="http://articles.boston.com/2012-01-01/ideas/30575781_1_prediction-tarot-sheep">data for a look to the future</a>: </p>
<blockquote><p>"We're finally in a position where people volunteer information about their specific activities, often their location, who they're with, what they're doing, how they're feeling about what they're doing, what they're talking about," said Johan Bollen, a professor at the School of Informatics and Computing at Indiana University Bloomington who developed a way to predict the ups and downs of the stock market based on Twitter activity. "We've never had data like that before, at least not at that level of granularity." Bollen added: "Right now it’s a gold rush."</p></blockquote>
<p>Or you could just get yourself a <a href="http://www.thinkgeek.com/geektoys/plush/9fc6/">flux capacitor</a> and save yourself some time.</p>
<p>[<a href="http://articles.boston.com/2012-01-01/ideas/30575781_1_prediction-tarot-sheep">Boston</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2012/01/09/predicting-the-future-of-prediction/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Teamwork and collaboration that built Watson</title>
		<link>http://flowingdata.com/2012/01/08/teamwork-and-collaboration-that-built-watson/</link>
		<comments>http://flowingdata.com/2012/01/08/teamwork-and-collaboration-that-built-watson/#comments</comments>
		<pubDate>Sun, 08 Jan 2012 21:30:50 +0000</pubDate>
		<dc:creator>Nathan Yau</dc:creator>
				<category><![CDATA[Statistics]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[teamwork]]></category>
		<category><![CDATA[Watson]]></category>

		<guid isPermaLink="false">http://flowingdata.com/?p=20711</guid>
		<description><![CDATA[Team lead, David Ferrucci, recalls the early days of putting together the team that built Watson: Likewise, the scientists would &#8230;]]></description>
			<content:encoded><![CDATA[<p>Team lead, David Ferrucci, recalls the early days of <a href="http://www.nytimes.com/2012/01/08/jobs/building-the-watson-team-of-scientists.html">putting together the team that built Watson</a>:</p>
<blockquote><p>Likewise, the scientists would have to reject an ego-driven perspective and embrace the distributed intelligence that the project demanded. Some were still looking for that silver bullet that they might find all by themselves. But that represented the antithesis of how we would ultimately succeed. We learned to depend on a philosophy that embraced multiple tracks, each contributing relatively small increments to the success of the project.</p></blockquote>
<p>As I sit here reading about egos within IBM, with the NFL playoffs in front of me, I can't help but smirk.</p>
<p>[<a href="http://www.nytimes.com/2012/01/08/jobs/building-the-watson-team-of-scientists.html">New York Times</a> via <a href="http://simplystatistics.tumblr.com/post/15483885506/building-the-team-that-built-watson">Simply Statistics</a>]</p>
]]></content:encoded>
			<wfw:commentRss>http://flowingdata.com/2012/01/08/teamwork-and-collaboration-that-built-watson/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

