• How to Make an Animated Growth Map in R

    Although time series plots and small multiples can go a long way, animation can make your data feel more real and relatable. Here is how to do it in R via the animated GIF route.

  • These days it’s relatively easy to figure out connections between people via email, Twitter, Facebook, etc. However, it’s harder to decipher relationships between people in the 17th century. Researchers at Carnegie Mellon and Georgetown University aim to figure that out in the Six Degrees of Francis Bacon.

    Historians and literary critics have long studied the way that early modern people associated with each other and participated in various kinds of formal and informal groups. Yet their scholarship, published in countless books and articles, is scattered and unsynthesized. By data-mining existing scholarship that describes relationships between early modern persons, documents, and institutions, we have created a unified, systematized representation of the way people in early modern England were connected.

  • The United States Census Bureau just released county-level commute estimates for 2011, based on the American Community Survey (that thing so many people seem to be against).

    About 8.1 percent of U.S. workers have commutes of 60 minutes or longer, 4.3 percent work from home, and nearly 600,000 full-time workers had “megacommutes” of at least 90 minutes and 50 miles. The average one-way daily commute for workers across the country is 25.5 minutes, and one in four commuters leave their county to work.

    The Bureau graphic isn’t very good [PDF], but WNYC plugged the data into a map, which is a lot more informative.

    There’s also a link to download the data on the bottom left of the WNYC map in CSV format, in case you want to try your hand at making a choropleth map. Or you can grab some flow data from the Census Bureau.

  • An old one from xkcd. I’m not sure whether to laugh or cry, but I think he’s implying that people who make graphs on weekends are super dateable.

  • Who’s going to be the next pope? I know all of you are sitting on the edge of your seats. Luckily, an analytical research manager who goes by the name AJ hacked together a pope tracker.

    Despite not being Catholic, the papal election fascinates me. Not sure if it’s the old rituals, the world-wide interest, or simply the fact that the Catholic Church has left a huge mark on history.

    There’s no way I know enough about the inner workings of the Catholic Church to have any idea on who the next Pope may be.

    Since domain knowledge is out, the next best option?

    Follow the money!

    He’s scraping odds of possible candidates becoming pope from a betting site, and the above shows the numbers over time. The odds were bumpy at first, but there seems to be some convergence, and as of this writing, Cardinal Peter Turkson from Ghana is the heavy favorite. [via Revolutions]

  • I’m not into video games, and my experience has been near zero since high school, but I’m excited about SimCity 2013 coming out tomorrow. I think my excitement comes from one part nostalgia and one part GlassBox — the game engine that drives the simulations of the city you build and its citizens:

    All the glowing reviews probably have something to do with interest, too. But that memory of installing SimCity 2000 from two floppy disks in my 486 totally brings back happy thoughts.

    Apparently, the game makers were inspired by Google Maps and information graphics to display the data generated during gameplay. I hope Maxis releases some of that data. It could be fun to compare SimCity demographics to the real world. Then again, who’s going to have time to look at the data, when we’ll be too busy building arcologies?

  • Andrew Leonard for Salon fears what might come of the creative process if movies are based on algorithms and data and that we might turn into puppets.

    For years Netflix has been analyzing what we watched last night to suggest movies or TV shows that we might like to watch tomorrow. Now it is using the same formula to prefabricate its own programming to fit what it thinks we will like. Isn’t the inevitable result of this that the creative impulse gets channeled into a pre-built canal?

    Because tastes never change? We don’t have any choice but to watch what is handed to us? Will creators stop making things that go against the norm? Leonard concludes with us stuck in a trance, in front of our televisions.

    The companies that figure out how to generate intelligence from that data will know more about us than we know ourselves, and will be able to craft techniques that push us toward where they want us to go, rather than where we would go by ourselves if left to our own devices. I’m guessing this will be good for Netflix’s bottom line, but at what point do we go from being happy subscribers, to mindless puppets?

    Again, the assumption is that we have no say in the matter. But when a company or service suggests that we buy or watch something, we don’t have to follow.

    Netflix in particular thrives by providing a service that shows us what they think we might want to watch from a selection of thousands of options. Part of that algorithm depends on our own movie ratings and preferences. If Netflix offers poor suggestions, you can leave the service. Yeah. You can stop paying 8 bucks a month.

    Let’s turn it around. What if Netflix analyzed viewing data not to offer their best viewing suggestions or to make shows and movies that people like but to expand people’s viewing windows? Let’s say that the data shows that you watch a lot of “witty, critically acclaimed comedies”, so Netflix suggests you watch more “romantic dramas” to make you more well-rounded. Are you a mindless puppet if you take the suggestion, even if you end up hating the movie? Are you a mindless puppet if you ignore the suggestion and continue watching what you know you like?

    From the production perspective, it makes sense to try to make something a lot of people like. From the consumer perspective, we still get to decide what we want to spend our money on.

    It’s good to be concerned about how companies use personal data. Data privacy, ownership, and ethics are important issues, but it shouldn’t mean a fear of all things data.

  • From the Winnipeg Sun. Something isn’t right here. [via]

  • StatelyAdd another way to make state-level choropleth maps. Stately, a project by Intridea, allows you to approach state mapping in the browser like you would a font.

    Stately is a symbol font that makes it easy to create a map of the United States using only HTML and CSS. Each state can be styled independently with CSS for making simple visualizations. And since it’s a font, it scales bigger and smaller while staying sharp as a tack.

    The process is fairly straightforward: Link to the Stately stylesheet, add some HTML markup (an unordered list of states) to your page, and then use CSS to color each state. Boom, you’ve got yourself a map.

  • In a follow-up to their map on most used languages in London, James Cheshire and Ed Manley, along with John Barratt, mapped the most commonly used languages in New York, based on the ones used on Twitter.

    English (in grey above) is by far the most popular with Spanish (in blue above) taking the top spot amongst the other language groups. Portuguese and Japanese take third and fourth respectively. Midtown Manhattan and JFK International Airport have, perhaps unsurprisingly, the most linguistically diverse tweets whilst specific languages shine through in places such as Brighton Beach (Russian), the Bronx (Spanish) and towards Newark (Portuguese). You can also spot international clusters on Liberty Island and Ellis Island and if you look carefully the tracks of ferry boats between them.

  • It’s not especially straightforward to know or find out what’s going on with your state’s government. Sites aren’t maintained, are unusable, or just don’t provide much information. Open States, a project by the Sunlight Foundation, aims to change that.

    After more than four years of work from volunteers and a full-time team here at Sunlight we’re immensely proud to launch the full Open States site with searchable legislative data for all 50 states, D.C. and Puerto Rico. Open States is the only comprehensive database of activities from all state capitols that makes it easy to find your state lawmaker, review their votes, search for legislation, track bills and much more.

    Just click on a state or enter an address, and you can quickly get information that’s relevant to where you are. There’s also iPhone and iPad apps if you prefer those, and all the data on the site is accessible via an API or a bulk data dump.
    Read More

  • Cardinal compositeWith Pope Benedict XVI’s resignation, 116 cardinals from various regions have to come a consensus on who will be next. Amanda Cox and Graham Roberts for The New York Times wondered what a composite of all the cardinals might look like, which looks exactly how you might expect the average to look.