• July 7, 2014

    Topic

    Maps  /  ,

    Smarty Pins is a simple, fun map game by Google. You get a trivia clue about some location, and the goal is to drop the pin as close as you can to the correct place. You start with 1,000 miles, and you get docked each time for how far your choice was from the actual location.

    For more on how little you know about where stuff is, see also the state matching game and the Mercator map puzzle.

  • We produce data all the time, everywhere we go, and this process implies something about how we live. Jer Thorp explains in this short explainer video animated by Erica Gorochow.

  • Jaap de Maat, a graduate student at the Royal College of Art, rigged a filing cabinet to follow people around for his final project. It reminds people of the data traces we leave behind. It’s called I know what you did last summer.

    It is physically impossible for the human brain to remember every event from our past in full detail. The default setting is to forget and our memories are constructed based on our current values. In the digital age it has become easier to look back with great accuracy. But this development contains hidden dangers, as those stored recollections can easily be misinterpreted and manipulated. That sobering thought should rule our online behaviour, because the traces we leave behind now will follow us around for ever.

    See more details on Wired.

  • Terry Speed, a emeritus professor in statistics at University of California at Berkeley, gave an excellent talk on how statisticians can play nice with big data and data science. Usually these talks go in the direction of saying data science is statistics. This one is more on the useful, non-snarky side.

  • Jobs and pay can vary a lot depending on where you live, based on 2013 data from the Bureau of Labor Statistics. Here’s an interactive to look.

  • Ditch the computer screen for your data. It’s all about the food. Moritz Stefaner and prozessagenten, process by art and design ran a second round of the Data Cuisine workshop to explore how food can be used as a medium to communicate data. Naturally, you’ve got your basic visual cues, but when you introduce food, you open lots more possibilities.

    [W]e have all kinds of sculptural 3D possibilities. We can work with taste — from the basic tastes of sweet, sour, salty, bitter, umami to complex combinations or hotness. There is texture — immensely important in cooking! Then we have all the cultural connotations of ingredients and dishes (potatoes, caviar, …). We can work with cooking parameters (e.g. baking temperature or duration). Or the temperature of the dish itself, when served!

    The above shows piece of bread shows youth unemployment in Spain. See more data dishes here.

  • Julia Angwin and Jeff Larson for ProPublica made a chart of NSA programs revealed in the past year. Programs were plotted subjectively from foreign to domestic surveillance on the horizontal axis and targeted to bulk surveillance on the vertical. So you get more controversial the further you move up towards the top right corner.

    Interesting stuff.

    The best part though is the goofy program names, as illustrated by Alberto Cairo. ParanoidSmurf and his siblings Nosey, Tracker, and Dreamy; EgotisticalGoat and EgotisticalGiraffe; WillowVixen. First off, who names these programs? And second, how do I get in on the naming action (without becoming creepy)?

  • Animation Domination High-Def has a Captain America video of things that America is not so good at, relative to other countries. And they even cited their data source, the CIA World Factbook. How about that.

  • Media artist Nick Hardeman’s audio visualization app Bass Shapes was rejected by the Mac App Store because “it’s not useful.” So Hardeman released the software as a free OS X download instead. It’s a beauty.

    The app takes in sound input from your microphone or an external audio source through Soundflower (also free), and the visuals come to life. Watching Bass Shapes, you’d swear that you were seeing a custom, hand-drawn animation that served as some kind of old-school-ish intro to an animated film. But you’d be wrong.

    Download Bass Shapes and try it yourself.

  • Mike Bostock, who you might recognize from such things as Data-Driven Documents or the New York Times, writes on the value of visualizing algorithms for entertaining, teaching, learning, and debugging.

    Algorithms are a fascinating use case for visualization. To visualize an algorithm, we don’t merely fit data to a chart; there is no primary dataset. Instead there are logical rules that describe behavior. This may be why algorithm visualizations are so unusual, as designers experiment with novel forms to better communicate. This is reason enough to study them.

    But algorithms are also a reminder that visualization is more than a tool for finding patterns in data. Visualization leverages the human visual system to augment human intellect: we can use it to better understand these important abstract processes, and perhaps other things, too.

    At the very least, you’ll have fun scrolling through the animated visuals that show how various algorithms work, but read the whole thing. It’s good.

  • Watch the notes rain down on those piano keys. Made with MIDItrail, a MIDI player with a 3-d visualization component. The program (and this video) has been around for a while, but it’s new to me.
    Read More

  • Blitzortung is a community of volunteers who install inexpensive lightning sensors and transmit their data to a central server. In return, those who run the sensors have access to the network’s data. The map that runs on the site shows the data in near real-time, providing a view of lightning strikes around the world. Pretty neat that this exists.

  • DataShine Census provides a detailed view into United Kingdom 2011 census data. Population, housing, income, commute, and other variables are available.

    The DataShine mapping platform is an output from an ESRC Future Research Leaders Project entitled “Big Open Data: Mining and Synthesis”. The overall project seeks promote and develop the use of large and open datasets amongst the social science community. A key part of this initiative is the visualisation of these data in new and informative ways to inspire new uses and generate insights. Phase one has been to create the mapping platform with data from the 2011 Census. The next phases will work on important issues such as representing the uncertainty inherent in many population datasets and also developing tools that will enable the synthesis of data across multiple sources.

    They’re off to a good start.

  • Not sure where this is from, but feel that tingle in the back of your head? That’s the feeling of your mind blowing up.

  • Mirador, a collaborative effort led by Andrés Colubri from Fathom Information Design, is a tool that helps you find correlative patterns in datasets with a lot of variables and observations. It’s in the early stages of development, but is available to use and test on Windows and Mac. Colubri explains the process, from its early stages to its current iteration.

    Although fields like Machine Learning and Bayesian Statistics have grown enormously in the past decades and offer techniques that allows the computer to infer predictive models from data, these techniques require careful calibration and overall supervision from the expert users who run these learning and inference algorithms. A key consideration is what variables to include in the inference process, since too few variables might result in a highly-biased model, while too many of them would lead to overfitting and large variance on new data (what is called the bias-variance dilemma.)

    Leaving aside model building, an exploratory overview of the correlations in a dataset is also important in situations where one needs to quickly survey association patterns in order to understand ongoing processes, for example, the spread of an infectious disease or the relationship between individual behaviors and health indicators.

    Download Mirador to try it for yourself.

  • After looking at pizza places, coffee, and grocery stores, I had to look at burger chains across the country. The data was just sitting there.

  • Many of us aren’t aware of how one country compares to others or public policy that has been around for decades. How Wrong You Are is a simple quiz game by Moiz Syed and Juliusz Gonera that tests such knowledge.

    How Wrong You Are is a collection of important questions that people are sometimes misinformed about. We poll you to measure how right — or how wrong — the public is about these important questions.

    Every week, we will add a new question. These are all questions that we hope you already know. But if you don’t, don’t worry! You learned something. Share your results, successful or not. Chances are, if you didn’t know this question, other people might not, either.

    Play the game here. At the very least, you’ll learn something new.

  • While we’re on the topic of NYC taxi data, Eric Fischer for Mapbox mapped all 187 million trips. Each observation contains the start and end location of a trip, so blue dots represent the former and orange represent the latter. My favorite bit is on the data collection artifacts, such as the map above.

    The patterns at JFK and LaGuardia airports show interesting artifacts of the data collection process. Almost all of the trips there must have really begun or ended right at the terminals, but many of them are attributed to the roads leading to and from the airports, where the last good GPS fix must have occurred.

    See also the New York Times animated map from several years ago that shows taxi activity during days of the week.

  • Based on data compiled from a combination of military records, Defense Department records, and drone manufacturers, Emily Chow, Alberto Cuadra and Craig Whitlock for the Washington Post provide a quick view into drone crashes.

    More than 400 large U.S. military drones crashed in major accidents worldwide between Sept. 11, 2001, and December 2013. By reviewing military investigative reports and other records, The Washington Post was able to identify 194 drone crashes that fell into the most severe category: Class A accidents that destroyed the aircraft or caused (under current standards) at least $2 million in damage.

    The top row represents where a drone crashed, the second row who owns it, and the third tells the type. Mouse over any of the tick marks, and you get details for the corresponding crash.