  • Distance to Mars

    April 8, 2013 to Infographics by Nathan Yau

    Distance to Mars

    Long distances (and big numbers) can be difficult to grasp. Designers Jesse Williams and David Paliwoda took a stab at it and made it easier to understand the distance to Mars. Simple and totally fun. I'm not sure how accurate the travel time and distance are, but I'm guessing it takes differing orbits into account.

  • A bar chart would be better

    April 8, 2013 to Visualization by Nathan Yau

    There's a strand of the data viz world that argues that everything could be a bar chart. That's possibly true but also possibly a world without joy.

    —Amanda Cox, 2013

    There's a great interview with Amanda Cox from The New York Times on visualization, some of the skills required, and where the field is headed. I like this tidbit on design, which is a contrast to the above:

    Design and typography do matter. It's about hierarchy of information and how people perceive information. Done properly, that clean up work really matters. On the other hand, it's easy to believe that it matters more than it does. If you make a fantastically interesting chart and some poor design decisions, the data will still come through. If you make a bad chart with a beautiful design, what have you done, really?

    Read the whole thing. Thank me later.

  • Wall shelf represents water in snowpack

    April 5, 2013 to Data Art by Nathan Yau

    Snow Water Equivalent Cabinet

    Melting snowpacks feed into streams and rivers and serve as a source of water for nearby communities. The Snow Water Equivalent Cabinet by artist Adrien Segal represents the amount of water in snowpack in Ebbetts Pass, California.

    Each drawer is one year of data for a total of 31 years - 1980 - 2010. The size of the drawer is directly related to the amount of water stored in the snowpack for the given year. Some of the drawers are so shallow that they are barely functional. Wet years have larger drawers.

    I understand the metaphor behind the limited functionality at low water points, but a totally functional version would be a sexy piece in a studio. Snow Water is currently on display at the Richmond Art Center as part of the Innovations in Contemporary Crafts exhibition until June 1. [Thanks, Michael]

  • Introducing Data Points

    April 4, 2013 to Data Points by Nathan Yau

    Whoa, that was fast. Data Points is now available. Thanks to all of you for making this possible.

  • Data Points: Preview

    April 3, 2013 to Data Points by Nathan Yau

    Data Points by Nathan Yau

    This appeared at my door today. It's awesome.

    I suspect those who pre-ordered Data Points (thanks!) should receive their copies soon.

  • Problematic databases used to track employee theft

    April 3, 2013 to Data Sharing by Nathan Yau

    Employee theft accounts for billions of dollars of lost merchandise per year, so it's a huge concern for retailers, but it often goes unreported as a crime. If only there were reference databases where business owners could report offenders and look up potential employees to see if they have ever stolen anything. It turns out there are, but the systems have proved to be problematic.

    "We're not talking about a criminal record, which either is there or is not there — it's an admission statement which is being provided by an employer," said Irv Ackelsberg, a lawyer at Langer, Grogan & Diver who represents Ms. Goode.

    Such statements may contain no outright admission of guilt, like one submitted after Kyra Moore, then a CVS employee, was accused of stealing: "picked up socks left them at the checkout and never came back to buy them," it read. When Ms. Moore later applied for a job at Rite Aid, she was deemed "noncompetitive." She is suing Esteem.

    On paper, the data sounds great for business owners, and keeping such data also seems like a fine business to run. Thefts go down and owners can focus on other aspects of their business. The challenge and complexity come when we remember that people are involved.

  • How a cab driver makes money

    April 3, 2013 to Infographics by Nathan Yau

    Cabbie money

    According to the Bureau of Labor Statistics, cab drivers and chauffeurs make a median salary of $22,400 per year, or $10.79 an hour. (I believe that's not including tips.) Using about three months of fare data from a single driver, Alvin Chang for The Boston Globe showed how a driver makes a living day-to-day.

    Time runs left to right, and each column represents fares collected in a day. A driver starts each day in the red when he or she leases a cab for $125, which includes gas, and then works into the blue.
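    The red-to-blue logic of each column can be sketched in a few lines. This is only an illustration of the arithmetic, not the Globe's code: the $125 lease comes from the post, while the function names and any fare amounts you pass in are hypothetical.

```python
from itertools import accumulate

DAILY_LEASE = 125.0  # lease cost cited in the post; it includes gas


def running_balance(fares, lease=DAILY_LEASE):
    """Return the balance at the start of the day and after each fare.

    The day starts "in the red" at -lease, and each fare is added in order.
    """
    return list(accumulate(fares, initial=-lease))


def break_even_fare_index(fares, lease=DAILY_LEASE):
    """Return the index of the fare that first brings the balance to zero
    or above ("into the blue"), or None if the lease is never covered."""
    balances = running_balance(fares, lease)
    # balances[0] is the starting balance, so balances[i + 1] follows fares[i].
    for i, balance in enumerate(balances[1:]):
        if balance >= 0:
            return i
    return None
```

    A day with no fares at all simply stays at the lease amount below zero, which is how an empty column would read in this sketch.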

    After an animation plays out over a few seconds, you can click to zoom in and see specific fares. I expected to drag left and right once zoomed in, but the chart just zooms back out. I suspect the interaction is mostly there for people on mobile devices. I also wanted to scrub the vertical line that indicates time to see details for spikes or days when no fares were collected.

    So there's still a bit to be desired here, but the data itself is interesting, which makes it worth a look.

  • Vega: A visualization grammar to create without programming

    April 2, 2013 to Software by Nathan Yau

    Population with Vega

    Visualization online can be a challenge if you don't know how to program. Analytics startup Trifacta just lightened the load with Vega, a "visualization grammar" that lets you create and share by editing a JSON file. Check out the demo live editor to see how this works. Select different chart types from the drop down menu on the top left, which you can render in HTML5 Canvas or SVG.

    Of note: Vega is built on top of Data-Driven Documents.

    To get right to the point: Vega is NOT intended as a "replacement" for D3. D3 is intentionally a low-level system. During the early design of D3, we even referred to it as a "visualization kernel" rather than a "toolkit" or "framework". In addition to custom design, D3 is intended as a supporting layer for higher-level visualization tools. Vega is one such tool, and leverages D3 heavily within its implementation.

    Gonna keep an eye on this one.

  • An experimental map service using 3-D data

    April 2, 2013 to Mapping by Nathan Yau

    Stamen Here

    For the past few months, Stamen Design has been working with 3-D data from Nokia's Here. Something pretty came out of the experiment.

    For your viewing, embedding, linking, and otherwise internet-ing pleasure: http://here.stamen.com/ is live today. It uses 3D data from HERE for San Francisco, New York, London, and Berlin to create city-wide 3D browsable maps, and it does this in the browser (though you'll need a WebGL-enabled browser to see it). As in many of our other mapping projects, the urls change dynamically depending on location and other factors, and the data conforms, more or less, to the Tile Map Service specification. What this means, among other things, is that it's not only possible to link to and embed these maps at specific locations and zoom levels, but that it's easy—and as we've seen with Citytracking, easy is good.

    There are a bunch of views to play with, and you should try all of them. My favorites though are the city-planning look in Pinstripe and the glowing aesthetic of the height view.

  • A Survival Guide to Starting and Finishing a PhD

    Tips on making it through, what I would tell my previous self going in, and advice on taking advantage of the unique opportunity that is graduate school.
  • Chartspotting: Coffee graph menu

    March 29, 2013 to Statistical Visualization by Nathan Yau

    Coffee menu

    FlowingData reader Amir sent this along. In lieu of a list of coffee drinks, this place in East London opted for ingredient breakdowns. I'm guessing there's a standard menu outside the frame, because otherwise, coffee neophytes (like me) would have no clue what to do. Anyone care to fill in the blanks?

    Spot any charts in the wild? You should email me a picture.

  • Gun deaths since Sandy Hook

    March 28, 2013 to Mapping by Nathan Yau

    Gun deaths since Sandy Hook

    The shooting at Sandy Hook Elementary School was horrible, but there have been thousands of gun deaths since. Huffington Post is mapping them.

    Circles represent the number of deaths in a city, and the larger a circle the higher the count. A bar chart on the bottom shows the data over time and serves as a navigation device. Click on a day or a location, and the names of victims appear on the right with a link to the related news story.

    See also: Periscopic's work on the topic, which now has filters and is updated in real-time.

    Also: episodes 487 and 488 of This American Life, which focus on Harper High School in Chicago, where gang violence is a daily concern.

  • Metrico, an infographic puzzle game

    March 28, 2013 to Visualization by Nathan Yau

    Metrico is a puzzle action game for PlayStation Vita that centers around charts and graphs. The creators call them infographics, but whatever.

    The idea has been in our heads for a few years, and was born out of noticing how beautiful infographics can look as an art form. It was reinforced by seeing that infographics have become increasingly important in contemporary pop-culture. While they haven’t made their way to videogames yet, we think it’s a place where they can work exceptionally well. This is not just because of their pretty aesthetics as much as it is about actively changing data and how that can be visualized.

    The teaser above shows a guy running on and jumping over bar and line charts, and the last sentence of the paragraph seems to suggest that these things will be based on actual data. I kind of doubt it though.

    How about a SimCity-like game that uses real-time crime, traffic, and government data? Now that'd be something. They already kind of do that for sports games with injuries and starting lineups. [Thanks, Raphael]

  • Forecast: A weather site that’s easier to read

    March 27, 2013 to Online Applications by Nathan Yau

    Forecast

    When you go to one of the major sites to look up the weather, it's often hard to find what you're looking for. The sites feel dated, there isn't much hierarchy to the information, and navigation gets buried in the show-as-much-information-as-possible-on-the-same-page approach. Forecast, a site by the makers of the Dark Sky app, hopes to improve that experience during those times you need more than the highs and lows for the day from the nearest widget.

    When you visit Forecast, you notice a difference right away. There's a map with local, regional, and global views, the temperature in large print on the right, and there are descriptions about what to expect that are easy to understand.

    From there, you get your daily forecasts below the map with details on demand. So you can get a lot of the same information that you get from larger sites, but you don't get hit with a bunch of data at once, and when you request more information, you get it quickly.

    There's also an API. Forecast and the Dark Sky app both run on it, which is the cherry on top of the goodness.

    I usually go to Matthew Ericson's minimalist weather page when I'm figuring out when to ride my bike or mow the lawn. Forecast might be my new weather destination for a while.

  • How to become a password cracker in a day

    March 26, 2013 to Statistics by Nathan Yau

    Deputy editor at Ars Technica Nate Anderson was curious if he could learn to crack passwords in a day. Although there's definitely a difference between advanced and beginner crackers, openly available software and resources make it easy to get started and do some damage.

    After my day-long experiment, I remain unsettled. Password cracking is simply too easy, the tools too sophisticated, the CPUs and GPUs too powerful for me to believe that my own basic attempts at beefing up my passwords are a long-term solution. I've resisted password managers in the past over concerns about storing data in the cloud or about the hassle of syncing with other computers or about accessing passwords from a mobile device or because dropping $50 bucks never felt quite worth it—hacks only happen to other people, right?

    But until other forms of authentication take root, the humble password will form a primary defense of our personal information. The time has come for me to find a better solution to generating, storing, and handling them.

    I use 1Password.

  • March Madness fan map

    March 26, 2013 to Mapping by Nathan Yau

    Along the same lines as their NFL fan maps, Facebook had a closer look at March Madness fandom, based on likes for team pages. In the map below, each county is colored by the conference liked the most.

    March Madness map

  • Every known drone attack in Pakistan

    March 25, 2013 to Visualization by Nathan Yau

    Drone attacks

    It's hard to know the impact of drone attacks as outsiders looking in, because the United States government doesn't disclose the information. Using data maintained by the Bureau of Investigative Journalism, which consists of estimates based on reports from the ground, Pitch Interactive sheds some light on every known drone attack in Pakistan.

    Since 2004, the US has been practicing in a new kind of clandestine military operation. The justification for using drones to take out enemy targets is appealing because it removes the risk of losing American military, it's much cheaper than deploying soldiers, it's politically much easier to maneuver (i.e. flying a drone within Pakistan vs. sending troops) and it keeps the world in the dark about what is actually happening. It takes the conflict out of sight, out of mind. The success rate is extremely low and the cost on civilian lives and the general well-being of the population is very high. This project helps to bring light on the topic of drones. Not to speak for or against, but to inform and to allow you to see for yourself whether you can support drone usage or not.

    Again, these are estimates, so the numbers might be higher or lower, but the point is that these attacks exist, and civilians and children are often involved.

  • Data Points: What it’s like to write a book

    March 25, 2013 to Data Points by Nathan Yau

    Data Points numbers

    As the publication of Data Points nears, I'm excited to hold it in my hands just like I was the first time. It feels weird to say that. In college, a 5-page report seemed like too much to handle, and I would hunt for fonts that took the most space and fiddle with margins to produce more pages, without making it look like I did. I guess a lot can happen in 10 years. Heck, a lot can happen in a few months.

    I think the difference is that now I'm writing about something that's interesting to me — topics that I immerse myself in for fun — which makes the book-writing process fun.

    Sure, it can be challenging at times, but in the best way possible. Here's my experience with Data Points.

  • Odds of a perfect NCAA March Madness bracket

    March 22, 2013 to Statistics by Nathan Yau

    Math professor Jeff Bergen explains the odds of picking a perfect bracket.

    The first probability is based on a 50/50 split of correct picks, which is like using fair coin flips to pick winners. Bergen doesn't really go into how he calculated the second probability, but that smaller number comes from bumping up the probability of picking the right team for each game. I think he's using an average probability of slightly less than 70% (based on simulation results from this old Wall Street Journal column).

    That's why businesses can offer up million dollar prizes. In all likelihood, no one is going to win, which turns out to be a great business model for insurance companies who back these contests:

    If millions of people enter a particular contest, it might seem like the chance of someone winning is suddenly in the realm of possibility. But there's a catch: This scenario assumes everyone maximized their chances by picking mostly favorites, so those with the best shot at winning are likely to have identical entries. These contests generally protect themselves from big losses by stating they'll divvy up the loot if there are multiple perfect brackets.

    These favorable conditions make insuring these prize offers a good business, as the Dallas company SCA Promotions has discovered. SCA, founded by 11-time world bridge champion Robert D. Hamman, has taken on the insurance risk for roughly 50 perfect-bracket prizes -- including a Sporting News offer of $1 million in 2001, according to vice president Chris Hamman, the founder's son. In the 12 years it has been doing so, SCA has never had to pay out a claim.
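    For anyone who wants to poke at the two probabilities discussed above, here is a rough sketch of the arithmetic. It assumes a 64-team bracket with 63 games and treats every pick as independent with the same chance of being right. Both are simplifying assumptions, and the 0.7 used below is only a stand-in for "slightly less than 70%", not necessarily the figure Bergen used.

```python
GAMES_IN_BRACKET = 63  # assumption: a 64-team field produces 63 games


def perfect_bracket_odds(p_correct, games=GAMES_IN_BRACKET):
    """Return N such that the chance of a perfect bracket is 1 in N.

    Assumes each game is picked independently with the same probability
    p_correct of being right, so the chance of a perfect bracket is
    p_correct ** games.
    """
    if not 0 < p_correct <= 1:
        raise ValueError("p_correct must be in the interval (0, 1]")
    return 1.0 / (p_correct ** games)


if __name__ == "__main__":
    # Coin-flip picks versus picks that are right about 70% of the time.
    for p in (0.5, 0.7):
        print(f"p = {p}: about 1 in {perfect_bracket_odds(p):,.0f}")
```

    Even a modest bump in the per-game probability shrinks the odds by many orders of magnitude, though they stay far out of reach for any single entry.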

  • A visualization of pi for high school math students

    March 22, 2013 to Data Art by Nathan Yau

    On Kickstarter: A project that uses a visualization of pi to connect Brooklyn high school students to their community.

    They've already made a histogram of emotions in their school's hallway and a stacked area chart mural at a nearby senior center. Next up is a wall currently covered in graffiti.

    In Math class, students will construct the golden spiral based on the Fibonacci Sequence and begin to explore the relationship between the golden ratio and Pi. The number Pi will be represented in a color-coded graph within the golden spiral. In this, the numbers will be seen as color blocks that vary in size proportionately within the shrinking space of the spiral, allowing us to visualize the shape of Pi and it's negative space.

    Backed.

Unless otherwise noted, graphics and words by me are licensed under Creative Commons BY-NC. Contact original authors for everything else.