Food trucks are the real deal these days. The best ones serve a specialized menu really well, in a small, focused space. The Washington Post delves into the insides of several of these trucks and how they make the food with very specific equipment.
-
Educate Your Child by Gabrielle LaMarr LeMee uses census data and the school selection process to simulate the steps you might take in choosing your kid’s first school in Chicago.
The Chicago public school system has a high level of school segregation as a result of parent’ residential and school choices as well as policy decisions that do not encourage integrated neighborhoods and schools.
In this game, you are a parent of a 5-year-old child and now you have to make some decisions. Explore how your choices can have an impact on your child’s education and on the overall education of the city’s children.
There should be more games like this based on census data. It seems to be a good way for an individual to latch on to data points while still getting a view of the grand scheme of things.
See also more on LeMee’s design for details on modeling school choice.
-
Imagine that those with immigrants in their family tree left the country. Almost everyone, basically.
-
Adam Pearce for The New York Times describes the sad state of affairs that is the delayed subway trains in New York. One delay causes a ripple effect down the line, leaving little chance to get back on track. The more straightforward figures gear you up for the overall view at the end.
This was for New York specifically but is applicable to other transits and forms of transportation. See also the traffic gridlock simulation from a few years ago. It doesn’t take much for gridlock.
-
Financial Times recently updated their style guide:
data — the rule for always using data as plural has been relaxed. If you read data as singular then write it as such. For example, we already allow singular for ‘big data’. And we should for personal data too. An easy rule would be that if it can be used as a synonym for information then it should probably be singular — and if we are using it as economic data and mean figures, then we should stick to plural.
And for kicks, I dug up my New York Times style guide from 1999:
data is acceptable as a singular term for information: The data was persuasive. In its traditional sense, meaning a collection of facts and figures, the noun can still be plural: They tabulate the data, which arrive from bookstores nationwide. (In this sense, the singular is datum, a word both stilted and deservedly obscure.)
Data are sounds weird to me most of the time. When I say it like that, I feel like I should also drink a cup of tea with my pinky sticking out and a monocle firmly planted for distinction.
-
CBS News picked up four used photocopiers and looked at the hard drives. There was a lot of private information stored in them:
Nearly every digital copier built since 2002 contains a hard drive – like the one on your personal computer – storing an image of every document copied, scanned, or emailed by the machine.
In the process, it’s turned an office staple into a digital time-bomb packed with highly-personal or sensitive data.
If you’re in the identity theft business it seems this would be a pot of gold.
“The type of information we see on these machines with the social security numbers, birth certificates, bank records, income tax forms,” John Juntunen said, “that information would be very valuable.”
Okay, save the dramatics, it’s still disconcerting.
Not every photocopier makes it so easy to access copied documents, but it’s surprising that it’s still so straightforward with some machines these days. Then again, part of the responsibility belongs to the previous owners. As the Federal Trade Commission instructs, it’s like getting rid of a computer.
-
Simone Giertz, bringer of joy and self-described expert in shitty robots, makes machines that succeed in failing. In her TED talk, Giertz talks about her path from “useless” things to expert. It’s all the more relevant after she found out she has a brain tumor.
[arve url=”https://www.youtube.com/watch?v=c0bsKc4tiuY” /] Read More
-
Eric Rodenbeck from Stamen Design discusses visualization the medium over visualization the tool or the insight-providing image:
Dataviz! Data visualization! I don’t think it’s for anything! I don’t believe it’s meaningful to say that dataviz is for one thing, any more than it’s meaningful to say that architecture is for any one thing. Or that photography is for one thing, that it has a purpose that can be defined in a sentence or two. Or that movies are for one thing, one that you could win an argument about.
Yes.
See also Martin Wattenberg and Fernanda Viegas’ talk from a while back on the parallels between books and visualization.
-
The 2020 Census is coming up quick, but there’s still a lot up in the air. There’s no director, the bureau has to adjust to budget cuts, and a new digital system that promises to save money hasn’t been fully tested (because of lower funding). Exciting. Alvin Chang for Vox explains in more detail — with cartoons.
-
Most of us don’t read the terms and conditions before we click on “I agree” for the web services we use. They’re too long, and we need likes right away. For a student project, Dima Yarovinsky printed the terms and conditions on paper for major social apps — WhatsApp, Google, Tinder, Twitter, Facebook, Snapchat, and Instagram, respectively — which highlights what we’re getting into. [via @hailmika]
-
3-D Printing: How to Prepare the Data in R
Moving your data from the digital screen to something more physical isn’t as tricky as it seems. Here’s how I did it.
-
By Neil Freeman, the @everytract bot on Twitter, as the name suggests, is tweeting a map of every Census tract in numerical order. It’s one map each half hour.
Census data, or data in general really, is typically in aggregate or about the overall trends, which requires an abstract view of a bunch of data points pushed together. So it’s nice to see a straightforward project put focus on the individual.
Of this genre, the censusAmericans bot is my favorite. It tweets people’s biographies based on data from the American Community Survey.
-
Kit Chellel for Bloomberg tells the riveting gambling story of Bill Benter, who used statistics to model horse-racing in Japan. My favorite part is the pre-Internet process Benter took to collect data and predict results:
Benter’s model required his undivided attention. It monitored only about 20 inputs—just a fraction of the infinite factors that influence a horse’s performance, from wind speed to what it ate for breakfast. In pursuit of mathematical perfection, he became convinced that horses raced differently according to temperature, and when he learned that British meteorologists kept an archive of Hong Kong weather data in southwest England, he traveled there by plane and rail. A bemused archivist led him to a dusty library basement, where Benter copied years of figures into his notebook. When he got back to Hong Kong, he entered the data into his computers—and found it had no effect whatsoever on race outcomes. Such was the scientific process.
As I said with the lottery–hacking stories: I need to gamble more. This is statistics’ true purpose, right?
-
Roger Peng discusses the importance of managing the relationships between people — analyst, patron, subject matter expert, and audience — for a successful analysis:
Human relationships are unstable, unpredictable, and inconsistent. Algorithms and statistical tools are predictable and in some cases, optimal. But for whatever reason, we have not yet been able to completely characterize all of the elements that make a successful data analysis in a “machine readable” format. We haven’t developed the “institutions” of data analysis that can operate without needing the involvement of specific individuals. Therefore, because we have not yet figured out a perfect model for human behavior, data analysis will have to be done by humans for just a bit longer.
Whenever someone touts a tool for “automatic insights”, whether it be in analysis or chart generation, something like this comes to mind.
-
Aaron Williams and Armand Emamdjomeh for The Washington Post delve into diversity and segregation in the United States. The boiling pot continues to get more ingredients, but they’re not mixing evenly.
Some 50 years ago, policies like the Fair Housing Act and Voting Rights Act were enacted to increase integration, promote equity, combat discrimination and dismantle the lingering legacy of Jim Crow laws. But a Post analysis shows that some cities remain deeply segregated — even as the country itself becomes more diverse.
I like how you can easily toggle between diversity and segregation. It allows for a quick comparison of metrics that aren’t always clear-cut.
Scroll to the end to see how diversity and segregation compare in your area.
-
You’ve probably heard of the wisdom of crowds. The general idea, popularized by James Surowiecki’s book, is that a large group of non-experts can solve problems collectively better than a single expert. As you can imagine, there are a lot of subtleties and complexities to this idea. Nicky Case helps you understand with a game.
Draw networks, run simulations, and learn in the process. The game takes about a half an hour, so set aside some time to play it through.
-
We almost always look at data through a screen. It’s quick and good for exploration. So is there value in making data physical? I played around with a 3-D printer to find out.
-
Jonathan Corum, the Science graphics editor at The New York Times, talks about his experiences communicating scientific research to the public. Much of visualization design is about figuring out the audience and making graphics for that audience, so Corum uses a lot of examples that start from technical research papers and finish with a more focused result.
-
Taylor Baldwin mapped all of the buildings in Manhattan using a 3-D layout. Rotate, zoom, and pan, and be sure to mess around with the parameters in the control panel for different looks. Also make sure you try it in Chrome, because it’ll probably send your computer fan whirling.
-
[arve url=”https://www.youtube.com/watch?v=bPZFQ6i759g” /]
With Numberphile, Lisa Goldberg discusses her research with Alon Daks and Nishant Desai at the University of California, Berkeley on the hot hand in basketball. When a player is hitting shots, is he more likely to hit the next one? The experiment results suggest that the hot hand is actually just randomness.
That said, there are other points of view on this topic.
As a statistician, I don’t think the hot hand exists mathematically, but as a sports fan, I’m more than happy to ride the wave of excitement.