I’m not sure there’s any way to really understand the scale of the largest black holes in the universe, but Kurzgesagt gives it a good try.
-
One of my favorites of the year, Sam Learner’s River Runner shows you a terrain map that lets you place a drop of rain anywhere in the contiguous United States. You’re then taken on a river tour that shows where the drop ends up. Learner just expanded the project to let you drop water anywhere in the world.
-
The Washington Post and ProPublica analyzed Facebook group posts that disputed election results:
To determine the extent of posts attacking Biden’s victory, The Post and ProPublica obtained a unique dataset of 100,000 groups and their posts, along with metadata and images, compiled by CounterAction, a firm that studies online disinformation. The Post and ProPublica used machine learning to narrow that list to 27,000 public groups that showed clear markers of focusing on U.S. politics. Out of the more than 18 million posts in those groups between Election Day and Jan. 6, the analysis searched for words and phrases to identify attacks on the election’s integrity.
The more than 650,000 posts attacking the election — and the 10,000-a-day average — is almost certainly an undercount. The ProPublica-Washington Post analysis examined posts in only a portion of all public groups, and did not include comments, posts in private groups or posts on individuals’ profiles. Only Facebook has access to all the data to calculate the true total — and it hasn’t done so publicly.
Read more about the methodology behind the analysis.
-
May Louise Kelly for NPR spoke briefly with biostatistics professor Natalie Dean on the omicron surge and what we can take away from the data:
Yeah, I mean, the public health impact is made up a lot of different things, and we’re most acutely interested in severe disease and death. But, of course, infections have impacts and we think about the disruption – you know, all the people who are going to need to miss work, including health care professionals and, you know, frontline workers. So the numbers have meaning, but it is a different public health impact when someone is mildly ill or doesn’t even have symptoms than when someone is severely ill.
The numbers are up but they don’t mean the same thing from when the numbers were up last time.
-
Sarah Kliff and Aatish Bhatia for NYT’s The Upshot look at the uncertainty of prenatal tests for rare conditions. For some tests, the results are more often wrong than they are right, which causes issues when expecting parents don’t know that.
Along with square pie charts, the piece goes into more detail with unit charts to explain what the percentages mean from a counts point of view. So if a reader doesn’t quite know what a false positive is before reading, they will have a better idea after.
-
Phew, just made it. These are my favorite data visualization projects from 2021.
Like last year, there were many Covid-related charts on the internets this year. While they are important to gauge the state of things, I found myself veering away from them to focus on other areas. I craved distraction, practical information for the times, and anything outside the bubble.
-
Members Only
-
This is Hallgrímskirkja, a church in Reykjavík, Iceland. It will now also be known as the Church of the 95% Confidence Interval.
The Church of the 95% Confidence Interval #rstats pic.twitter.com/m0gvvu2Dav
— Julien Cloarec (@CloarecJulien) December 5, 2021
-
The David Rumsey Map Center at Stanford houses hundreds of thousands of maps dating back to the 1500s. Andres Picon for San Francisco Chronicle wrote about the collection:
At the heart of that endeavor is the digitization of Rumsey’s vast physical collection, a project he began in the late 1990s when he launched davidrumsey.com, a constantly growing aggregation of about 112,000 digitized historical maps from his personal inventory. Rumsey, 77, is in the process of donating his entire map collection — more than 200,000 physical maps plus the digital ones — to Stanford so that they can be cataloged for the enjoyment of generations to come.
“It’s not only a database; it allows people to get lost inside it, no pun intended,” he said. “If you make it really usable and accessible the way ours is, it just becomes something different.”
For preservation, I wish we saw more of this and less blockchain. Hundreds of years from now, how much visualization work is still viewable?
You can view a large portion of the Rumsey collection here. You can also browse the data visualization tag to see some of the earliest made charts.
-
Justin Sherman for Wired points out the farce that is anonymized data:
Data on hundreds of millions of Americans’ races, genders, ethnicities, religions, sexual orientations, political beliefs, internet searches, drug prescriptions, and GPS location histories (to name a few) are for sale on the open market, and there are far too many advertisers, insurance firms, predatory loan companies, US law enforcement agencies, scammers, and abusive domestic and foreign individuals (to name a few) willing to pay for it. There is virtually no regulation of the data brokerage circus.
-
Denise Lu and Albert Sun for The New York Times show the shifts in Covid-19 deaths among different demographic groups:
The change in death rates among groups is starker by race and ethnicity, and the death rate has risen particularly sharply for middle-aged white people. Covid-19 now accounts for a much larger share of all deaths for that group than it did before vaccines were widely available.
In a series of slope charts, each multiple shows a group, and the background color indicates an increase (red) or a decrease (gray) in deaths among that group.
-
xkcd makes Statistics so fulfilling. George E. P. Box comes to mind.
-
Magnus Carlsen continued to assert his dominance at the World Chess Championship. FiveThirtyEight broke down Carlsen’s dominance in the final match with Ian Nepomniachtchi with a series of difference charts. In the quick view, you see it was either a draw or a Carlsen win over 11 games.
-
Many colleges use virtual proctoring software in an effort to reduce cheating on tests that students take virtually at home. But the software relies on facial recognition and assumptions about the proper testing environment. YR Media breaks down the flaws and even provides a simulation so that you can see what it’s like.
-
Based on data from autonomous sensors floating in the oceans, researchers are able to model the flows and characteristics of ocean currents in more detail than ever before. For The New York Times, Henry Fountain and Jeremy White show how the shifts have unwelled centuries-old water deep in the ocean, which releases carbon into the air.
The scrollytelling format of this piece works well to show sensor estimates over time. You get a sense of the currents without needing to see animated lines.
-
Members Only
When you have many categories, use ridgelines to create an extremely compact visualization where you can easily identify major patterns and outliers. They are especially useful to display surges in mostly flat data series.
-
The COVID-19 Online Visualization Collection is a project to catalog Covid-related graphics across countries, sources, and styles. They call it COVIC for short, which seems like a stretch for an acronym and a confusing way to introduce a project to people. But, it does categorize over 10,000 figures, which could be useful as a reference and historical context.
-
NASA is launching the James Webb Space Telescope on December 22, 2021 with an objective to collect data on light from 13.8 billion light-years away.
Using 3-D models from NASA, Rahul Mukherjee and Lorena Iñiguez Elebee for The Los Angeles Times show how the $10 billion telescope works and how NASA plans to launch the telescope into orbit a million miles from Earth. Nothing to it.
-
Zach Levitt and Bonnie Berkowitz for The Washington Post mapped and animated the natural and weather disasters from 2021. Differing from the 2019 version by Tim Meko, they framed it by month, which let them start with floods in January, through the storms in March, April, and May, to fires in July, up to the tornadoes in December.
It was a rough year for many, only compounded by that virus.
-
Steven L. Franconeri, Lace M. Padilla, Priti Shah, Jeffrey M. Zacks, and Jessica Hullman published in Psychological Science in the Public Interest an expansive review of what researchers know so far about how visualization works:
Effectively designed data visualizations allow viewers to use their powerful visual systems to understand patterns in data across science, education, health, and public policy. But ineffectively designed visualizations can cause confusion, misunderstanding, or even distrust—especially among viewers with low graphical literacy. We review research-backed guidelines for creating effective and intuitive visualizations oriented toward communicating data to students, coworkers, and the general public. We describe how the visual system can quickly extract broad statistics from a display, whereas poorly designed displays can lead to misperceptions and illusions. Extracting global statistics is fast, but comparing between subsets of values is slow. Effective graphics avoid taxing working memory, guide attention, and respect familiar conventions. Data visualizations can play a critical role in teaching and communication, provided that designers tailor those visualizations to their audience.
The paper is free to access.
I’m bookmarking this for later. It’s going to take a while to digest.