• Many datasets were analyzed and many charts were made this year. If I liked a project, it was on FlowingData. But only a handful can be the best. These are my favorite data visualization projects from 2024.

  • Fabien Wagner works with CTrees in an effort to map tree canopy height in the Amazon rainforest. In the process, he might have found the largest tree in the Amazon:

    I am currently working on a model to map the tree canopy height of the entire Amazon forest using Planet NICFI images. The model can detect canopy height up to 45-50 meters. By inspecting the results, we can discover the tallest and largest trees in the entire Amazon.

    For this tree, specifically: I had previously searched the data for individual trees with a crown diameter of 50-55 meters, which to my knowledge, is the largest possible crown size observed in the Amazon. After finding a few trees in this range, I realized that, statistically, an even larger tree could exist. That’s when I expanded my search, bringing me to this exceptional specimen.

    Pretty big.

  • Because of a warming planet with more wildfires and hurricanes, it’s growing more common for insurance companies to stop insuring people with existing policies. For the New York Times, Christopher Flavelle, with analysis and graphics by Mira Rojanasakul, describes the coverage shift.

    The consequences could be profound. Without insurance, you can’t get a mortgage; without a mortgage, most Americans can’t buy a home. Communities that are deemed too dangerous to insure face the risk of falling property values, which means less tax revenue for schools, police and other basic services. As insurers pull back, they can destabilize the communities left behind, making their decisions a predictor of the disruption to come.

    Oh good.

  • For the New York Times, Marco Hernandez visualized all the tornadoes in 2024, which amounted to more than the usual.

    Not only were there more tornadoes reported, but 2024 is also on track to be one of the costliest years ever in terms of damage caused by severe storms, according to the National Center for Environmental Information. Severe weather and four tornado outbreaks from April to May in the central and southern United States alone cost $14 billion.

    Oh good.

  • By just about every metric, flights are slower now than they were 30 years ago. However, the percentage of flights that arrive early is higher, based on an analysis by Ben Blatt for NYT’s Upshot.

    Why do today’s flights arrive early more often, even though they’re slower? Airlines have extended their scheduled flight durations even more than the flights have lengthened in actual duration. The average scheduled duration from J.F.K. to Los Angeles has increased 23 minutes since 1995, according to an Upshot analysis of Bureau of Transportation Statistics data.

    Publish a metric that involves money, and companies optimize for it somehow. In this case, the airlines lowered expectations to create a buffer.

  • Christmas is coming. For some, that means snow on the ground, but as areas get less snow than in previous years, that means more regular Christmases than White Christmases in the United States. For Bloomberg, Denise Lu mapped the last time cities had at least one inch of snow fall or accumulation.

    Compared to Christmases since 1970, last year’s holiday had the lowest average snow on the ground across the continental US. The chance for a white Christmas this year may be higher than last year, according to AccuWeather, but many areas will likely still not see any snow.

    Bonus points for the shoveler with a Santa hat that doubles as a progress bar:

    This is in contrast to the Washington Post piece from last week. WaPo used similar data from NOAA but highlighted cumulative snowfall instead of the lack of it.

  • Jordan Cunliffe uses embroidery to explore and visualize data. This year, headed towards Christmas, he’s been posting one stitch per day for a Data Advent Calendar.

  • This is a fun one by Dylan Moriarty for the Washington Post. Punch in your city to see the percentage of Christmas days (or Hanukkah, New Year’s Eve, or Winter Solstice) since 1940 when it snowed.

    It’s nice and corny like it should be, with holiday illustrations and snow falling on the screen. There’s even a data postcard that you can download and share with all your friends and family, because they will most definitely be interested.

    Maybe it’s just the movies telling me so, but there’s something special about snow falling, holiday lights, Christmas music playing in the background, and a warm drink in hand.

  • Researchers at Sony Computer Science Laboratories in Rome estimated the time it takes to walk to places for amenities such as education, healthcare, and restaurants:

    Proximity-based cities have attracted much attention in recent years. The ‘15-minute city’, in particular, heralded a new vision for cities where essential services must be easily accessible. Despite its undoubted merit in stimulating discussion on new organization of cities, the 15-minute city cannot be applicable everywhere, and its very definition raises a few concerns. Here we tackle the feasibility and practicability of the 15-minute city model in many cities worldwide. We provide a worldwide quantification of how close cities are to the ideal of the 15-minute city. To this end, we measure the accessibility times to resources and services, and we reveal strong heterogeneity of accessibility within and across cities, with a pivotal role played by local population densities.

    Check your city on the interactive map. The Washington Post has a version that is easier to use, but it only shows the data for a subset of cities in the United States, whereas the Sony map shows cities around the world with more granular breakdowns.

  • Members Only

    Here’s the good stuff for the month: things to use, things to poke at, and things to learn from.

  • Probability itself is an expression of uncertainty, but the calculations behind the percentages can seem objective. David Spiegelhalter argues that this is impossible:

    My argument is that any practical use of probability involves subjective judgements. This doesn’t mean that I can put any old numbers on my thoughts — I would be proved a poor probability assessor if I claimed with 99.9% certainty that I can fly off my roof, for example. The objective world comes into play when probabilities, and their underlying assumptions, are tested against reality (see ‘How ignorant am I?’); but that doesn’t mean the probabilities themselves are objective.

    Some assumptions that people use to assess probabilities will have stronger justifications than others. If I have examined a coin carefully before it is flipped, and it lands on a hard surface and bounces chaotically, I will feel more justified with my 50–50 judgement than if some shady character pulls out a coin and gives it a few desultory turns. But these same strictures apply anywhere that probabilities are used — including in scientific contexts, in which we might be more naturally convinced of their supposed objectivity.

    Riffing off George E. P. Box’s “All models are wrong, but some are useful”, Spiegelhalter concludes, “In our everyday world, probability probably does not exist — but it is often useful to act as if it does.”

  • Not to dwell on Musk, but I was reminded of this dot plot by the Economist that shows when he posted on Twitter/X and how much. The volume picked up a bit post-acquisition.

  • As you can imagine, Elon Musk, the owner of X, has a prominent influence on the social media platform. For the Washington Post, Jeremy B. Merrill, Trisha Thadani, and Kevin Schaul show the scale through view counts (on the x-axis) and time since posting (on the y-axis).

    I’m not sure how much I trust view count accuracy on X, but you can at least see the relative weights of what the company wants you to see.

    Mostly, I put this here for the flaming data point at the beginning of the piece and to highlight how it reminds me of the interactive by Periscopic a few years back. It starts with a single point for the anecdote and then multiplies for the patterns.

  • Here’s the current landscape of supermarket parent companies and their subsidiaries — national chains, regional, local, co-ops, specialty, ethnic, and discount.

  • There was a brouhaha a couple months ago over research that suggested black plastic spatulas spew poison into our food. The problem is that there was a basic arithmetic mistake that made the results seem a lot worse than they actually were. For National Post, Joseph Brean reports:

    The paper correctly gives the reference dose for BDE-209 as 7,000 nanograms per kilogram of body weight per day, but calculates this into a limit for a 60-kilogram adult of 42,000 nanograms per day. So, as the paper claims, the estimated actual exposure from kitchen utensils of 34,700 nanograms per day is more than 80 per cent of the EPA limit of 42,000.

    That sounds bad. But 60 times 7,000 is not 42,000. It is 420,000. This is what Joe Schwarcz noticed. The estimated exposure is not even a tenth of the reference dose. That does not sound as bad.

    They had a per-unit baseline they could compare against to gauge if their estimates were good, bad, or dangerous. To scale up to typical human body weight, they had to multiply, but they missed a zero somewhere.

    Red flags should’ve flown immediately when they saw 80% of the safe limit. Just a quick double check to see if they made a mistake. The researchers said they made a typo, but I wonder where and when they made that typo. While writing the paper? While calculating? While recording the data?

    The irony is that a bunch of people probably tossed their black kitchenware and will now go buy more plastic after finding out it wasn’t so bad after all. The researchers were trying to communicate the opposite.

  • For Wired, Dhruv Mehrotra and Dell Cameron, in collaboration with Bayerischer Rundfunk and Netzpolitik.org, check out location data for overseas military personnel that is probably too easy to buy. The data from a broker is primarily meant for marketing purposes so that companies can better target ads, but there isn’t exactly a strict vetting process for where the money comes from.

    Sooo, this seems not very good.

  • For Works in Progress, Jared Hutchins explains how milk production increased per cow, a combination of artificial insemination, cryogenic preservation, and more data.

    Note the dual axes in the chart above. The left y-axis shows total dairy cows, and the right shows average yield per cow. The natural next question: Did total milk production also increase with fewer cows? Yes.

    In 1945, there were over 25 million dairy cattle in the United States; by 1980, the number had dropped by more than half to just over ten million. And the dairy industry as a whole became more productive. Between 1940 and 1982, the total supply of milk increased by a third, even as the number of dairy cows in the US halved.

    I’m going to assume that we’d see similar patterns with other livestock and crops. Which has been moneyballed the hardest?

  • Members Only

    Usually we focus on actual data points for insights, but we can also highlight the spaces between for a flipped point of view.

  • For Reuters, Arathy J Aluckal, Jitesh Chowdhury, and Vijdan Mohammad Kawoosa illustrate how the mpox virus spreads, its variants, and who it affects.

    The last chart in the article, shown above, struck me as familiar but took me a second. It’s a pyramid chart that puts women on the left and men on the right. The y-axis represents age groups. You can see the contrast between those infected in the African region (men and women somewhat evenly) and the rest of the world (almost all men). The variant in Africa can spread through close contact.

  • For NBC News, Joe Murphy shows the play rate on Last.fm for Mariah Carey’s “All I Want for Christmas Is You” as a set of calendar heatmaps, since 2006.

    The song was released in October 1994. The data doesn’t go back that far, but I’d wager it played mostly in December, and then plays kept shifting earlier until leaking in to August. I’m looking forward to a couple decades from now when I can remind the young folks that this wasn’t always an Easter song.