FlowingData

Members Only

Tutorials / R

How to Make a Heatmap with Irregular Bins in R

There are existing functions and packages to make heatmaps in R, but when the data is irregular, it’s worth going custom.

Read More
Real space location of Star Trek and other science fiction

October 11, 2023

Topic
Maps / fiction, Overview Effect, space, Star Trek

Star Trek and related might be fictional, but they usually reference real stars, planets, and galaxies. Overview Effect mapped, charted, and spreadsheeted the possible real-life locations in space of the fictional places.
Nobel Prize for research in global labor markets, using historical data

October 10, 2023

Topic
Statistics / Claudia Goldin, economics, gender, Nobel Prize, work

Claudia Goldin, an economist at Harvard, has won the Nobel Prize in Economics. A big part of her studies are rooted in the collection and analysis of centuries-old data:

Women are vastly underrepresented in the global labour market and, when they work, they earn less than men. Claudia Goldin has trawled the archives and collected over 200 years of data from the US, allowing her to demonstrate how and why gender differences in earnings and employment rates have changed over time.

Goldin showed that female participation in the labour market did not have an upward trend over this entire period, but instead forms a U-shaped curve. The participation of married women decreased with the transition from an agrarian to an industrial society in the early nineteenth century, but then started to increase with the growth of the service sector in the early twentieth century. Goldin explained this pattern as the result of structural change and evolving social norms regarding women’s responsibilities for home and family.

Amazing.

The illustrations by Johan Jarnestad that accompany the announcement are also really useful.
Mapping Israel and Gaza attacks

October 9, 2023

Topic
Maps / Gaza, missile, New York Times

The New York Times is mapping rocket strikes, attacks, and ongoing conflict in the region.
News organizations blocking OpenAI

October 9, 2023

Topic
Data Sharing / news, OpenAI, scraping

Ben Welsh has a running list of the news organizations blocking OpenAI crawlers:

In total, 532 of 1,147 news publishers surveyed by the homepages.news archive have instructed OpenAI, Google AI or the non-profit Common Crawl to stop scanning their sites, which amounts to 46.4% of the sample.

The three organizations systematically crawl web sites to gather the information that fuels generative chatbots like OpenAI’s ChatGPT and Google’s Bard. Publishers can request that their content be excluded by opting out via the robots.txt convention.

On the web, it used to be that you would write or make something and there would be a link to the thing. Other websites could link to the thing, and people would go to the place with the thing. With this recent AI wave, a lot of the thing ends up elsewhere and no one sees the original place.

Fun times ahead.
Following the path of the 2024 solar eclipse

October 6, 2023

Topic
Maps / Andy Woodruff, eclipse

There is going to be a solar eclipse on April 8, 2024. The moon will only partially block out the sun in most areas (if at all), but for a select few in the right path, it’ll go all dark for a few minutes. Andy Woodruff mapped the path of full eclipse-ness.
Members Only

Anti-Aggregate

October 5, 2023

Topic
The Process / disaggregation

Look at individual data points separately and get more out of the small pictures.
Tree rings to compare life expectancy in your state

October 5, 2023

Topic
Infographics / mortality, Washington Post

The Washington Post goes with a tree ring metaphor to compare life expectancy in your state. Enter your sex, age, and state. The inner white circles represent how old you are, the middle yellow circles represent how many expected years you have left, and the outer red circles represent the expected years of those with the same age and sex but in Japan.

The rings are a lead-in graphic to more statistical charts. I kind of wish they went all in with the rings, but that’d probably be limiting in the points they could get across.
Data Underload / scale, sports

Scale of Sports Fields and Courts

Here are the major sports drawn at the same scale to help appreciate the differences between field and court sizes.

Read More
Billion dollar natural disasters

October 3, 2023

Topic
Statistical Visualization / Bloomberg, budget, disaster, government

For Bloomberg, Rachael Dottle and Leslie Kaufman go with the combo stacked area chart and stacked bar chart on the top and bottom to show increased cost of billion dollar disasters and the counts over time. It looks like the sporadic tropical cyclones are causing the most damage.
Visual explanation of menstrual cycle length and variability

October 2, 2023

Topic
Infographics / menstrual cycle

For the Apple Women’s Health Study, which uses cycle tracking data from iPhones and Apple Watches, the Harvard T.H. Chan School of Public Health provides a visual explanation of how menstrual cycles vary. With some tweaks, it could also stand in as an explainer for distributions and averages.
Data Underload / mortality

How Cause of Death Shifted with Age and Time in America

As we get older, our life expectancy declines. But when and how quickly the decline happens and how it happens has changed over the years.

Read More
Members Only

Visualization Tools and Learning Resources, September 2023 Roundup

September 28, 2023

Topic
The Process / roundup

Every month I collect tools and resources to help you make better charts. Here’s the good stuff for September.
Search the text in historical maps

September 28, 2023

Topic
Maps / David Rumsey, search, text

The David Rumsey Map Collection has been home to tens of thousands of historical maps, and now you can search the collection by the text in the maps instead of just through metadata:

About 57,000 of the georeferenced maps from the collection have been processed with a machine learning tool called mapKurator to collect all the text on maps, i.e. every piece of text (printed or handwritten) that appears on a map, including place names, but also information like a map’s title or its scale, or the names of the people involved in producing the map. Through this process of automatic annotation, this approach turns the text on maps into structured data (text on maps data): the enormous amount of data that has been collected from these maps is now searchable.
Loneliness, life satisfaction, and time

September 27, 2023

Topic
Infographics / alone, Alvin Chang, The Pudding, time use, well-being

For The Pudding, Alvin Chang examines loneliness through the lens of individual responses from the American Time Use Survey:

In this story, we’ll go through 24 hours of a typical weekend day in 2021. We know what people did – and who they did it with – because, since 2003, the American Time Use Survey has asked people to track how they use their time.

By the end of the day, we’ll learn that Martin’s isolation isn’t unique. In fact, loneliness has become a far more common experience in the last few decades – and it was supercharged by the pandemic.

The heart of the piece is in the anti-aggregate view of individuals through 24 hours. See each person’s schedule, who they spend time with, and how that changes through the day. Sorting draws patterns. The scrolly clock on the right ticks. And it works on mobile. Chang makes the data immediately relatable.

See it.
Two maps with the same scale

September 26, 2023

Topic
Maps / Josh Horowitz, scale

When you compare two areas on a single map, it can be a challenge to compare the actual size of them because of the trade-offs with projecting a three-dimensional space onto a two-dimensional space. Josh Horowitz made a thing that automatically rescales side-by-side maps as you pan and zoom, so that you get a more accurate comparison.
Evolution of beer

September 25, 2023

Topic
Infographics / beer, Washington Post

Beer dates back thousands of centuries, but it was not the beer we know today. It might have been more… chewy? More like gruel? Sounds amazing. With a fun illustrated piece, The Washington Post describes the evolution of beer, from the chunky fermented grain stuff to the clear carbonated beverage in cans.
Twitter slows competitor links

September 22, 2023

Topic
Infographics / slow, The Markup, Twitter

When you click a link on Twitter, you go through a Twitter shortlink first and then to the place you want to go. When you click on a link that points to one of Twitter’s competitors, by complete coincidence I am sure, there’s a delay. For The Markup, Jon Keegan, Dan Phiffer and Joel Eastwood ran the tests. You can also try it with your own URLs.

I’m into the animated opening graphic.
Members Only

Calming Data

September 21, 2023

Topic
The Process / calm, feeling, insight

Chart-making provides a perspective that’s beyond my own, which can be calming.
Crows might understand probabilities

September 21, 2023

Topic
Statistics / Ars Technica, birds, probability

Researchers at the University of Tübingen are studying crows’ abilities to understand statistical inference. For Ars Technica, Kenna Hughes-Castleberry reports:

To do this, Johnston and her team began by training two crows to peck at various images on touchscreens to earn food treats. From this simple routine of peck-then-treat, the researchers significantly raised the stakes. “We introduce the concept of probabilities, such as that not every peck to an image will result in a reward,” Johnston elaborated. “This is where the crows learn the unique pairings between the image on the screen and the likelihood of obtaining a reward.” The crows quickly learned to associate each of the images with a different reward probability.

In the experiment, the two crows had to choose between two of these images, each corresponding to a different reward probability. “Crows were tasked with learning rather abstract quantities (i.e., not whole numbers), associating them with abstract symbols, and then applying that combination of information in a reward maximizing way,” Johnston said. Over 10 days of training and 5,000 trials, the researchers found that the two crows continued to pick the higher probability of reward, showing their ability to use statistical inference.