-
Most television shows don’t get past the first season, but there are some that manage to stick around. These are the 175 longest running shows on IMDb that have ratings.
-
Members Only
-
Yuhao Du, Jessica Nordell, and Kenneth Joseph used simulations to study the effects of small gender biases at entry level up to executive level. It doesn’t take much to skew the distribution. For NYT Opinion, Yaryna Serkez shows the simulation in action with moving bubbles and stacked area charts for each work level.
The simulation imagines a company where female performance is undervalued by 3 percent. Each dot represents an employee, and they either move up with promotions or stay still. The distribution of men and women start even but end very uneven.
-
[arve url=”https://www.youtube.com/watch?v=2p3NIR8LYoo” loop=”no” muted=”no” /]
Vox shows how the 3-point line is “breaking” the game.
The basic math says a 3-point shot is more efficient for scoring points than a 2-point shot if the team can make a high enough percentage of attempts. It’s why the mid-range shot has fallen out of favor.
But it’s more an evolution than a breaking. Defense adapts, and then offense adjusts to that, etc. Stephen Curry making double-digit threes is still way more exciting than Curry not making threes.
-
When plotting Russian election results, a structured grid patterns appear. From The Economist:
When Dmitry Kobak and Sergey Shpilkin, two researchers, analysed the results, they found that an unusually high number of turnout and vote-share results were multiples of five (eg, 50%, 55%, 60%), a tell-tale sign of manipulation. According to Messrs Kobak and Shpilkin, there were at least 1,310 polling stations (out of 96,325) with results that were suspiciously tidy, with rounder numbers than you would expect to see by chance.
I’m not familiar with Russian elections, but this seems like lazy cheating. Are they just making up numbers by hand or what?
Check out the full results of Kobak and Shpilkin’s analysis in Python notebook form.
-
Nightingale is a publication from Data Visualization Society that offers more depth for many topics in the field of visualization. They’re working on a print magazine of the same name.
Subscribed.
-
Tim Harford warns against bad data in science:
Some frauds seem comical. In the 1970s, a researcher named William Summerlin claimed to have found a way to prevent skin grafts from being rejected by the recipient. He demonstrated his results by showing a white mouse with a dark patch of fur, apparently a graft from a black mouse. It transpired that the dark patch had been coloured with a felt-tip pen. Yet academic fraud is no joke.
-
We often hear about increased CO2 in the context of global warming. Hayley Warren and Akshat Rathi for Bloomberg show why we should talk more about methane:
In the fight against global warming, methane has flown under the radar for too long. But there’s increasing recognition that tackling the invisible, odorless gas is one of the easiest, cheapest, and most feasible ways to make a real difference in slowing climate change. It’s the rare climate problem with a fix that can be felt by those alive right now, not their great-grandchildren.
-
Spike maps use the height of spikes to encode data geographically. The format provides a similar effect to frequency trails where the layering looks 3-D-ish, except spikes are typically centered on an area instead of running parallel.
Anyways, like most visualization methods with a name, there is an R package for spike maps by Timothée Giraud.
If D3.js is your jam, there’s also a solution for that. You can also take the no-code route with Datawrapper.
-
Members Only
-
The Washington Post, in an effort that I’m sure took more energy and time than it looks, compared U.S. abortion laws against those in other countries:
In the last three decades, countries around the world have made it easier to legally get an abortion. In some parts of the United States, however, it’s gotten harder. Just this month, Mexico’s supreme court ruled to decriminalize the practice. Argentina’s Senate legalized abortion in December. Meanwhile, New Zealand, Thailand and Ireland have all taken steps to ease abortion restrictions in recent years.
Hm.
-
Zan Armstrong, Ian Johnson, and Mike Freeman for Observable wrote a guide on analyzing time series data. Using an energy dataset, they show how asking different questions can lead to different findings and visualizations:
These are stories about analyzing data that changes over time. While most of us don’t dig into data about energy day-to-day, we hope the feel of this data and these questions will be familiar to anyone who regularly faces questions like “what changed?”, “what happened?”, “was that normal?”, “what is typical?”, and “did things go as expected?” We hope that this will spark an idea about how to look at your own data in a new way.
I will never tire of the multiple-views-from-the-same-dataset teaching device.
-
Inspired by a graphic on Reddit, Jim Vallandingham expanded the format for all the shows. Search for a show and get a heatmap for average ratings by season and episode. See how your favorite show went into the dumpster at the end or withstood the test of time. Nice.
The data comes from IMDb Datasets, which seems like a fun time series dataset to poke at.
-
Cloudflare describes how things looked from their point of view the day that Facebook, along with its other properties, went down. From the Border Gateway Protocol, which defines routing information:
A BGP UPDATE message informs a router of any changes you’ve made to a prefix advertisement or entirely withdraws the prefix. We can clearly see this in the number of updates we received from Facebook when checking our time-series BGP database. Normally this chart is fairly quiet: Facebook doesn’t make a lot of changes to its network minute to minute.
But at around 15:40 UTC we saw a peak of routing changes from Facebook. That’s when the trouble began.
-
Aaron Steckelberg and Tik Root for The Washington Post provide a visual guide on how to protect your home from wildfire. It starts with an ember floating carefree in the air, and then the tour highlights actions you can take.
Can’t wait for the guides on how to protect your home from flooding and/or how to dress for extreme temperatures.
-
Companies collect and aggregate location data from millions of people’s phones. Then that data gets sold in a multibillion-dollar market. Jon Keegan and Alfred Ng for The Markup report on who’s doing the collecting and where your data goes:
Once a person’s location data has been collected from an app and it has entered the location data marketplace, it can be sold over and over again, from the data providers to an aggregator that resells data from multiple sources. It could end up in the hands of a “location intelligence” firm that uses the raw data to analyze foot traffic for retail shopping areas and the demographics associated with its visitors. Or with a hedge fund that wants insights on how many people are going to a certain store.
“There are the data aggregators that collect the data from multiple applications and sell in bulk. And then there are analytics companies which buy data either from aggregators or from applications and perform the analytics,” said Tsiounis of Advan Research. “And everybody sells to everybody else.”
It just makes you feel all warm and fuzzy inside.
-
Members Only
-
Whether it’s because of experience, physical ability, or education level, some jobs tend towards a certain age of worker more than others.
-
UnstableGround is a project from the Woodwell Climate Research Center that focuses on climate change in the Arctic:
Climate change is transforming the Arctic, impacting people and ecosystems across this vast region. But because our climate system is connected globally, what happens in the Arctic doesn’t stay in the Arctic.
Discover how Arctic landscapes are changing and learn about the consequences for communities across the globe.
A stories section uses maps, charts, and photographs to communicate the ongoing research.
-
Before you get into analysis and visualizing data with R, you need to know the basics. Steve Haroz wrote a guide on getting started:
This book is a short introduction to the R language. It covers the basics of R that are not covered by analysis and visualization guides like R for Data Science. Consider it a quick way to get up to speed on R before diving into the analysis and visualization aspects.
This example-focused guide assumes you are familiar with programming concepts but want to learn the R language. It offers more examples than an “R cheat sheet” without the verbosity of a language spec or an introduction to programming.