How You Will Die
I returned to the Underlying Cause of Death database maintained by the Centers for Disease Control and Prevention. It provides data for the number of people who died in the United States between 1999 and 2014. The records are based on death certificates, which require an entry for a single cause of death.
The CDC classifies the causes into 113 subcategories, which fit under the umbrella of 20 categories of disease and external causes. More specifically, the CDC uses the International Statistical Classification of Diseases and Related Health Problems (ICD), which is published by the World Health Organization.
The simulation below covers the main categories, or chapters, as they’re referred to by the ICD.
Enter your sex, race, and age. Each dot represents one of your simulated lives, and as each year passes, more of your simulated selves pass away. Color corresponds to cause of death, and the bars on the right keep track of the cumulative percentages. By the end, you’re left with the chances that you will die of each cause.
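To make the mechanics concrete, here is a minimal sketch of that kind of simulation. It is not the code behind the interactive. The function names, the default of 1,000 lives, the age cap, and especially the toy probabilities are all assumptions for illustration, not CDC figures.

```python
import random

def simulate_lives(start_age, death_prob, cause_shares,
                   n_lives=1000, max_age=120, seed=0):
    """Step each simulated life forward one year at a time.

    death_prob(age)   -> assumed probability of dying during that year of age
    cause_shares(age) -> dict mapping cause -> share of deaths at that age
    Returns a dict mapping cause -> fraction of simulated lives.
    """
    rng = random.Random(seed)
    counts = {}
    for _ in range(n_lives):
        age = start_age
        # Survive year after year until a death is drawn (or the cap is hit).
        while age < max_age and rng.random() >= death_prob(age):
            age += 1
        shares = cause_shares(age)
        causes = list(shares)
        cause = rng.choices(causes, weights=[shares[c] for c in causes])[0]
        counts[cause] = counts.get(cause, 0) + 1
    return {cause: n / n_lives for cause, n in counts.items()}

# Toy inputs, invented for this sketch. They are NOT real mortality data.
def toy_death_prob(age):
    if age == 0:
        return 0.006  # made-up bump for the first year of life
    return min(1.0, 0.0002 * 1.09 ** age)  # made-up rise with age

def toy_cause_shares(age):
    if age < 40:
        return {"External": 0.6, "Circulatory": 0.1, "Cancer": 0.1, "Other": 0.2}
    return {"External": 0.05, "Circulatory": 0.4, "Cancer": 0.25, "Other": 0.3}

result = simulate_lives(30, toy_death_prob, toy_cause_shares)
for cause, share in sorted(result.items(), key=lambda kv: -kv[1]):
    print(f"{cause}: {share:.1%}")
```

Swapping the toy functions for lookups into real rates by sex, race, and age would give the per-demographic percentages that the bars track.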
Try shifting age down to zero and watch the rate of change. One thing you should notice is that once you get past year one, you’re unlikely to die in the next few decades. It’s not until the later years that the dots start to change color much more quickly. You couldn’t see this in the normalized chart.
The main point, which is what you’d expect, is that the mortality rate is much lower in the earlier years of life than in the older years. But if you do die at a younger age, it’s much more likely due to something external than to a disease.
You can also look at it the other way. Shift age to the older years, and let the simulations run. You’re much more likely to die of a disease than of something external. Shift past 80 years, and there’s a greater than 40 percent chance the cause will be circulatory, regardless of demographic group.
This surprised me, because going off general news, it seems like cancer would be the leading cause. This is certainly true up to a certain age, but get past that and your heart can only keep going for so long.
Want to know how other people die? Here are the causes of death all at once.
- The CDC database only provides population estimates for 84 years and younger. For 85-plus, you can only get number of deaths by cause, which means I could not calculate a crude mortality rate for these years. Instead, I used probability of death from the Social Security Administration for the later years. Not ideal, but since we’re focused on cause of death rather than timing, it was good enough for me.
- Along the same lines, the CDC mortality numbers top out at a single group for 100 years and older.
- The ICD chapters encapsulate many causes of death. It’s worth looking at the database if you’re interested in a specific cause.
- I did analysis and data preparation in R and I made the interactive with d3.js.
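The rate calculation in the first note can be sketched roughly as follows. The actual analysis was done in R, so this is only an illustration. The function name, the dictionary layout, and the numbers are hypothetical, and treating the crude rate as a one-year probability of death is a simplifying assumption.

```python
def annual_death_probability(age, deaths, population, ssa_prob):
    """Return an assumed one-year probability of death at a given age.

    deaths     -> dict: age -> total deaths at that age
    population -> dict: age -> population estimate (missing for 85 and older)
    ssa_prob   -> dict: age -> probability of death from an SSA-style table
    """
    pop = population.get(age)
    if pop:
        # Crude mortality rate: deaths divided by population at that age.
        return deaths[age] / pop
    # No population estimate, so fall back to the external probability.
    return ssa_prob[age]

def cause_shares(deaths_by_cause):
    """Turn death counts by cause into shares that sum to one."""
    total = sum(deaths_by_cause.values())
    return {cause: n / total for cause, n in deaths_by_cause.items()}

# Hypothetical numbers for illustration only, not CDC or SSA values.
deaths = {30: 50, 90: 400}
population = {30: 100_000}   # no entry for 90, as with the 85-plus group
ssa_prob = {90: 0.16}

print(annual_death_probability(30, deaths, population, ssa_prob))
print(annual_death_probability(90, deaths, population, ssa_prob))
print(cause_shares({"Circulatory": 200, "Cancer": 100, "Other": 100}))
```

The shares by cause still come from death counts alone, which is why the missing population estimates affect timing rather than the breakdown by cause.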