10 Largest Data Breaches Since 2000 – Millions Affected

In light of the MySpace photo breach (due to their negligence) a couple of months ago, I got to wondering about other recent data breaches. It turns out Attrition.org keeps a Data Loss Archive and Database that contains known data breaches since 2000. Records include date, number affected, groups involved, summaries, and links to reported stories and updates. It’s surprisingly detailed and even better, it’s all available for download.

The above graphic shows the 10 largest data breaches which affected millions. I thought the 800,000 records thieved from UCLA a couple of years ago (that my information was unfortunately a part of) was a lot. That’s nothing compared to these.

Notice the higher frequency as we get closer to the present?

[Thanks Ryan | Welcome, Boing Boing readers]


Become a member. Support an independent site. Make great charts.

See What You Get

Learn to Visualize Data See All →

How to Visualize Ranges of Data in R

When you want to focus on the magnitude of differences between low and high values, use visual cues that highlight distance.

How to Visualize and Compare Distributions in R

Single data points from a large dataset can make it more relatable, but those individual numbers don’t mean much without something to compare to. That’s where distributions come in.

Voronoi Diagram and Delaunay Triangulation in R

The deldir package by Rolf Turner makes the calculations and plotting straightforward, with a few lines of code.

How to Make an Animated Donut Chart in R

There are “better” ways to show proportions over time, but sometimes you just want an animated donut.


Peak Non-Creepy Dating Pool

Based on the “half-your-age-plus-seven” rule, the range of people you can date expands with age. Combine that with population counts and demographics, and you can find when your non-creepy dating pool peaks.

Think Like a Statistician – Without the Math

I call myself a statistician, because, well, I’m a statistics graduate student. However, the most important things I’ve learned are less formal, but have proven extremely useful when working/playing with data.

Years You Have Left to Live, Probably

The individual data points of life are much less predictable than the average. Here’s a simulation that shows you how much time is left on the clock.

Reviving the Statistical Atlas of the United States with New Data

Due to budget cuts, there is no plan for an updated atlas. So I recreated the original 1870 Atlas using today’s publicly available data.