Loans from Freddie and Fannie that defaulted after the bubble

Posted to Statistics  |  Tags: , ,  |  Nathan Yau

Under the directive of the Federal Housing Finance Agency, started to release detailed loan-level data in 2013. Todd W. Schneider looked at the data recently, evaluating default rates — the proportion of loans that fell into deliquency — with a bit of geography.

California, Nevada, Arizona, and Florida were in particularly bad shape during the 2005 through 2007 bubble. Some counties had more than 40 percent of loans default. I don’t know much about loans, but that seems high. And there was plenty of contrast as you evaluate nearby areas.

It’s less than 100 miles from San Francisco to Modesto and Stockton, and only 35 miles from Anaheim to Riverside, yet we see such dramatically different default rates between the inland regions and their relatively more affluent coastal counterparts.

Aside from the analysis though, maybe the most interesting bit is Schneider’s previous experience as a mortgage analyst and the contrast of analysis a few years ago to now.

Between licensing data and paying for expensive computers to analyze it, you could have easily incurred costs north of a million dollars per year. Today, in addition to Fannie and Freddie making their data freely available, we’re in the midst of what I might call the “medium data” revolution: personal computers are so powerful that my MacBook Air is capable of analyzing the entire 215 GB of data, representing some 38 million loans, 1.6 billion observations, and over $7.1 trillion of origination volume. Furthermore, I did everything with free, open-source software. I chose PostgreSQL and R, but there are plenty of other free options you could choose for storage and analysis.

You can check out the code on GitHub. [Thanks, Todd]

Favorites

The Most Unisex Names in US History

Moving on from the most trendy names in US history, let’s look at the most unisex ones. Some names have …

Top Brewery Road Trip, Routed Algorithmically

There are a lot of great craft breweries in the United States, but there is only so much time. This is the computed best way to get to the top rated breweries and how to maximize the beer tasting experience. Every journey begins with a single sip.

Unemployment in America, Mapped Over Time

Watch the regional changes across the country from 1990 to 2016.

A Day in the Life of Americans

I wanted to see how daily patterns emerge at the individual level and how a person’s entire day plays out. So I simulated 1,000 of them.