Extract data from PDF files and export to CSV

Posted to Software  |  Tags: ,  |  Nathan Yau

Tabula, available for Windows and Mac, lets you extract data from PDF files, and it just got an update. The user interface got an overhaul and it’s now easier to grab data from multiple pages. I wrote about Tabula last year, but orgs continue to publish data in PDF files, and sometimes PDF is just all there is. So this is definitely a good thing.

Keep it in your toolbox.

Favorites

Think Like a Statistician – Without the Math

I call myself a statistician, because, well, I’m a statistics graduate student. However, the most important things I’ve learned are less formal, but have proven extremely useful when working/playing with data.

Graphical perception – learn the fundamentals first

Before you dive into the advanced stuff – like just about everything in your life – you have to learn the fundamentals before you know when you can break the rules.

Reviving the Statistical Atlas of the United States with New Data

Due to budget cuts, there is no plan for an updated atlas. So I recreated the original 1870 Atlas using today’s publicly available data.

Pizza Place Geography

Most of the major pizza chains are within a 5-mile radius of where I live, so I have my pick, …