Extract data from PDF files and export to CSV

Posted to Software  |  Tags: ,  |  Nathan Yau

Tabula, available for Windows and Mac, lets you extract data from PDF files, and it just got an update. The user interface got an overhaul and it’s now easier to grab data from multiple pages. I wrote about Tabula last year, but orgs continue to publish data in PDF files, and sometimes PDF is just all there is. So this is definitely a good thing.

Keep it in your toolbox.

Favorites

Jobs Charted by State and Salary

Jobs and pay can vary a lot depending on where you live, based on 2013 data from the Bureau of Labor Statistics. Here’s an interactive to look.

How to Spot Visualization Lies

Many charts don’t tell the truth. This is a simple guide to spotting them.

Think Like a Statistician – Without the Math

I call myself a statistician, because, well, I’m a statistics graduate student. However, the most important things I’ve learned are less formal, but have proven extremely useful when working/playing with data.

Watching the growth of Walmart – now with 100% more Sam’s Club

The ever so popular Walmart growth map gets an update, and yes, it still looks like a wildfire. Sam’s Club follows soon after, although not nearly as vigorously.