I often find myself wondering what the fastest, most efficient way to process a dataset is. If it takes too long to think of an answer, it's usually better to just do it manually. Put on the headphones and start punching values into a spreadsheet. It almost always takes less time than I thought it would.
But of course, there are many tools to clean up your data, and they can be helpful with the right dataset and situation. I tend to stick to a small handful. Here’s what works for me.
The Process is a weekly newsletter where I evaluate how visualization tools, rules, and guidelines work in practice. I publish every Thursday. Get it in your inbox or access it via the site.
FlowingData is a never-ending source of information, inspiration and instruction in data, visualization and design and Nathan Yau’s approach to exploring these subjects is both technical and familiar, somewhat similar to a discussion of a shared obsession over beers with a friend.