Easy text classification

Posted to Statistics  |  Tags: ,  |  Nathan Yau

Text can be a great source of data, but it can be a challenge to glean information from an analysis standpoint. etcML can help you with that. Browse Twitter trends, classify your own text with existing machine learning classifiers, or upload your own training data.

But most importantly, you can use etcML to learn interesting new things about whatever text data you’re already working with in your job or research. Say you’re a social scientist with written and multiple-choice survey responses — you can quickly see how well participants’ written text allows you to guess their multiple-choice response. Or say you’re a literary scholar who wants to know what distinguishes an author’s early and late periods — you can train a classifier and visualize the most predictive words for each category.

Saved for later.

Favorites

A Day in the Life of Americans

I wanted to see how daily patterns emerge at the individual level and how a person’s entire day plays out. So I simulated 1,000 of them.

Most popular porn searches, by state

We’ve seen that we can learn from what people search for, through the eyes of Google suggestions: state stereotypes, national …

The Best Data Visualization Projects of 2014

It’s always tough to pick my favorite visualization projects. Nevertheless, I gave it a go.

Think Like a Statistician – Without the Math

I call myself a statistician, because, well, I’m a statistics graduate student. However, the most important things I’ve learned are less formal, but have proven extremely useful when working/playing with data.