Easy text classification

Posted to Statistics  |  Nathan Yau

Text can be a great source of data, but gleaning useful information from it can be a challenge. etcML can help with that. Browse Twitter trends, classify your own text with existing machine learning classifiers, or upload your own training data.

But most importantly, you can use etcML to learn interesting new things about whatever text data you’re already working with in your job or research. Say you’re a social scientist with written and multiple-choice survey responses — you can quickly see how well participants’ written text allows you to guess their multiple-choice response. Or say you’re a literary scholar who wants to know what distinguishes an author’s early and late periods — you can train a classifier and visualize the most predictive words for each category.
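To make the "train a classifier and look at the most predictive words" idea concrete, here is a minimal sketch in plain Python. It is not how etcML works internally (the post doesn't say); it assumes a simple naive Bayes model with add-one smoothing, and the four "early" and "late" sentences are made up purely for illustration. The function names are my own.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase and split into simple word tokens.
    return re.findall(r"[a-z']+", text.lower())


def train(docs):
    # docs is a list of (text, label) pairs.
    # Returns per-label word counts and per-label document counts.
    word_counts = {}
    doc_counts = Counter()
    for text, label in docs:
        doc_counts[label] += 1
        word_counts.setdefault(label, Counter()).update(tokenize(text))
    return word_counts, doc_counts


def vocabulary(word_counts):
    # Every word seen in any category.
    return set().union(*word_counts.values())


def log_prob(word, label, word_counts, vocab_size):
    # Smoothed log P(word | label); add-one smoothing avoids log(0).
    counts = word_counts[label]
    return math.log((counts[word] + 1) / (sum(counts.values()) + vocab_size))


def classify(text, word_counts, doc_counts):
    # Pick the label with the highest naive Bayes score.
    vocab = vocabulary(word_counts)
    total_docs = sum(doc_counts.values())
    scores = {}
    for label in word_counts:
        score = math.log(doc_counts[label] / total_docs)
        for word in tokenize(text):
            if word in vocab:  # skip words never seen in training
                score += log_prob(word, label, word_counts, len(vocab))
        scores[label] = score
    return max(scores, key=scores.get)


def most_predictive(label, other, word_counts, n=3):
    # Rank words by how much more likely they are under `label` than `other`.
    vocab = vocabulary(word_counts)

    def log_odds(word):
        return (log_prob(word, label, word_counts, len(vocab))
                - log_prob(word, other, word_counts, len(vocab)))

    return sorted(vocab, key=log_odds, reverse=True)[:n]


# Made-up training sentences standing in for an author's two periods.
docs = [
    ("the sea was bright and the ship sailed at dawn", "early"),
    ("we sailed the bright sea all summer", "early"),
    ("the house was silent and the winter was long", "late"),
    ("a long silent winter in the old house", "late"),
]

word_counts, doc_counts = train(docs)
print(classify("a bright ship on the sea", word_counts, doc_counts))
print(most_predictive("early", "late", word_counts))
```

With real data you would want far more text per category and a held-out set to check accuracy, but the ranking step is the part that gets at what distinguishes one category from another.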

Saved for later.
