Easy text classification

Posted to Statistics  |  Nathan Yau

Text can be a great source of data, but gleaning information from it can be a challenge from an analysis standpoint. etcML can help you with that. Browse Twitter trends, classify your own text with existing machine learning classifiers, or upload your own training data.

But most importantly, you can use etcML to learn interesting new things about whatever text data you’re already working with in your job or research. Say you’re a social scientist with written and multiple-choice survey responses — you can quickly see how well participants’ written text allows you to guess their multiple-choice response. Or say you’re a literary scholar who wants to know what distinguishes an author’s early and late periods — you can train a classifier and visualize the most predictive words for each category.

Saved for later.
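
For a rough feel of what "train a classifier and visualize the most predictive words" might look like, here is a minimal sketch in plain Python. It is not etcML's method, which the post doesn't describe. It assumes a simple naive Bayes model over word counts, and the toy sentences, the labels, and every function name are made up for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def train(labeled_texts):
    """Count words per label from a list of (text, label) pairs."""
    word_counts = {}
    doc_counts = Counter()
    for text, label in labeled_texts:
        word_counts.setdefault(label, Counter()).update(tokenize(text))
        doc_counts[label] += 1
    vocab = set()
    for counts in word_counts.values():
        vocab.update(counts)
    return word_counts, doc_counts, vocab


def log_prob(word, label, word_counts, vocab):
    """Laplace-smoothed log P(word | label)."""
    counts = word_counts[label]
    return math.log((counts[word] + 1) / (sum(counts.values()) + len(vocab)))


def classify(text, model):
    """Return the label with the highest naive Bayes score."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    scores = {}
    for label in word_counts:
        score = math.log(doc_counts[label] / total_docs)
        for word in tokenize(text):
            if word in vocab:  # skip words never seen in training
                score += log_prob(word, label, word_counts, vocab)
        scores[label] = score
    return max(scores, key=scores.get)


def most_predictive(label, other, model, n=3):
    """Words with the highest log-odds for `label` over `other`."""
    word_counts, _, vocab = model

    def log_odds(word):
        return (log_prob(word, label, word_counts, vocab)
                - log_prob(word, other, word_counts, vocab))

    # Sort by descending log-odds, breaking ties alphabetically.
    return sorted(vocab, key=lambda w: (-log_odds(w), w))[:n]


# Made-up training data standing in for an author's two periods.
TOY_DATA = [
    ("the sea was bright and the ship sailed at dawn", "early"),
    ("bright sails and a young ship on the open sea", "early"),
    ("the house was silent and the winter long", "late"),
    ("a silent winter in the old house by the road", "late"),
]

model = train(TOY_DATA)
print(classify("a ship on the bright sea", model))  # early
print(most_predictive("early", "late", model))      # ['bright', 'sea', 'ship']
print(most_predictive("late", "early", model))      # ['house', 'silent', 'winter']
```

With real text you would want far more training data and a held-out set to check accuracy, but the idea is the same: words that show up much more often in one category than the other are the ones that give it away.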

