Easy text classification

Posted to Statistics  |  Nathan Yau

Text can be a great source of data, but extracting useful information from it for analysis can be a challenge. etcML can help with that. Browse Twitter trends, classify your own text with existing machine learning classifiers, or upload your own training data.

But most importantly, you can use etcML to learn interesting new things about whatever text data you’re already working with in your job or research. Say you’re a social scientist with written and multiple-choice survey responses — you can quickly see how well participants’ written text allows you to guess their multiple-choice response. Or say you’re a literary scholar who wants to know what distinguishes an author’s early and late periods — you can train a classifier and visualize the most predictive words for each category.
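To make the "most predictive words for each category" idea concrete, here is a minimal sketch in plain Python. It is not etcML's method or API, which the post does not describe. It simply ranks words by a smoothed log-odds score, and the function names (`tokenize`, `predictive_words`) and the toy "early" and "late" samples are made up for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase and keep runs of letters/apostrophes; a deliberately crude tokenizer.
    return re.findall(r"[a-z']+", text.lower())


def predictive_words(docs_by_label, top_n=3):
    """Rank words per label by add-one-smoothed log-odds against all other labels."""
    counts = {
        label: Counter(w for doc in docs for w in tokenize(doc))
        for label, docs in docs_by_label.items()
    }
    vocab = set().union(*counts.values())
    result = {}
    for label, own in counts.items():
        # Pool word counts from every other label.
        other = Counter()
        for other_label, c in counts.items():
            if other_label != label:
                other.update(c)
        own_total = sum(own.values()) + len(vocab)
        other_total = sum(other.values()) + len(vocab)
        scores = {
            w: math.log((own[w] + 1) / own_total)
            - math.log((other[w] + 1) / other_total)
            for w in vocab
        }
        # Highest score first; ties broken alphabetically for stable output.
        result[label] = sorted(vocab, key=lambda w: (-scores[w], w))[:top_n]
    return result


# Hypothetical toy data standing in for an author's early and late writing.
samples = {
    "early": ["the sea the sea and the ship", "a ship on the sea"],
    "late": ["the city and the street", "a street in the city at night"],
}
top = predictive_words(samples, top_n=2)
print(top)
```

Common words like "the" score near zero because they appear about equally in both groups, while words that show up in only one group rise to the top. A real classifier would need far more text and a held-out set to check how well it predicts.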

Saved for later.

