The Natural Language Toolkit is a Python library that is commonly used to extract data from text. There’s a free, online-accessible book to learn how to use it.
Software
Programs and online applications to help you make use of data.
-
Link
NLTK Book
-
Link
Visual Sedimentation
By Samuel Huron and Romain Vuillemot, Visual Sedimentation is “a JavaScript library for visualizing streaming data, inspired by the process of physical sedimentation.” Be sure to check out the examples.
-
Learn about politics in your state with Open States
It’s not especially straightforward to know or find out what’s going on with…
-
Link
Advanced R development book
Hadley Wickham has been working on a book that covers advanced programming in R, namely programming concepts and workflow and package development. It goes to print later this year, but the contents are freely available for consumption now.
-
Link
Geocoding in Google Spreadsheets →
Nifty trick that uses Mapquest API as source
-
Link
iWantHue →
Automatically make a color palette based on color space
-
Link
R Google Analytics →
Analyze web traffic [via]
-
Link
Arc Diagrams in R →
Using this Protovis example with Les Misérables
-
Link
svg.js →
A lightweight library for manipulating and animating SVG
-
Link
Shiny Server →
Run R applications online
-
Link
Ayasdi →
A tool that advertises “automatic insights” from complex data, looks like mainly with clustering and network graphs
-
Link
slitscanner.js →
Make a sound sculpture from any YouTube video
-
Link
D3 3.0 →
Built-in geographic projections, better transitions, and more extensive asynchronous requests
-
Sitegeist: A mobile app that tells you about your data surroundings
From businesses to demographics, there’s data for just about anywhere you are. Sitegeist,…
-
Link
Hexagonal binning in D3 →
Useful with dense point clouds; also this method using size instead of color
-
Shiny allows web applications with R
RStudio, the folks behind the IDE for R released last year, continues to…
-
xkcd-style charts in R, JavaScript, and Python
The ports and packages to make your charts look like they came from…
-
Torque for mapping temporal data
Mapping data over time can be challenging, especially when you have a lot…
-
Easy and customizable maps with TileMill →
I’m late to this party. TileMill, by mapping platform MapBox, is open source…
-
Analyze your Facebook profile with Wolfram|Alpha
Feeding off the momentum from Stephen Wolfram’s personal analytics earlier this year, Wolfram|Alpha…