“…an open source web mining toolkit that runs as a series of Cascading pipes on top of Hadoop”