Goodbye MapReduce

Cascading is a feature rich API for defining and executing complex and fault tolerant data processing workflows on a Hadoop cluster.

The processing API lets the developer quickly assemble complex distributed processes without having to "think" in MapReduce. And to efficiently schedule them based on their dependencies and other available meta-data.

Read more about Cascadings features.

Also see our About page. For a simple primer on using Cascading, please read our Gentle Introduction. And for those already familiar with Hadoop and MapReduce, read our detailed Overview on how Cascading works.

With our Groovy extension, Cascading.groovy, it is even simpler for both advanced and casual Hadoop users to create complex processing workflows. Read more on our Groovy Scripting page.

posted @ 2008-10-27 23:26  octoberfirst  阅读(250)  评论(0编辑  收藏  举报