hello world!!!!!

A place to write down some of my own insights, how I approach my problems, and the hardships of the programming road, in the hope of one day becoming an expert.

Some useful mappers

Posted on 2011-12-17 01:57 by 陈力
The map() function generates a (possibly empty) list of (K2, V2) pairs for a given (K1, V1)
input pair. The OutputCollector receives the output of the mapping process, and
the Reporter provides the option to record extra information about the mapper as
the task progresses.
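
To make that contract concrete, here is a minimal sketch in plain Java. It does not use the Hadoop classes: ListCollector and CountingReporter are hypothetical stand-ins for OutputCollector and Reporter, and the word-length mapper and the BLANK_LINES counter name are invented for illustration only. The point is just that one input pair can produce zero, one, or many output pairs, while the reporter records side information.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

final class MapContractSketch {

    /** Hypothetical stand-in for OutputCollector<K2, V2>: buffers every emitted pair. */
    static final class ListCollector<K, V> {
        final List<Map.Entry<K, V>> pairs = new ArrayList<>();

        void collect(K key, V value) {
            pairs.add(new SimpleEntry<>(key, value));
        }
    }

    /** Hypothetical stand-in for Reporter: keeps named counters as extra task information. */
    static final class CountingReporter {
        final Map<String, Long> counters = new HashMap<>();

        void incrCounter(String name, long amount) {
            counters.merge(name, amount, Long::sum);
        }
    }

    /**
     * Example map step with K1 = Long (line offset), V1 = String (line text),
     * K2 = String (word), V2 = Integer (word length).
     */
    static void map(Long offset, String line,
                    ListCollector<String, Integer> output,
                    CountingReporter reporter) {
        int emitted = 0;
        for (String word : line.trim().split("\\s+")) {
            if (word.isEmpty()) {
                continue; // a blank line splits into a single empty string
            }
            output.collect(word, word.length());
            emitted++;
        }
        if (emitted == 0) {
            // The output list for this input pair is empty; note it as side information.
            reporter.incrCounter("BLANK_LINES", 1);
        }
    }

    public static void main(String[] args) {
        ListCollector<String, Integer> output = new ListCollector<>();
        CountingReporter reporter = new CountingReporter();
        map(0L, "hello hadoop", output, reporter); // emits two pairs
        map(13L, "   ", output, reporter);         // emits nothing
        System.out.println(output.pairs);
        System.out.println(reporter.counters);
    }
}
```

In a real job the same logic would sit inside a class implementing Mapper<K1,V1,K2,V2>, with Writable key and value types instead of Long and String.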
Hadoop provides a few useful mapper implementations. Table 3.2 lists some of them.
Table 3.2 Some useful Mapper implementations predefined by Hadoop

Class                  Description
IdentityMapper<K,V>    Implements Mapper<K,V,K,V> and maps inputs directly to outputs
InverseMapper<K,V>     Implements Mapper<K,V,V,K> and reverses the key/value pair
RegexMapper<K>         Implements Mapper<K,Text,Text,LongWritable> and generates a (match, 1) pair for every regular expression match
TokenCountMapper<K>    Implements Mapper<K,Text,Text,LongWritable> and generates a (token, 1) pair when the input value is tokenized
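
The behavior described in the table can be sketched in plain Java as follows. This is only an illustration of what each mapper emits, not the Hadoop source: the class name PredefinedMapperSketch and its methods are hypothetical, String and Long stand in for Text and LongWritable, the regular expression is passed as a parameter for simplicity, and tokenization is assumed to split on whitespace.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.StringTokenizer;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class PredefinedMapperSketch {

    private static <A, B> Map.Entry<A, B> pair(A a, B b) {
        return new SimpleEntry<>(a, b);
    }

    /** Like IdentityMapper: (key, value) -> (key, value). */
    static <K, V> List<Map.Entry<K, V>> identity(K key, V value) {
        return Collections.singletonList(pair(key, value));
    }

    /** Like InverseMapper: (key, value) -> (value, key). */
    static <K, V> List<Map.Entry<V, K>> inverse(K key, V value) {
        return Collections.singletonList(pair(value, key));
    }

    /** Like RegexMapper: one (match, 1) pair per regular expression match in the value. */
    static List<Map.Entry<String, Long>> regex(String pattern, String value) {
        List<Map.Entry<String, Long>> out = new ArrayList<>();
        Matcher matcher = Pattern.compile(pattern).matcher(value);
        while (matcher.find()) {
            out.add(pair(matcher.group(), 1L));
        }
        return out;
    }

    /** Like TokenCountMapper: one (token, 1) pair per token (whitespace split assumed). */
    static List<Map.Entry<String, Long>> tokenCount(String value) {
        List<Map.Entry<String, Long>> out = new ArrayList<>();
        StringTokenizer tokens = new StringTokenizer(value);
        while (tokens.hasMoreTokens()) {
            out.add(pair(tokens.nextToken(), 1L));
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(identity(0L, "a line"));
        System.out.println(inverse(0L, "a line"));
        System.out.println(regex("[0-9]+", "a1 b22 c"));
        System.out.println(tokenCount("to be or not to be"));
    }
}
```

Note that neither the regex nor the token sketch sums anything: repeated tokens simply produce repeated (token, 1) pairs, and adding them up is left to a later stage.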
As the MapReduce name implies, the major data flow operation after map is the reduce
phase, shown in the bottom part of figure 3.1.