PaperReading20200424

CanChen ggchen@mail.ustc.edu.cn


 

MapReduce

The MapReduce library in the user program first splits the input files into M pieces and then starts copies of the user program on the nodes of the cluster. One copy is special and is called the master. The master assigns the M map tasks to worker nodes; each worker processes its piece of the input and writes the intermediate results to its local disk, partitioned into R regions. A worker that is assigned a reduce task then reads its region of the buffered data remotely from the map workers' local disks. The outputs of the R reduce tasks are then written out as the final result, one output per reduce task. Personally, I think these R outputs still need to be reduced (merged) once more to obtain the real final result.
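The flow described above can be sketched as a small single-process word count. This is only an illustration under stated assumptions, not the paper's implementation: the function names (`split_input`, `map_task`, `reduce_task`, `run_mapreduce`, `merge_outputs`) are hypothetical, the "cluster" is simulated with plain lists, and the partitioning into R regions is assumed to be a hash of the key modulo R.

```python
from collections import defaultdict
from zlib import crc32


def split_input(lines, m):
    """Split the input into m pieces, one per map task."""
    return [lines[i::m] for i in range(m)]


def map_task(piece, r):
    """Emit (word, 1) pairs, partitioned into r regions.

    The regions stand in for the files a map worker writes to its local disk.
    crc32 is used (an assumption) so the partitioning is deterministic.
    """
    regions = [[] for _ in range(r)]
    for line in piece:
        for word in line.split():
            regions[crc32(word.encode()) % r].append((word, 1))
    return regions


def reduce_task(region_index, all_map_outputs):
    """Read one region from every map output and sum the counts per word."""
    counts = defaultdict(int)
    for regions in all_map_outputs:
        for word, count in regions[region_index]:
            counts[word] += count
    return dict(counts)


def run_mapreduce(lines, m=3, r=2):
    """Run m map tasks and r reduce tasks; return the r reduce outputs."""
    pieces = split_input(lines, m)
    map_outputs = [map_task(piece, r) for piece in pieces]
    return [reduce_task(i, map_outputs) for i in range(r)]


def merge_outputs(reduce_outputs):
    """Combine the r reduce outputs into a single dictionary."""
    merged = {}
    for output in reduce_outputs:
        merged.update(output)
    return merged
```

In this sketch every word is routed to exactly one region, so the R reduce outputs never share a key. Merging them is therefore a plain union rather than another reduction, which may be relevant to the question of whether the R results need a further reduce step.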

 

posted @ 2020-04-25 21:16  Klaus-Chen