MapReduce - Wikipedia, the free encyclopedia
Popularity Report
![]() |
|||
![]() |
|||
![]() |
|||
![]() |
|||
![]() |
|||
![]() |
URL Tag Cloud
Bookmark History
Saved by 20 people (-3 private), first by anonymouse user on 2006-08-17
- Rg443322 on 2009-11-02 - Tags MapReduce , Wikipedia , Google
- Burkestar on 2009-05-05 - Tags no_tag
- Pierretran on 2009-05-03 - Tags distributed computing , parallel computing , mapreduce , google
- Kenyth on 2009-04-30 - Tags distributed , algorithm , wikipedia , scalability , Google
- Shon__ on 2009-04-26 - Tags no_tag
Public Sticky notes
Highlighted by pseudoking
Highlighted by pseudoking
Highlighted by pseudoking
Highlighted by doxyer
Highlighted by kenyth
Highlighted by doxyer
Highlighted by pseudoking
Highlighted by doxyer
Highlighted by doxyer
Highlighted by doxyer
Highlighted by kenyth
Map takes one pair of data with a type on a data domain, and returns a list of pairs in a different domain:
Map(k1,v1) -> list(k2,v2)
Highlighted by kenyth
Highlighted by kenyth
Highlighted by doxyer
Highlighted by kenyth
The Reduce function is then applied in parallel to each group, which in turn produces a collection of values in the same domain:
Reduce(k2, list (v2)) -> list(v2)
Highlighted by kenyth
Highlighted by doxyer
Highlighted by kenyth
Highlighted by doxyer
Highlighted by kenyth
The hot spots, which the application defines, are:
- an input reader
- a Map function
- a partition function
- a compare function
- a Reduce function
- an output writer
Highlighted by doxyer
Highlighted by doxyer
Highlighted by kenyth
Highlighted by doxyer


Public Comment