Process
- Programmer defines map and reduce functions.
- Map input is a very large file. Hadoop distributes it among
CPUs.
- Run map functions on section of input.
- Output keys and their values are accumulated and
sorted.
- Apply reduce to all the values of each output key.
José M. Vidal
.
18 of 24