Map Reduce and Hadoop

Process

  1. Programmer defines map and reduce functions.
  2. Map input is a very large file. Hadoop distributes it among CPUs.
  3. Run map functions on section of input.
  4. Output keys and their values are accumulated and sorted.
  5. Apply reduce to all the values of each output key.

José M. Vidal .

18 of 24