Tuesday, 27 August 2013

Partitioning ! how dose the hadoop make it? Use a hash function ? what is the default function?

Partitioning ! how dose the hadoop make it? Use a hash function ? what is
the default function?

Partitioning is the process of determining which reducer instance will
receive which intermediate keys and values. Each mapper must determine for
all of its output (key, value) pairs which reducer will receive them. It
is necessary that for any key, regardless of which mapper instance
generated it, the destination partition is the same Problem: how dose the
hadoop make it? Use a hash function ? what is the function?

No comments:

Post a Comment