Re: MapReduce shuffling phase

Ranadip Chatterjee Wed, 31 Aug 2022 23:23:01 -0700

The reducer nodes handle shuffle in legacy Mapreduce in Hadoop. More modern
frameworks have the option to configure an external shuffle service in
which case it can be run elsewhere.


On Wed, 31 Aug 2022, 20:42 Pratyush Das, <[email protected]> wrote:

> Hi,
>
> Which node on HDFS is MapReduce's "Shuffle" phase that aggregates all
> values corresponding to a key, performed on?
>
> The Map phase happens on the datanode containing a block. I assume that
> the Reduce phase happens on some arbitrary free node. But which node is the
> shuffle phase performed on? (since it aggregates values from all datanodes
> before passing it to the Reducer)
>
> Is the Shuffle phase performed on the client node?
>
> Thank you,
>
> --
> Pratyush Das
>

Re: MapReduce shuffling phase

Reply via email to