Hi Joel,
Thanks for the information.
Regards,
Edwin
On 25 March 2017 at 10:15, Joel Bernstein wrote:
> The innerJoin is a merge join and the hashJoin is a hash join.
>
> The merge join can support joins of unlimited size and never runs out of
> memory. But it requires that both sides of the j
The innerJoin is a merge join and the hashJoin is a hash join.
The merge join can support joins of unlimited size and never runs out of
memory. But it requires that both sides of the join are sorted on the join
keys.
The hash join reads one side of the join into a hash map keyed on the join
keys.