Hi Joel,

Thanks for the information.

Regards,
Edwin


On 25 March 2017 at 10:15, Joel Bernstein <joels...@gmail.com> wrote:

> The innerJoin is a merge join and the hashJoin is a hash join.
>
> The merge join can support joins of unlimited size and never runs out of
> memory. But it requires that both sides of the join are sorted on the join
> keys.
>
> The hash join reads one side of the join into a hash map keyed on the join
> keys. This doesn't require any specific sort but it is limited in size by
> how much data can fit in the hash map.
>
> You can parallelize both joins using the parallel function to improve
> scalability and performance.
>
> Joel Bernstein
> http://joelsolr.blogspot.com/
>
> On Fri, Mar 24, 2017 at 4:49 AM, Zheng Lin Edwin Yeo <edwinye...@gmail.com> wrote:
>
> > Hi,
> >
> > What is the main difference between hashJoin and innerJoin in Solr
> > Streaming Expression?
> >
> > I understand that both will emit a tuple containing the fields of both
> > tuples.
> >
> > When I tried both hashJoin and innerJoin with the same query, I got exactly
> > the same results, and there was no difference in performance.
> >
> > Under what circumstances should we use hashJoin, and under what
> > circumstances should we use innerJoin?
> >
> > Regards,
> > Edwin
> >
>
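
To make the distinction in Joel's reply concrete, here is a minimal Python sketch of the two join strategies. It is not Solr's implementation. The function names (merge_join, hash_join), the dict-based tuples, and the rule that right-hand fields win on a name clash are all assumptions made for illustration. The merge join holds only the current group of equal keys in memory but needs both inputs sorted on the join key. The hash join accepts any order but must load one whole side into a map first.

```python
from collections import defaultdict
from itertools import groupby
from operator import itemgetter


def merge_join(left, right, key):
    """Inner join of two streams of dicts, both sorted ascending on `key`.

    Only the current run of equal keys from the right side is buffered,
    so memory use does not grow with the total size of either stream.
    """
    left_groups = groupby(left, key=itemgetter(key))
    right_groups = groupby(right, key=itemgetter(key))
    done = (None, None)  # sentinel returned when a side is exhausted
    lk, lg = next(left_groups, done)
    rk, rg = next(right_groups, done)
    while lg is not None and rg is not None:
        if lk < rk:
            lk, lg = next(left_groups, done)
        elif lk > rk:
            rk, rg = next(right_groups, done)
        else:
            right_rows = list(rg)  # buffer just this key's rows
            for l_row in lg:
                for r_row in right_rows:
                    yield {**l_row, **r_row}
            lk, lg = next(left_groups, done)
            rk, rg = next(right_groups, done)


def hash_join(stream, hashed, key):
    """Inner join that loads `hashed` fully into a dict keyed on `key`.

    `stream` may arrive in any order, but `hashed` must fit in memory.
    """
    table = defaultdict(list)
    for row in hashed:
        table[row[key]].append(row)
    for row in stream:
        for match in table.get(row[key], ()):
            yield {**row, **match}


# Hypothetical sample data, sorted on personId for the merge join.
people = [
    {"personId": 1, "name": "Ann"},
    {"personId": 2, "name": "Bob"},
    {"personId": 3, "name": "Cy"},
]
pets = [
    {"personId": 1, "petName": "Rex"},
    {"personId": 3, "petName": "Tom"},
    {"personId": 3, "petName": "Max"},
]

merged = list(merge_join(people, pets, "personId"))
hashed = list(hash_join(people, pets, "personId"))
```

On small inputs like these, both functions produce the same joined tuples, which matches what Edwin observed. The difference only shows up when the inputs are unsorted (merge_join would silently miss matches) or when the hashed side is too large to hold in memory.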
