Re: Joins with SolrCloud

2015-12-11 Thread Joel Bernstein
You can also do the innerJoin in parallel across worker nodes using the parallel function: hashJoin( parallel(workerCollection, innerJoin( search(users, q="*:*", fl="userId, full_name, hometown", sort="userId a
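To illustrate the kind of join discussed here, the following is a minimal sketch in plain Python, not Solr code. It assumes both inputs are already sorted ascending on the join key (as the `sort="userId a...` in the expression suggests) and merges them in a single pass. The `rating` field, the sample records, and the function name are hypothetical.

```python
from typing import Any, Dict, List

Record = Dict[str, Any]


def inner_join_sorted(left: List[Record], right: List[Record], key: str) -> List[Record]:
    """Merge-join two lists of records that are both sorted ascending on `key`."""
    out: List[Record] = []
    i = j = 0
    while i < len(left) and j < len(right):
        lk, rk = left[i][key], right[j][key]
        if lk < rk:
            i += 1  # no partner on the right for this left record
        elif lk > rk:
            j += 1  # no partner on the left for this right record
        else:
            # Find the run of right-side records sharing this key.
            j_end = j
            while j_end < len(right) and right[j_end][key] == lk:
                j_end += 1
            # Pair every left record with this key against that run.
            while i < len(left) and left[i][key] == lk:
                for r in right[j:j_end]:
                    out.append({**left[i], **r})
                i += 1
            j = j_end
    return out


# Hypothetical sample data, sorted by userId on both sides.
users = [
    {"userId": 1, "full_name": "Ann"},
    {"userId": 2, "full_name": "Bob"},
    {"userId": 4, "full_name": "Cy"},
]
reviews = [
    {"userId": 1, "rating": 5},
    {"userId": 1, "rating": 3},
    {"userId": 4, "rating": 4},
]

joined = inner_join_sorted(users, reviews, "userId")
```

Because each side is read once and in order, neither input has to be held in memory as a lookup table, which is why the sort on the join key matters.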

Re: Joins with SolrCloud

2015-12-11 Thread Dennis Gove
Something I forgot to mention - the collection shards can live on any number of machines, anywhere in the world. As long as the clusterstate in zk knows where the shard can be found (i.e., a basis of SolrCloud), then everything will work. The example I gave had the shards living on the same machine bu

Re: Joins with SolrCloud

2015-12-11 Thread Dennis Gove
Mugeesh, You can use Streaming Aggregation to provide various types of cross-collection joins. This is currently available in trunk and will be a part of Solr 6. To follow with your example, let's assume the following setup: Restaurants: avail on machine1:8983 with 3 shards, zk at zk1:2345 Users:

Re: Joins with SolrCloud

2013-07-04 Thread Yonik Seeley
Yes, joins support distributed search fine, provided that the individual documents that are joined reside on the same shard. For example, if you are modeling blogs and posts (one blog object has many posts): shard1 -- joe!blog_info joe!post1 shard2 -- mary!blog_info
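A minimal sketch of the co-location idea above, assuming the collection routes documents by the part of the id before the `!`. The hash used here (`zlib.crc32`) is only a stand-in for illustration and is not Solr's actual hash function. `NUM_SHARDS`, `shard_for`, and the ids `joe!post2` and `mary!post1` are hypothetical.

```python
import zlib

NUM_SHARDS = 2  # hypothetical shard count, for illustration only


def shard_for(doc_id: str, num_shards: int = NUM_SHARDS) -> int:
    """Pick a shard from the routing prefix (the text before '!')."""
    prefix = doc_id.split("!", 1)[0]
    # Stand-in hash for illustration; Solr uses its own hashing internally.
    return zlib.crc32(prefix.encode("utf-8")) % num_shards


docs = ["joe!blog_info", "joe!post1", "joe!post2", "mary!blog_info", "mary!post1"]

# Every id that shares a prefix maps to the same shard number.
placement = {doc_id: shard_for(doc_id) for doc_id in docs}
```

The point is only that the shard is derived from the prefix, so a blog and all of its posts end up together and a join between them never has to cross shards.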

Re: Joins with SolrCloud

2013-07-04 Thread slevytam
Hi Yonik, Thanks for the reply. It was very helpful. This may be a newbie question, but will this work on individual rows of a query, or do all of the query's results need to be on the same shard? E.g., if the main query would return - user15 (shard 1) - user16 (shard 2) - user17 (shard 3) is it a

Re: Joins with SolrCloud

2013-07-03 Thread Yonik Seeley
On Wed, Jul 3, 2013 at 5:40 PM, slevytam wrote: > Hi Yonik, > > Can you offer any insight as to how one might ensure that documents reside > on the same shard as the document you'd like them to join. > > For example: > I'd like to do a simple join of user actions to a specific document. So, i > w

Re: Joins with SolrCloud

2013-07-03 Thread slevytam
Hi Yonik, Can you offer any insight as to how one might ensure that documents reside on the same shard as the document you'd like them to join? For example: I'd like to do a simple join of user actions to a specific document. So, I would query for a list of documents and have the user actions on

Re: Joins with SolrCloud

2013-06-28 Thread Yonik Seeley
On Tue, Jun 25, 2013 at 7:55 PM, Upayavira wrote: > However, if from your example, innerCollection was replicated across all > nodes, I would think that should work, because all that comes back from > each server when a distributed search happens is the best 'n' matches, > so exactly how those 'n'

Re: Joins with SolrCloud

2013-06-28 Thread Chris Toomey
Thanks, confirmed by trying w/ 4.3.1 that the join works with the outer collection distributed/sharded so long as the inner collection is not distributed/sharded. Chris On Tue, Jun 25, 2013 at 4:55 PM, Upayavira wrote: > I have never heard mention that joins support distributed search, so you
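For reference, a hedged sketch of what such a request might look like, built in Python without contacting a server. It assumes the join query parser's `from`, `to`, and `fromIndex` local parameters. The field names (`userId`, `id`), the inner query (`action:like`), and the helper name are hypothetical, and `innerCollection` is taken from the example earlier in the thread.

```python
from urllib.parse import urlencode


def build_join_params(from_field: str, to_field: str, from_index: str, inner_query: str) -> dict:
    """Build request parameters for a join against another (unsharded) collection."""
    join = "{!join from=%s to=%s fromIndex=%s}%s" % (
        from_field,
        to_field,
        from_index,
        inner_query,
    )
    return {"q": join, "wt": "json"}


# Hypothetical field names and inner query.
params = build_join_params("userId", "id", "innerCollection", "action:like")

# URL-encoded form, ready to append to a select handler URL.
query_string = urlencode(params)
```

The request would go to the sharded outer collection; per the report above, this works only while the collection named in `fromIndex` is not itself sharded.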

RE: Joins with SolrCloud

2013-06-25 Thread James Thomas
IRA above -- James -Original Message- From: Upayavira [mailto:u...@odoko.co.uk] Sent: Tuesday, June 25, 2013 7:55 PM To: solr-user@lucene.apache.org Subject: Re: Joins with SolrCloud I have never heard mention that joins support distributed search, so you cannot do a join against a sharded core. H

Re: Joins with SolrCloud

2013-06-25 Thread Upayavira
I have never heard mention that joins support distributed search, so you cannot do a join against a sharded core. However, if from your example, innerCollection was replicated across all nodes, I would think that should work, because all that comes back from each server when a distributed search h