And you want to show to the users only the Lucene documents that matched the original query sent to Solr? (what if a lucene document matches only part of the query?)
From: solr-user@lucene.apache.org At: 01/23/18 13:55:46To: Diego Ceccarelli (BLOOMBERG/ LONDON ) , solr-user@lucene.apache.org Subject: RE: Using lucene to post-process Solr query results Hi Diego, Basically, each Solr document has a text field , which contains large amount of text separated by some delimiters. I split this text into parts and then assign each part to a separate lucene Document object. The field could also be multi-valued, in which case I create a Lucene document for each different value for that field in the same Solr document. Regards, Rahul -----Original Message----- From: Diego Ceccarelli (BLOOMBERG/ LONDON) [mailto:dceccarel...@bloomberg.net] Sent: Tuesday, January 23, 2018 7:17 PM To: solr-user@lucene.apache.org Subject: Re: Using lucene to post-process Solr query results Rahul, can you provide more details on how you decide that the smaller lucene objects are part of the same solr document? From: solr-user@lucene.apache.org At: 01/23/18 09:59:17To: solr-user@lucene.apache.org Subject: Re: Using lucene to post-process Solr query results Hi Rahul, Looks like Streaming expressions can probably can help you. Is there something else you have tried for this? Atita On Jan 23, 2018 3:24 PM, "Rahul Chhiber" <rahul.chhi...@cumulus-systems.com> wrote: Hi All, For our business requirement, once our Solr client (Java) gets the results of a search query from the Solr server, we need to further search across and also within the content of the returned documents. To accomplish this, I am attempting to create on the client-side an in-memory lucene index (RAMDirectory), convert the SolrDocument objects into smaller lucene Document objects, add them into the index and then search within it. Has something like this been attempted yet? And does it sound like a workable idea ? P.S. - Reason for this approach is basically that we need search on the data at a certain fine granularity but don't want to index the data at such high granularity for indexing performance reasons i.e. we need to keep the total number of documents small. Appreciate any help. Regards, Rahul Chhiber