Re: is ConcurrentUpdateSolrClient.Builder thread safe?

2018-01-11 Thread Bernd Fehling
To sum it up, there is no way to bulk load in Solr, because the order of operations is not preserved. Solr can only supply bulk loading if you really have unique data, right? By the way, the queue used is java.util.concurrent.BlockingQueue. Changing that to ArrayBlockingQueue (to force FIF
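A minimal sketch of the ordering point raised above, using only the JDK. It is not the SolrJ implementation; the class and method names are hypothetical. It shows that an ArrayBlockingQueue hands items out in FIFO order when a single consumer drains it. With several consumer threads the queue is still FIFO, but the order in which the consumers finish their work is no longer guaranteed, which is the concern in this thread.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;

// Hypothetical illustration class, not part of SolrJ.
class FifoQueueSketch {

    // Fills a bounded FIFO queue with 0..count-1 and drains it with ONE
    // consumer thread. Returns the items in the order the consumer saw them.
    // count must be at least 1 (ArrayBlockingQueue rejects a capacity of 0).
    static List<Integer> drainWithSingleConsumer(int count) {
        BlockingQueue<Integer> queue = new ArrayBlockingQueue<>(count);
        for (int i = 0; i < count; i++) {
            queue.add(i); // never blocks here: capacity equals count
        }

        List<Integer> seen = new ArrayList<>();
        Thread consumer = new Thread(() -> {
            Integer item;
            // poll() returns null once the queue is empty
            while ((item = queue.poll()) != null) {
                seen.add(item);
            }
        });
        consumer.start();
        try {
            consumer.join(); // join makes the consumer's writes visible here
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while draining", e);
        }
        return seen;
    }
}
```

With one consumer the returned list matches insertion order. Starting a second consumer on the same queue would need a thread-safe result list, and the combined order would then depend on thread scheduling.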

Re: is ConcurrentUpdateSolrClient.Builder thread safe?

2018-01-11 Thread Shawn Heisey
On 1/11/2018 1:38 AM, Bernd Fehling wrote: To sum it up, there is no way for bulk loading in solr, due to the lack of preserving the order of operation. Solr can only supply bulk loading if you really have unique data, right? Bulk loading implies that every document is inserted exactly once and

Re: Regarding document routing

2018-01-11 Thread Shawn Heisey
On 1/10/2018 11:00 PM, manish tanger wrote: As we are connecting through ZooKeeper, my understanding was that routing will be done by ZooKeeper. Thanks for the clarification. CloudSolrClient doesn't actually connect through ZK. When you create the client using ZK info, the client reads information a

Call for Presentations FOSS Backstage open

2018-01-11 Thread Isabel Drost-Fromm
Hi, As announced on Berlin Buzzwords we (that is Isabel Drost-Fromm, Stefan Rudnitzki as well as the eventing team over at newthinking communications GmbH) are working on a new conference in summer in Berlin. The name of this new conference will be "FOSS Backstage". Backstage comprises all things

RE: Very high number of deleted docs, part 2

2018-01-11 Thread Markus Jelsma
Yes, I made sure the large test segment had just over 10% deleted documents. But all that expungeDeletes did was merge that segment with itself, making it just 10% smaller. It makes sense though. Optimizing with maxSegments is also not a possibility, it will just merge the cheapest segments

Re: Spatial search (and nested docs)

2018-01-11 Thread Emir Arnautović
Hi Leila, You should be able to pass score parameter that will tell how to combine children’s scores to parent’s score: https://lucene.apache.org/solr/guide/6_6/other-parsers.html#OtherParsers-Scoring Maybe you could change approach and index each cell as a separate doc with layer field and use

Re: Learning to Rank (LTR) with grouping

2018-01-11 Thread Roopa Rao
The solution that I have currently implemented is: Since we have a web application which takes the Solr results and displays them in the UI, and I need LTR enabled for only one of the groups, I am executing two parallel queries to Solr from the web app. 1st query Get grouped results without LTR 2nd query Get results wi

Re: Ingestion not scaling horizontally as I add more cores to Solr

2018-01-11 Thread Shashank Pedamallu
Thank you very much for the reply Shawn. Is the jmeter running on a different machine from Solr or on the same machine? Solr is running on a dedicated VM. And I’ve tried to split the client requests from multiple machines but the result was not different. So, I don’t think the bottleneck is with

Re: Ingestion not scaling horizontally as I add more cores to Solr

2018-01-11 Thread Kevin Risden
When you say "multiple machines", were these all local machines or VMs or something else? I worked with a group once that used laptops to benchmark a service and it was a WiFi network limit that caused weird results. LAN connections or even better a dedicated client machine would help push more docu

Re: Ingestion not scaling horizontally as I add more cores to Solr

2018-01-11 Thread Shashank Pedamallu
Thank you for the reply Kevin. I was using 6 VMs from our private cloud. I was using 5 of them as clients to ingest data on 5 independent cores. One VM is hosting Solr, which is where all ingest requests are received for all cores. Since they are all on the same network, I think they should n

Re: ClassicTokenizer

2018-01-11 Thread Steve Rowe
Hi Rick, Quoting Robert Muir’s comments on https://issues.apache.org/jira/browse/LUCENE-2167 (he’s referring to the word break rules in UAX#29[1] when he says “the standard”): > i actually am of the opinion StandardTokenizer should follow unicode standard > tokenization. then we can throw sub

Re: Learning to Rank (LTR) with grouping

2018-01-11 Thread Diego Ceccarelli (BLOOMBERG/ LONDON)
Roopa, did you try collapsing instead of grouping? It would work with LTR. From: solr-user@lucene.apache.org At: 01/11/18 16:48:26 To: solr-user@lucene.apache.org Subject: Re: Learning to Rank (LTR) with grouping Solution that I implemented currently is: Since we have a web application which t

Re: Learning to Rank (LTR) with grouping

2018-01-11 Thread Roopa Rao
Hi Diego, I tried collapsing, unfortunately we are using a third party tool for Query processing, that does not support collapsing. Hence I am unable to go that route. Thank you, Roopa On Thu, Jan 11, 2018 at 2:59 PM, Diego Ceccarelli (BLOOMBERG/ LONDON) < dceccarel...@bloomberg.net> wrote: > R

Re: Ingestion not scaling horizontally as I add more cores to Solr

2018-01-11 Thread Shawn Heisey
On 1/11/2018 11:50 AM, Shashank Pedamallu wrote: > Thank you for the reply Kevin. I was using 6 vms from our private cloud. 5 > among them, I was using as clients to ingest data on 5 independent cores. One > vm is hosting the Solr which is where all ingest requests are received for > all cores.

Combine Results with Two different Collections.

2018-01-11 Thread Fiz Newyorker
Hi Team, Could you please let me know how to handle the below scenario. I have two collections, *Accounts & Content.* I am trying to search for the term *"biodata"*. From the Accounts collection I am getting the output *sample : your biodata*. From the Content collection I am getting the output *title :

Re: Combine Results with Two different Collections.

2018-01-11 Thread Mikhail Khludnev
I believe it supports collection=accounts,content param, but can't find it in documentation. On Fri, Jan 12, 2018 at 2:27 AM, Fiz Newyorker wrote: > Hi Team, > > Could you please let me know how to handle the below scenario. > > I have Two Collections *Accounts & Content.* > > I am trying to se

Re: regarding exposing merge metrics

2018-01-11 Thread suresh pendap
Hi Shawn, Thanks for replying to my questions. So is it correct to assume that exposing merge metrics is not known to cause any performance degradation? -suresh On Wed, Jan 10, 2018 at 5:40 PM, Shawn Heisey wrote: > On 1/10/2018 11:08 AM, S G wrote: > >> Last comment by Shawn on SOLR-10130 is:

Re: Combine Results with Two different Collections.

2018-01-11 Thread Fiz Newyorker
Yes Mikhail, even I couldn't find it. Hi Solr-user team, Requesting your support. Thanks Fiz. On Thu, Jan 11, 2018 at 8:06 PM, Mikhail Khludnev wrote: > I believe it supports collection=accounts,content param, but can't find it > in documentation. > > On Fri, Jan 12, 2018 at 2:27 AM, Fiz Newy

Re: Combine Results with Two different Collections.

2018-01-11 Thread Suman Saurabh
Try using the Solr streaming API. https://lucene.apache.org/solr/guide/6_6/streaming-expressions.html Sample query: innerJoin( select(search(, q=, fl=, sort=, qt="/export"), ), select(search(, q=, fl=, sort=, qt="/export"), ), on=) Note : Both collections must have at least one common field to
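The shape of the sample query above can be sketched as a small string builder. This is only an illustration under stated assumptions: the collection names accounts and content come from the question in this thread, while the join field id, the field lists, and the class and method names are hypothetical. The select(...) wrappers from the sample are left out for brevity. Streaming joins expect both streams to be sorted on the join field, so the sketch sorts each search on it.

```java
// Hypothetical helper that only builds the expression text; it does not
// talk to Solr.
class StreamingJoinSketch {

    // Builds one search(...) stream that reads from the /export handler.
    static String search(String collection, String query, String fields, String sort) {
        return "search(" + collection
            + ", q=\"" + query + "\""
            + ", fl=\"" + fields + "\""
            + ", sort=\"" + sort + "\""
            + ", qt=\"/export\")";
    }

    // Wraps two streams in innerJoin(...), joined on a field both share.
    static String innerJoin(String left, String right, String onField) {
        return "innerJoin(" + left + ", " + right + ", on=\"" + onField + "\")";
    }

    public static void main(String[] args) {
        // "id" is an assumed common field; both streams are sorted on it.
        String left = search("accounts", "biodata", "id,sample", "id asc");
        String right = search("content", "biodata", "id,title", "id asc");
        System.out.println(innerJoin(left, right, "id"));
    }
}
```

The resulting text would be sent as the expr parameter to a collection's /stream handler. The fields named in fl and sort need docValues for the /export handler to return them.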