[ANNOUNCE] Luke 7.3.1 released

2018-05-15 Thread Tomoko Uchida
Download the release zip here: https://github.com/DmitryKey/luke/releases/tag/luke-7.3.1 - Upgrade to Lucene 7.3.1. - UI Framework has been sw

Re: Error using multiple terms in function query

2018-05-15 Thread Yonik Seeley
Problems like this are usually caused by the whole query not even making it to Solr due to bad HTTP param encoding. For example, if you're using curl with request parameters in the URL, you need to manually encode spaces as either "+" or "%20" -Yonik On Tue, May 15, 2018 at 7:41 PM, Shamik Bando

Error using multiple terms in function query

2018-05-15 Thread Shamik Bandopadhyay
Hi, I'm having issues using multiple terms in Solr function queries. For e.g. I'm trying to use the following bf function using termfreq bf=if(termfreq(ProductLine,'Test Product'),5,0) This throws org.apache.solr.search.SyntaxError: Missing end to unquoted value starting at 28 str='if(termfre

[JOB] Senior or Principal Search Platform Engineer - Ancestry - San Francisco, CA, USA

2018-05-15 Thread Todd Cranston-Cuebas
I want to be respectful of your user list’s etiquette for posting job announcements. After speaking with Mark Miller, he suggested that if the subject is labeled appropriately the Solr community should welcome learning about a new job opportunity. Please, if you don’t agree, contact me directly

Re: question about updates to shard leaders only

2018-05-15 Thread Mark Miller
Yeah, basically ConcurrentUpdateSolrClient is a shortcut to getting multi threaded bulk API updates out of the single threaded, single update API. The downsides to this are: It is not cloud aware - you have to point it at a server, you have to add special code to see if there are any errors, you do

Re: question about updates to shard leaders only

2018-05-15 Thread Erick Erickson
bq. But don't forget a final client.add(list) after the while-loop ;-) Ha! But only "if (list.size() > 0)" And then there was the memorable time I forgot the "list.clear()" when I sent the batch and wondered why my indexing progress got slower and slower... Not to mention the time I re-used the

Re: question about updates to shard leaders only

2018-05-15 Thread Shawn Heisey
On 5/15/2018 12:12 AM, Bernd Fehling wrote: OK, I have the CloudSolrClient with SolrJ now running but it seams a bit slower compared to ConcurrentUpdateSolrClient. This was not expected. The logs show that CloudSolrClient send the docs only to the leaders. So the only advantage of CloudSolrClien

Re: question about updates to shard leaders only

2018-05-15 Thread Bernd Fehling
Am 15.05.2018 um 14:33 schrieb Erick Erickson: You might find this useful: https://lucidworks.com/2015/10/05/really-batch-updates-solr-2/ I have seen that already and can confirm it. From my observations about a 3x3 cluster with 3 server and my hardware: - have at least 6 CPUs on each server

Re: question about updates to shard leaders only

2018-05-15 Thread Erick Erickson
You might find this useful: https://lucidworks.com/2015/10/05/really-batch-updates-solr-2/ One tricky bit: Assuming docs have a random distribution amongst shards, you should batch so at least 100 docs go to each _shard_. You can see from the link that the speedup is mostly going from 1 to 100. S

Re: question about updates to shard leaders only

2018-05-15 Thread Bernd Fehling
Hi Erik, yes indeed, batching solved it. I used ConcurrentUpdateSolrClient with queue size of 1 but CloudSolrClient doesn't have this feature. I build my own queue now. Ah!!! So I obviously use default NRT but actually don't need it because I don't have any NRT data to index. A latency of se

Re: question about updates to shard leaders only

2018-05-15 Thread Erick Erickson
What did you do to solve your performance problem? Batching updates is one thing that helps performance. bq. I thought that only the leaders are under load until any commit and then replicate to the other replicas. True if (and only if) you're using PULL or TLOG replicas. When using the default

Re: question about updates to shard leaders only

2018-05-15 Thread Bernd Fehling
Thanks, solved, performance is good now. Regards, Bernd Am 15.05.2018 um 08:12 schrieb Bernd Fehling: OK, I have the CloudSolrClient with SolrJ now running but it seams a bit slower compared to ConcurrentUpdateSolrClient. This was not expected. The logs show that CloudSolrClient send the docs o

Re: Regarding LTR feature

2018-05-15 Thread prateek.agarwal
Hi Alessandro, "You are talking about weights so I assume you are using a linear Learning To Rank model. Which library are you using to train your model? Is this library allowing you to limit the summation of the linear weights and normalise the training set per feature? " Yes, we're planning