RE: Solr 7.7 Cloud - can it adapt to Zookeeper address changes while running?

2019-02-25 Thread Addison, Alex (LNG-LON)
Is there a guide to doing this with elastic IPs or Route 53 that you're aware of? If not, I'll aim to publish one as a blog entry. This is the solution we're working on, but I thought it was worth asking the community in case there was a simpler way. -Original Message- From: Jörn Franke

MLT and facetting

2019-02-25 Thread Martin Frank Hansen (MHQ)
Hi, I am trying to combine the mlt functionality with facets, but Solr throws org.apache.solr.common.SolrException: ":"Unable to compute facet ranges, facet context is not set". What I am trying to do is quite simple, find similar documents using mlt and group these using the facet parameter.

Re: Is anyone using proxy caching in front of solr?

2019-02-25 Thread Edward Ribeiro
Maybe you could add a length filter factory to filter out queries with 2 or 3 characters using https://lucene.apache.org/solr/guide/7_4/filter-descriptions.html#FilterDescriptions-LengthFilter ? PS: this filter requires a max length too. Edward Em qui, 21 de fev de 2019 04:52, Furkan KAMACI esc

Re: Is anyone using proxy caching in front of solr?

2019-02-25 Thread Walter Underwood
Don’t worry about one and two character queries, because they will almost always be served from cache. There are only 26 one-letter queries (36 if you use numbers). Almost all of those will be in the query results cache and will be very fast with very little server load. The common two-letter q

Re: Is anyone using proxy caching in front of solr?

2019-02-25 Thread Michael Gibney
Tangentially related, possibly of interest regarding solr-internal cache hit ratio (esp. with a lot of replicas): https://issues.apache.org/jira/browse/SOLR-13257 On Mon, Feb 25, 2019 at 11:33 AM Walter Underwood wrote: > Don’t worry about one and two character queries, because they will almost

Re: Is anyone using proxy caching in front of solr?

2019-02-25 Thread Walter Underwood
Multiple caches can have the same hit rate as a single cache if the same query is always sent back to the same replica. This works great until a replica goes down. If the queries are redistributed, all the caches have the wrong content, very expensive. Instead. the queries need to be redistribut

how to get high-availability for Solr csv update handler?

2019-02-25 Thread Ganesh Sethuraman
Hi We are using Solr Cloud 7.2.1. We are using Solr CSV update handler to do bulk update (several Millions of docs) in to multiple collections. When we make a call to the CSV update handler using curl command line (as below), we are pointing to single server in Solr. During the problem time, when

Schema configuration field defaults

2019-02-25 Thread Dionte Smith
Hi, I have two questions about the field default values for multivalued and indexed. 1. Is it possible to make new fields have the indexed attribute set to false by default for a schema? I understand this wouldn't normally be the case, but we have a use case where it would be preferable as

Re: how to get high-availability for Solr csv update handler?

2019-02-25 Thread Shawn Heisey
On 2/25/2019 11:15 AM, Ganesh Sethuraman wrote: We are using Solr Cloud 7.2.1. We are using Solr CSV update handler to do bulk update (several Millions of docs) in to multiple collections. When we make a call to the CSV update handler using curl command line (as below), we are pointing to single

Re: Schema configuration field defaults

2019-02-25 Thread Erick Erickson
Sure. In both cases define a fieldType with those attributes set however you want. Any field that is defined with that fieldType will have the defaults you specify unless overridden on the field definition itself. Best, Erick > On Feb 25, 2019, at 9:08 AM, Dionte Smith wrote: > > Hi, > > I h

Re: how to get high-availability for Solr csv update handler?

2019-02-25 Thread Ganesh Sethuraman
Thanks for details and updates. We are looking at load balancers not because of the little improvement in performance. But more for high availability. Other alternative is, if the update fails on one server using curl, on error we have to call another SOLR server. I was looking to see if there any

Re: how to get high-availability for Solr csv update handler?

2019-02-25 Thread Walter Underwood
We send batches of updates to a load balancer. The cluster gets the updates to the right leader with very little overhead. When we get an error, we resend the update batch. The load balancer will find a healthy node to receive it. This is simple, robust, and fast. One handy tip: if a batch fail

Re: MLT and facetting

2019-02-25 Thread Zheng Lin Edwin Yeo
Hi Martin, I think there are some pictures which are not being sent through in the email. Do send your query that you are using, and which version of Solr you are using? Regards, Edwin On Mon, 25 Feb 2019 at 20:54, Martin Frank Hansen (MHQ) wrote: > Hi, > > > > I am trying to combine the mlt

Re: MLT and facetting

2019-02-25 Thread Dave
Use the mlt to get the queries to use for getting facets in a two search approach > On Feb 25, 2019, at 10:18 PM, Zheng Lin Edwin Yeo > wrote: > > Hi Martin, > > I think there are some pictures which are not being sent through in the > email. > > Do send your query that you are using, and wh

RE: MLT and facetting

2019-02-25 Thread Martin Frank Hansen (MHQ)
Hi Edwin, Thanks for your response. Yes you are right. It was simply the search parameters from Solr. The query looks like this: http://.../solr/.../mlt?df=text&facet.field=Journalnummer&facet=on&fl=id,Journalnummer&q=id:*6512815* best regards, Martin Internal - KMD A/S -Original Mes

RE: MLT and facetting

2019-02-25 Thread Martin Frank Hansen (MHQ)
Hi Dave, Thanks for your suggestion, I was under the impression that you could do it in one-search approach. But if that’s not possible I will try to divide into two searches. Is the best way to do this through Solrj? Best regards Martin Internal - KMD A/S -Original Message- Fro

LTR feature based on other collection data

2019-02-25 Thread Kamal Kishore Aggarwal
Hi, I am working on LTR using solr 6.6.2. I am working on custom feature creation. I am able to create few custom features as per our requirement. But, there are certain features, for which the data is stored in other collection. Data like count of clicks, last date when the product was ordered,

RE: MLT and facetting

2019-02-25 Thread Martin Frank Hansen (MHQ)
Sorry forgot to mention that we are using Solr 7.5. Internal - KMD A/S -Original Message- From: Martin Frank Hansen (MHQ) Sent: 26. februar 2019 07:43 To: solr-user@lucene.apache.org Subject: RE: MLT and facetting Hi Edwin, Thanks for your response. Yes you are right. It was simpl

SOLR Tokenizer “solr.SimplePatternSplitTokenizerFactory” splits at unexpected characters

2019-02-25 Thread Stephan Damson
Hi! I'm having unexpected results with the solr.SimplePatternSplitTokenizerFactory. The pattern used is actually from an example in the SOLR documentation and I do not understand where I made a mistake or why it does not work as expected. If we take the example input "operative", the analyzer sh

Re: MLT and facetting

2019-02-25 Thread Zheng Lin Edwin Yeo
Hi Martin, What is your setting in your /mlt requestHandler in solrconfig.xml? Regards, Edwin On Tue, 26 Feb 2019 at 14:43, Martin Frank Hansen (MHQ) wrote: > Hi Edwin, > > Thanks for your response. > > Yes you are right. It was simply the search parameters from Solr. > > The query looks like