the options of ManagedSynonym(Graph)Filter

2020-02-05 Thread Kayak28
Hello, Community Members: I use Solr 8.4.1 and the primary language of the documents in Solr is Japanese. I have a question about ManagedSynonymFilters. Before that, let me introduce my story first. I would like to use ManagedSynonymGraphFilter, and ManagedSynonymFilter, for my synonym expansion.

Re: Bug? Documents not visible after successful commit - chaos testing

2020-02-05 Thread Chris Hostetter
I may be misunderstanding something in your setup, and/or I may be misremembering things about Solr, but I think the behavior you are seeing is because *search* in Solr is "eventually consistent" -- while "RTG" (ie: using the "/get" handler) is (IIRC) "strongly consistent" ie: there's a rea

Advice - on recovering shards

2020-02-05 Thread dj-manning
Hello, I've run into a situation where 1 shard in a 6-shard, 2-replica index became unavailable because neither replica was available: 1 replica was in a recovery state and the other replica was down. I believe the best course of action would be to: 1. allow the recovering shard to complet

Checking in on Solr Progress

2020-02-05 Thread dj-manning
Hi - I'm wondering if you would be able to point me in the right direction - I'm looking for the best way to check Solr recovery progress and status. I've seen a replica fall into recovery and I was wondering where I should look to monitor progress. Thank you in advance. -- Sent from: https://

NoClassDefFoundError - Faceting on 8.2.0

2020-02-05 Thread Joe Obernberger
Hi All - getting this error intermittently on a SolrCloud cluster. Sometimes the heatmap generation works, sometimes not. I tracked it down to some of the nodes reporting this error: null:java.lang.RuntimeException: java.lang.NoClassDefFoundError: Could not initialize class org.apache.s

StatelessScriptUpdateProcessorFactory causing OOM errors?

2020-02-05 Thread Haschart, Robert J (rh9ec)
I've recently started looking at using the updateRequestProcessorChain to ensure the presence of certain fields in our Solr records. The reason for doing so is that we have records from several different sources that are processed in different ways, and by adding the field via the updateRequ

Re: Need help in configuring Spell check in Apache Solr 8.4

2020-02-05 Thread kumar gaurav
Hi Seetesh, For IndexBasedSpellchecker the default distanceMeasure is LevensteinDistance itself. That's why it is commented out in the Reference Guide. Regards, Kumar Gaurav On Tue, Jan 28, 2020 at 1:01 PM seeteshh wrote: > Hello Kumar Gaurav > > For IndexBasedSpellchecker is there a better option of us

DataImportHandler SolrEntityProcessor configuration for local copy

2020-02-05 Thread Karl Stoney
Hey All, I'm trying to implement a simplistic reindex strategy to copy all of the data out of one collection, into another, on a single node (no distributed queries). It's approx 4 million documents, with an index size of 26gig. Based on your experience, I'm wondering what people feel sensible

Re: Solr Cloud on Docker?

2020-02-05 Thread Karl Stoney
Nothing much to add to the below apart from we also successfully run solr on kubernetes. It took some implementation effort but we're now at a point where we can do `kubectl scale --replicas=x statefulset/solr` and increase capacity in minutes with solr's autoscaling taking care of the new shar

Re: Solr Cloud on Docker?

2020-02-05 Thread Dominique Bejean
Thank you Dwane. Great info :) Le mer. 5 févr. 2020 à 11:49, Dwane Hall a écrit : > Hey Dominique, > > From a memory management perspective I don't do any container resource > limiting specifically in Docker (although as you mention you certainly > can). In our circumstances these hosts are us

Re: Number of requested rows

2020-02-05 Thread Emir Arnautović
Hi Toke, Thanks for the post. Good that things are moving forward! It took a while! Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Support Training - http://sematext.com/ > On 5 Feb 2020, at 15:23, Toke Eskildsen wrote: > > On Wed, 2020-02-0

Re: Number of requested rows

2020-02-05 Thread Toke Eskildsen
On Wed, 2020-02-05 at 13:00 +0100, Emir Arnautović wrote: > I was thinking in that direction. Do you know where it is in the > codebase or which structure is used - I am guessing some array of > objects? Yeah. More precisely a priority queue of Objects, initialized with sentinel Objects. rows=1000
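The point made in this reply, a priority queue allocated up front and filled with sentinel objects, can be illustrated with a small sketch. This is not Lucene's HitQueue: the class name SentinelTopK and its methods are hypothetical, scores stand in for full hit objects, and the only claim is the general technique, namely that the heap is sized by the requested rows value before any hit is collected, so the allocation cost does not depend on how many documents actually match.

```java
import java.util.Arrays;

/** Hypothetical illustration of a sentinel-prefilled top-k heap; not Lucene's HitQueue. */
final class SentinelTopK {
    // Sentinel score that every real hit beats.
    static final float SENTINEL = Float.NEGATIVE_INFINITY;

    // Min-heap of scores; heap[0] is always the weakest score currently kept.
    private final float[] heap;

    SentinelTopK(int rows) {
        if (rows < 1) {
            throw new IllegalArgumentException("rows must be >= 1");
        }
        heap = new float[rows];      // allocated eagerly, before any hit is seen
        Arrays.fill(heap, SENTINEL); // an all-equal array is already a valid min-heap
    }

    /** Number of slots allocated up front; equals the requested rows. */
    int allocatedSlots() {
        return heap.length;
    }

    /** Offer one hit; it replaces the current weakest entry if it is competitive. */
    void collect(float score) {
        if (score <= heap[0]) {
            return; // not better than the weakest kept entry (or a sentinel)
        }
        heap[0] = score;
        siftDown(0);
    }

    private void siftDown(int i) {
        int n = heap.length;
        while (true) {
            int left = 2 * i + 1;
            int right = left + 1;
            int smallest = i;
            if (left < n && heap[left] < heap[smallest]) smallest = left;
            if (right < n && heap[right] < heap[smallest]) smallest = right;
            if (smallest == i) return;
            float tmp = heap[i];
            heap[i] = heap[smallest];
            heap[smallest] = tmp;
            i = smallest;
        }
    }

    /** Real hits only (sentinels dropped), best score first. */
    float[] topDescending() {
        float[] copy = heap.clone();
        Arrays.sort(copy); // ascending, so any remaining sentinels come first
        int start = 0;
        while (start < copy.length && copy[start] == SENTINEL) start++;
        float[] out = new float[copy.length - start];
        for (int i = 0; i < out.length; i++) {
            out[i] = copy[copy.length - 1 - i];
        }
        return out;
    }

    public static void main(String[] args) {
        SentinelTopK queue = new SentinelTopK(1000);
        queue.collect(1.0f); // a single matching document
        System.out.println("slots allocated: " + queue.allocatedSlots());
        System.out.println("real hits kept:  " + queue.topDescending().length);
    }
}
```

In this sketch a queue built for rows=1000 holds 1000 slots even when a single document matches, which is the kind of overhead the question in this thread is about.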

Bug? Documents not visible after successful commit - chaos testing

2020-02-05 Thread Michael Frank
Hi All, In our Solr Cloud cluster (8.4.1), committed documents are sometimes not visible to subsequent requests sent after an apparently successful commit (waitFlush=true, waitSearcher=true). This behaviour does not happen if all nodes are stable, but will happen eventually if we kill off random node

Re: Submitting suggestions?

2020-02-05 Thread Erick Erickson
Create an account in the JIRA system and go for it. https://issues.apache.org/jira Lots of info here: https://cwiki.apache.org/confluence/display/solr/HowToContribute#HowToContribute-JIRAtips(ourissue/bugtracker) Best, Erick > On Feb 5, 2020, at 4:03 AM, BlackIce wrote: > > Hi, > > I'm a co

Re: Number of requested rows

2020-02-05 Thread Emir Arnautović
Thanks a lot! Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Support Training - http://sematext.com/ > On 5 Feb 2020, at 13:27, Mikhail Khludnev wrote: > > Hi, Emir. > > Please check callers of org.apache.lucene.search.HitQueue.HitQueue(int

Re: Number of requested rows

2020-02-05 Thread Mikhail Khludnev
Hi, Emir. Please check callers of org.apache.lucene.search.HitQueue.HitQueue(int, boolean); you may find the alternative usage you are probably looking for. On Wed, Feb 5, 2020 at 3:01 PM Emir Arnautović wrote: > Hi Mikhail, > I was thinking in that direction. Do you know where it is in the code

Re: Number of requested rows

2020-02-05 Thread Emir Arnautović
Hi Mikhail, I was thinking in that direction. Do you know where it is in the codebase or which structure is used - I am guessing some array of objects? Thanks, Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr & Elasticsearch Consulting Support Training - http://sematext.com

Re: Number of requested rows

2020-02-05 Thread Mikhail Khludnev
Absolutely. The searcher doesn't know the number of hits a priori. It eagerly allocates the results heap before collecting results. The only cap I'm aware of is maxDocs. On Wed, Feb 5, 2020 at 2:42 PM Emir Arnautović wrote: > Hi, > Does somebody know if requested number of rows is used internally to set > som

Number of requested rows

2020-02-05 Thread Emir Arnautović
Hi, Does somebody know if requested number of rows is used internally to set some temp structures? In other words will query with rows=100 be more expensive than query with rows=1000 if number of hits is 1000? Thanks, Emir -- Monitoring - Log Management - Alerting - Anomaly Detection Solr &

Re: Solr Cloud on Docker?

2020-02-05 Thread Dwane Hall
Hey Dominique, From a memory management perspective I don't do any container resource limiting specifically in Docker (although as you mention you certainly can). In our circumstances these hosts are used specifically for Solr so I planned and tested my capacity beforehand. We have ~768G o

Submitting suggestions?

2020-02-05 Thread BlackIce
Hi, I'm a committer at another project. I just came back from FOSDEM and saw some very interesting things, some of which I think the Solr-Lucene project could benefit from... How do I go about opening a "Ticket"? Thnx Have a Great day!

Re: Filtered join in Solr?

2020-02-05 Thread Radu Gheorghe
Hi Edward, Thanks a lot for your reply! Subquery is what I had in mind, too, for designs 1) and 3) to bring back the other side of the relationship. Except that I always queried movies and subqueried users. If I do it the other way around, like you did, then I'm able to filter. I can't quite fil