Cannot set pollInterval in SolrCloud for PULL or TLOG replica

2019-04-15 Thread Dmitry Vorotilin
Hi everyone, We have SolrCloud cluster with 3 zk and 3 solr nodes. It's 1 shard only and all replicas are PULL. We have bulk updates so like once a day we reindex all cores (no soft commits, only hard commit every 15s), do commit with openSearcher=true and all our indexes become available for sear

Re: Multivalue Field lookup

2019-04-15 Thread Kumaresh AK
Thanks Edwin for the information. I will try highlighting.. On Sun, Apr 14, 2019 at 7:53 PM Zheng Lin Edwin Yeo wrote: > You need to set the field which you want to highlight in the hl.fl > parameter in your solrconfig.xml. > Highlighting can work in String field. > > Regards, > Edwin > > On Mon

Re: Optimal RAM to size index ration

2019-04-15 Thread Jeff Courtade
In linux the os will cache files in ram for quick reading. You can force the into ram by doing cat filename >/dev/null I do this with all my index files after a reboot and see better performance times on queries. Optimal ram is enough ram for all the indexes plus jvm plus 20 percent... Generall

Mistake assert tips in FST builder ?

2019-04-15 Thread zhenyuan wei
Hi, With current newest version, 9.0.0-snapshot,In Builder.UnCompileNode.addArc() function, found this line: assert numArcs == 0 || label > arcs[numArcs-1].label: "arc[-1].label=" + arcs[numArcs-1].label + " new label=" + label + " numArcs=" + numArcs; Maybe assert tips is : assert numArcs ==

Re: Solr 8.0.0 Delta import add/delete data

2019-04-15 Thread Zheng Lin Edwin Yeo
Hi Anuj, I think it could be faster and cleaner to re-index, since the indexing takes only 4 minutes, and you require records to be deleted. Also, when you delete records in Solr, it only annotates them as deleted for the purpose of searching. The space used by those documents will be reclaimed w

JSON Facet query to retrieve count all collections in Solr 8.0.0

2019-04-15 Thread Zheng Lin Edwin Yeo
Hi, I am using the below JSON Facet to retrieve the count of all the different collections in one query. https://localhost:8983/solr/collection1/select?q=testing&shards=https://localhost:8983/solr/collection1,https://localhost:8983/solr/collection2,https://localhost:8983/solr/collection3,https://

Re: Topic & Executor streaming expressions

2019-04-15 Thread Joel Bernstein
This blog covers this topic is some depth: https://joelsolr.blogspot.com/2017/01/deploying-solrs-new-parallel-executor.html Joel Bernstein http://joelsolr.blogspot.com/ On Mon, Apr 15, 2019 at 11:19 AM Nazerke S wrote: > Hi everyone! > > Can anyone elaborate the topic and executor streaming

Solr 7 Nodes Suck in "Gone" State

2019-04-15 Thread Marko Babic
Hi everyone, I ran into a situation in which a portion of the nodes in a Solr cluster were stuck in the GONE state (no ephemeral znode under /live_nodes) despite the Solr processes being alive and connected to the ZooKeeper tier. For context this a cluster running Solr 7.7.1 and ZooKeeper 3.4.13

Re: Highlighting

2019-04-15 Thread Shawn Heisey
On 4/15/2019 11:36 AM, Mike Phillips wrote: I don't understand why highlighting does not return anything but the document id. I created a core imported all my data, everything seems like it should be working. From reading the documentation I expect it to show me highlight information for asset

Highlighting

2019-04-15 Thread Mike Phillips
I don't understand why highlighting does not return anything but the document id. I created a core imported all my data, everything seems like it should be working. From reading the documentation I expect it to show me highlight information for assetName around Potter, but I never get anything bu

Re: Optimal RAM to size index ration

2019-04-15 Thread SOLR4189
All my queries from production environments, from real customers. I build query player that runs queries in the same time intervals like in PRODUCTION (all customers' queries with time intervals between them are saved in splunk). So all queries are distinct. -- Sent from: http://lucene.472066.n3

Re: Optimal RAM to size index ration

2019-04-15 Thread Erick Erickson
To pile on a bit: Your *.fdt files contain “stored=true” data. By and large I ignore them for this discussion. Say I execute a query with “rows=10”. The fdt (and fdx) files are only accessed for the 10 docs returned so they have little impact on query time. Or rather, they have a reasonably con

Topic & Executor streaming expressions

2019-04-15 Thread Nazerke S
Hi everyone! Can anyone elaborate the topic and executor streaming expressions? What I understand from the Solr reference guide was that the topic allows for subscribing to a query. So that whenever I execute a query, it returns the tuples that are not seen yet ?? What about executor function? Is

Re: Optimal RAM to size index ration

2019-04-15 Thread BlackIce
I'm not that proficient with Solr.. I used it, but I'd yet have to fully dive into it, but this topic really interests me. In those 8 hour tests, does ALL information get accessed, or just partial? That could be a reason as to why you don't see any difference, that the test in that time period onl

Re: Shard and replica went down in Solr 6.1.0

2019-04-15 Thread vishal patel
Thanks for your reply. Get Outlook for Android From: Shawn Heisey Sent: Monday, April 15, 2019 12:40:59 AM To: solr-user@lucene.apache.org Subject: Re: Shard and replica went down in Solr 6.1.0 On 4/13/2019 9:29 PM, vishal patel wrote

Re: Optimal RAM to size index ration

2019-04-15 Thread Shawn Heisey
On 4/15/2019 7:25 AM, SOLR4189 wrote: I have a collection with many shards. Each shard is in separate SOLR node (VM) has 40Gb index size, 4 CPU and SSD. When I run performance checking with 50GB RAM (10Gb for JVM and 40Gb for index) per node and 25GB RAM (10Gb for JVM and 15Gb for index), I get

Re: Optimal RAM to size index ration

2019-04-15 Thread Emir Arnautović
Hi, The recommendation to have RAM enough to place your entire index into memory is sort of worst case scenario (maybe better called the best case scenario) where your index is optimal and is fully used all the time. OS will load pages that are used and those that might be used to memory, so eve

CDCR - shards not in sync

2019-04-15 Thread Jay Potharaju
Hi, I have a collection with 8 shards. 6 out of the shards are in sync but the other 2 are lagging behind by more than 10 plus hours. The tlog is only 0.5 GB in size. I have tried stopping and starting CDCR number of times but it has not helped. >From what i have noticed there is always a shard tha

Re: Optimal RAM to size index ration

2019-04-15 Thread SOLR4189
No, I don't load index to RAM, but I run 8 hours queries, so OS must load necessary files (segments) to RAM during my tests. So in the case where I set 25GB for RAM, not all files will be loaded to RAM and I thought I'll see degradation in queries times, but I didn't -- Sent from: http://lucene.

Re: Filters and data cleansing

2019-04-15 Thread Emir Arnautović
Hi Ken, What Solr returns is stored value which is original value. Analysis is applied and its result is stored as “index” and is used for searching. In order to get what you want, you have to move analysis at least one step earlier. It can be moved to update request processor chain where you ap

Re: Optimal RAM to size index ration

2019-04-15 Thread BlackIce
Do you load the index onto a RAM disk? I was under the impression that the JVM had everything contained that had to do with SOLR (I might be wrong), if thats the case and you are not loading the Index onto a ram disk then you won't see any difference. in either scenario, I don't think you would see

Filters and data cleansing

2019-04-15 Thread Ken Wiltshire
hello experts. I have what is probably a simple question. Feels like it should be. i have some filters set up on INDEX. Lets say "lowercasefilterfactory" for instance. I understand the data will be indexed as lowercase but when i qry this same data its still in its original form. This works f

Re: Set Basic Auth to CloudSolrStream

2019-04-15 Thread Jan Høydahl
It may not be well supported Check out https://issues.apache.org/jira/browse/SOLR-9779 for some hints. Contributions welcome for improving how SolrJ can pass different auth credentials both for Streaming and search. -- Jan Høydahl, search soluti

Optimal RAM to size index ration

2019-04-15 Thread SOLR4189
Hi all, I have a collection with many shards. Each shard is in separate SOLR node (VM) has 40Gb index size, 4 CPU and SSD. When I run performance checking with 50GB RAM (10Gb for JVM and 40Gb for index) per node and 25GB RAM (10Gb for JVM and 15Gb for index), I get the same queries times (percen

RE: Multivalue Field lookup

2019-04-15 Thread Wolf, Karl (NIH/NLM/LHC) [C]
I use highlighting to get the list of "hits" from queries a multi-valued String fields. This works fine except when the query contains a range clause on that field. Even though the correct documents are found, the highlighted "hits" contain no data values. This is the reason for my Solr jira bug

NPE in CharsRefBuilder

2019-04-15 Thread Markus Jelsma
Hello, I made a ConditionalTokenFilter filter and factory. Its Lucene based unit tests work really well, and i can see it is doing something, queries are differently analyzed based on some condition. But when debugging through the GUI i get the following: 2019-04-15 12:37:42.219 ERROR (qtp8156

Set Basic Auth to CloudSolrStream

2019-04-15 Thread Novin Novin
Hi How can I set basic auth for CloudSolrStream? I couldn't find any documentation. Can someone please point me in the right direction? Thanks in advance, Navin

Re: nested documents performance

2019-04-15 Thread Emir Arnautović
Hi Roi, I don’t know the details about your test, but trying to assume how it looks like and explain observed. With your flat test you are denormalising data, meaning creating data duplication so the resulting document set is larger. That means more fields/text for Solr/Lucene to analyse and to