Re: Shard and replica went down in Solr 6.1.0

2019-04-11 Thread vishal patel
Thanks your reply Actually all cache are removed in my solrconfig.xml so no matter for autowarm count for us. And I read your given link about hard commit and soft commit concept. In my production scenario like - index-heavy, Query-heavy [Near Real Time] So we set the hardcommit max time 10 minu

Solr 8.0.0 Delta import add/delete data

2019-04-11 Thread Anuj Bhargava
We have a MySql database (news) which has the following fields - posting_id, date, name, currency, country, expiry etc The database has more than 120 entries. Daily around 20 plus new records are added and around the same number deleted. posting_id is a unique ID for every record. Pl

Re: Solr nested objects (child documents)

2019-04-11 Thread Zheng Lin Edwin Yeo
Which Solr version are you using? Regards, Edwin On Thu, 11 Apr 2019 at 22:34, roiwexler wrote: > Hi, > we're at the process of testing Solr for it's indexing speed which is very > impotent to our application. > we've witnessed strange behavior that we wish to understand before using > it. > wh

Re: How to prevent solr from deleting cores when getting an empty config from zookeeper

2019-04-11 Thread Shawn Heisey
On 4/11/2019 6:44 PM, Koen De Groote wrote: I gathered a solr log from 7.6.0 at TRACE level. Then I replicated the experiment with 6.6.5 and with that version, the directories were not deleted. Log also included. The audit log is from solr7. The deletes start at 01:51:48, which translates to 23

Re: How to prevent solr from deleting cores when getting an empty config from zookeeper

2019-04-11 Thread Koen De Groote
I gathered a solr log from 7.6.0 at TRACE level. Then I replicated the experiment with 6.6.5 and with that version, the directories were not deleted. Log also included. The audit log is from solr7. The deletes start at 01:51:48, which translates to 23:51:48 UTC, which you'll be able to find in th

Re: Suggester case (in)sensitive

2019-04-11 Thread Moritz Schmidt
Does anyone have an idea? Thanks and best regards, Moe > Am 20.03.2019 um 15:19 schrieb Moritz Schmidt : > > Hello everyone. > > I’m trying to build autocomplete functionality. > My setup works but has one problem: > When using HighFrequencyDictionaryFactory the Suggestion-Results I get are >

Re: high cpu threads (solr 7.5)

2019-04-11 Thread Shawn Heisey
On 4/11/2019 1:03 PM, Hari Nakka wrote: I mean the light weight processes (lwp) which were taking high cpu. I pulled the actual threads taking high cpu. full thread dump: *tdump.out* linux lwps: *high-cpu.out* top high cpu lwps mapped to thread nid: *high-cpu-dump.out (included threads taking

Re: How to prevent solr from deleting cores when getting an empty config from zookeeper

2019-04-11 Thread Shawn Heisey
On 4/11/2019 2:40 PM, Koen De Groote wrote: That being explained, am I right in understanding that currently there is no way of configuring Solr so that it won't delete the folders, in this event? In my opinion, Solr should never delete cores unless it has been explicitly *ASKED* to do so with

Re: How to prevent solr from deleting cores when getting an empty config from zookeeper

2019-04-11 Thread Koen De Groote
Shawn, Apologies, I should have explained more properly. To clarify: manually deleting the 'version-2' directory is never something that happened when I first observed this behavior. The reason I did it in this example is that it's the fastest and simplest way to demonstrate the behavior. What I

Re: Real time get - URL size limitation

2019-04-11 Thread Alexandre Rafalovitch
Two quick thoughts without computer access: 1) have you tried post? Usually they do work for all calls. 2) if the list does not change often, you can add it to the request handler definition. Or even as a separate paramset to pass by reference. Either way you would not need to have it in URL every

Real time get - URL size limitation

2019-04-11 Thread Zarski, Jacek
Let's say there is a collection with a schema that specifies 5000 elements. I would like to make a "real time get" call to retrieve a large subset of those elements. Currently I am limited by the size restriction of the HTTP GET method if I want to make that request be "real time" since I would

Re: high cpu threads (solr 7.5)

2019-04-11 Thread Hari Nakka
I mean the light weight processes (lwp) which were taking high cpu. I pulled the actual threads taking high cpu. full thread dump: *tdump.out* linux lwps: *high-cpu.out* top high cpu lwps mapped to thread nid: *high-cpu-dump.out (included threads taking more than 50% virtual core cpu)* https://d

Re: Solr New version 8.1

2019-04-11 Thread Shawn Heisey
On 4/11/2019 3:30 AM, vishal patel wrote: Any one knows about tentative date of stable SOLR 8.1 release? There are never any scheduled release dates. When one of the committers decides it's time for a new release and volunteers to be the release manager, then we have a release. It typically

Re: high cpu threads (solr 7.5)

2019-04-11 Thread Shawn Heisey
On 4/11/2019 2:21 AM, Hari Nakka wrote: Hi Erick, We upgraded JDK to 11. No improvement. Still seeing high cpu utilization randomly. Attached the full threaddump (tdump.out)  and lwp utilization (high-cpu.out) there were more than 30 threads (high-cpu-dump.out)taking high cpu. these are differe

Re: Shard and replica went down in Solr 6.1.0

2019-04-11 Thread Tulsi Das
Hi Vishal, autosoftcommit of 1 sec could be the reason for this issue. Choose autosoftcommit value as long an interval as your application can tolerate, often 15-60 seconds is reasonable, or even longer depending on the requirements Also consider disabling your caches (queryResultCache and filterCa

Can replicas retry for a longer time?

2019-04-11 Thread Walter Underwood
We had a bad situation with our prod cluster. There was a DNS failure in AWS and all the replicas went “brown”. Only the leaders were taking traffic. If the replicas had continued to attempt recovery every five minutes or so, they would have come back online automatically. Is there a way to con

Re: Shard and replica went down in Solr 6.1.0

2019-04-11 Thread vishal patel
Actually in our application , bulky documents are needed to index and on same time we want to see that documents. So in production we keep auto commit 10 minutes and auto soft commit 1 second. Is it ohk? Get Outlook for Android From: Eri

Re: How to prevent solr from deleting cores when getting an empty config from zookeeper

2019-04-11 Thread Shawn Heisey
On 4/11/2019 3:17 AM, Koen De Groote wrote: The basic steps are: set up zookeeper, set up solr root, set up solr. Create dummy collection with example data. Stop the containers. Delete the zookeeper 'version-2' folder. Recreate zookeeper container. Redo the mkroot, recreate solr container. At t

Re: Solr web crawler with recursive option

2019-04-11 Thread Erick Erickson
You are sending malformed XML to Solr. This can be something as silly as having extra spaces at the beginning. I’d capture the page being sent to Solr and put it in a formatter to check it…. Best, Erick > On Apr 11, 2019, at 3:49 AM, Shivprasad Shetty > wrote: > > Hello Team, > > >

Re: gatherNodes question. Is this a bug?

2019-04-11 Thread Kojo
Joel, thank you in advance. Follows SE code and resultset for each piece. I only change some resultset values, without change the logic. I am on Solr 6.6.2. 1. First SE - Inner gatherNodes: sort( gatherNodes( my_collection, gatherNodes( my_collection, search(m

Solr nested objects (child documents)

2019-04-11 Thread roiwexler
Hi, we're at the process of testing Solr for it's indexing speed which is very impotent to our application. we've witnessed strange behavior that we wish to understand before using it. when we indexed 1M docs it took about 63 seconds but when we indexed the same documents only now we've nested t

Re: How to configure default replication type?

2019-04-11 Thread Roger Lehmann
Hi Tulsi, this sounds more like which replication types are preferably chosen for distributed queries with multiple shards per collection. What I'd like to achieve is creating TLOG replicas as default in the first place. But to be honest, I haven't tried it out since I'm now using a complete diffe

Re: Shard and replica went down in Solr 6.1.0

2019-04-11 Thread vishal patel
Thanks Erick, I got your point. As per you, Solr will not go down due to “performance warning” and no need to change the maxdoc value. You talked about number of searchers but in solrconfig.xml there is only 2.[2] In production, we have 27 collection, 2 shards and 2 replicas and 3 zoo keepers

Re: Solr web crawler with recursive option

2019-04-11 Thread Alexandre Rafalovitch
One of the files that post tool identified as XML is not. Possibly a 404 error or some such. So it is trying to parse the file and sees non-xml content right at start. Or if you are sure it is an XML file, maybe there is a BOM mark. Either way try to isolate the specific file. On a bigger picture

Solr web crawler with recursive option

2019-04-11 Thread Shivprasad Shetty
Hello Team, I am working on solr for the first time and got the setup done. Now I have created a core using command line and want to perform webcrawl of a third party site. If I try it with individual links, I am able to do the crawl and index it to the core.This was done using

Solr web crawler with recursive option

2019-04-11 Thread Shivprasad Shetty
I am working on solr for the first time and got the setup done. Now I have created a core using command line and want to perform webcrawl of a third party site. If I try it with individual links, I am able to do the crawl and index it to the core.This was done using > java -Dda

Solr web crawler with recursive option

2019-04-11 Thread Shivprasad Shetty
Hello Team, I am working on solr for the first time and got the setup done. Now I have created a core using command line and want to perform webcrawl of a third party site. If I try it with individual links, I am able to do the crawl and index it to the core.This was done using

Solr web crawler with recursive option

2019-04-11 Thread Shivprasad Shetty
Hello Team, I am working on solr for the first time and got the setup done. Now I have created a core using command line and want to perform webcrawl of a third party site. If I try it with individual links, I am able to do the crawl and index it to the core.This was done using

Using solrconfig for json facet sorting

2019-04-11 Thread sagandhi
Hi, Is it possible to configure sorting on json.facet in solrconfig.xml just like for traditional facets? Thanks, Soham -- Sent from: http://lucene.472066.n3.nabble.com/Solr-User-f472068.html

Solr New version 8.1

2019-04-11 Thread vishal patel
Hi Any one knows about tentative date of stable SOLR 8.1 release? Sent from Outlook

Re: How to prevent solr from deleting cores when getting an empty config from zookeeper

2019-04-11 Thread Koen De Groote
Attached to this mail is a tar.gz with instructions to reproduce. It contains 3 text files with commands and comments. Be sure to check the actual commands before executing. This was tested on a Ubuntu 18.04 VM, with docker installed on it. The order of execution is: - zookeeper instructions.txt

Re: high cpu threads (solr 7.5)

2019-04-11 Thread Hari Nakka
Hi Erick, We upgraded JDK to 11. No improvement. Still seeing high cpu utilization randomly. Attached the full threaddump (tdump.out) and lwp utilization (high-cpu.out) there were more than 30 threads (high-cpu-dump.out)taking high cpu. these are different threads. i couldn't find much looking at