Re: Solr 7.0.1 Duplicate document appearing in search results

2019-05-14 Thread Adam Walz
Thanks Erick, We've never merged indexes. We don't use the MapReduceIndexerTool, but do use an external map reduce process to reindex. To reindex from an empty state we have a map reduce job which runs on a separate HBase cluster and indexes into this shard. During this job each mapper is concurre

Re: Solr 7.0.1 Duplicate document appearing in search results

2019-05-14 Thread Erick Erickson
This is indeed strange. First of all, forget about explanations that involve the transaction log etc. When Lucene opens a searcher, it is only for closed segments, the tlog has nothing to do with that. Have you ever merget indexes? The MapReduceIndexerTool, if you ever used it, does not de-dupl

Re: Seeking advice on SolrCloud production architecture with CDCR

2019-05-14 Thread Shawn Heisey
On 5/14/2019 4:55 PM, Cody Burleson wrote: I’m worried, for example, about spreading the Zookeper cluster between the two data centers because of potential latency across the pond. Maybe we keep the ZK ensemble on one side of the pond only? I imagined, for instance, 2 ZK nodes on one server,

Seeking advice on SolrCloud production architecture with CDCR

2019-05-14 Thread Cody Burleson
Hi, all. We’re upgrading an old Solr 3.5 setup (master/slave replication) to SolrCloud (v7 or v8) and with the addition of a new data center (for dual data centers). I’ve done a lot of homework, but could still use some advice. While documentation explains Zookeper and SolrCloud pretty well, I d

Solr 7.0.1 Duplicate document appearing in search results

2019-05-14 Thread Adam Walz
In my solr schema I have set a uniqueKey of "id" where the id field is a solr.StrField. When querying with this field as a filter I would expect to always get 1 or 0 documents as a result. However I am getting back multiple documents with the same "id" field, but different internal `docid`s. This p

Solr 8.1 issue with collection aliases

2019-05-14 Thread Jörn Franke
Hi, I tried to upgrade from 8.0 to 8.1. I noticed that there is an issue with collection aliases, but I am not 100% sure it is due to the upgrade. Situation: I have a collection called c_testcollection. I have an alias called testcollection. Alias "testcollection" points to "c_testcollection". On

Re: Solr node goes into recovery mode

2019-05-14 Thread Erick Erickson
I think it’s the wrong question to ask. By using docaValues you’ll be able to significantly reduce the heap allocated to the Java process, reduce the overhead of garbage collections, reduce the possibility of nodes going into recovery and increase stability generally. Compared to those gains, pe

Re: Writing unit tests to test complex solr queries

2019-05-14 Thread Angie Rabelero
Hi, I’ll advised you to extend the class SolrCloudTestCase, which extends the MiniSolrCloudCluster. Theres a hello world example in the solr source at https://github.com/apache/lucene-solr/blob/master/solr/core/src/test/org/apache/solr/HelloWorldSolrCloudTestCase.java. Here’s how I setup a clust

Re: Solr node goes into recovery mode

2019-05-14 Thread Maulin Rathod
Thanks Erick, I understand using docvalue should improve query performance. Please correct me if my understanding is incorrect. Regards, Maulin On May 14, 2019 19:11, Erick Erickson wrote: Use docValues on all fields you group, facet or sort on. NOTE: you _must_ re-index from scratch, I’d

RE: mergeindexes action does not seem to be merging cores.

2019-05-14 Thread Piyush Kumar Nayak
Yes that worked. Thanks Erick, for your help. -Original Message- From: Erick Erickson Sent: Tuesday, May 14, 2019 7:09 PM To: solr-user@lucene.apache.org Subject: Re: mergeindexes action does not seem to be merging cores. Did you commit afterwards? > On May 14, 2019, at 8:04 AM, Piyu

Re: Solr node goes into recovery mode

2019-05-14 Thread Erick Erickson
Use docValues on all fields you group, facet or sort on. NOTE: you _must_ re-index from scratch, I’d index to a new collection and start over. Paradoxically your index size _on disk_ will increase, but your JVM will need drastically less heap. See: http://blog.thetaphi.de/2012/07/use-lucenes-mm

Re: mergeindexes action does not seem to be merging cores.

2019-05-14 Thread Erick Erickson
Did you commit afterwards? > On May 14, 2019, at 8:04 AM, Piyush Kumar Nayak > wrote: > > Hi, > > I don't seem to be able to get the merge core feature to work with Solr 7.2.1. > I'm using the srcCore parameter method documented at > https://lucene.apache.org/solr/guide/6_6/coreadmin-api.html

mergeindexes action does not seem to be merging cores.

2019-05-14 Thread Piyush Kumar Nayak
Hi, I don't seem to be able to get the merge core feature to work with Solr 7.2.1. I'm using the srcCore parameter method documented at https://lucene.apache.org/solr/guide/6_6/coreadmin-api.html#CoreAdminAPI-MERGEINDEXES I am making the following GET HTTP call using a browser: http://localhost:

Re: query keyword but no result (solr 8)

2019-05-14 Thread Charles Sanders
Derrick, This makes me think you do not have a default query field identified. Look in your solrcong.xml file for the requesthandler you are calling and see if it has the "df" parameter set. Should look something like: text Hope this helps! - Original Message - From: "Derrick Cui"

Re: Writing unit tests to test complex solr queries

2019-05-14 Thread Mikhail Khludnev
Hello, Pratick. Welcome to mysterious world of Solr testing. The best way is to find existing test closest to your problem field, copy in and amend necessarily. What about https://github.com/apache/lucene-solr/blob/master/solr/solrj/src/test/org/apache/solr/client/solrj/io/stream/StreamExpressionTe