Hi, There is clearly something wrong when your two replicas are not in sync. Could you go to the “Cloud->Tree” tab of admin UI and look in the overseer queue whether you find signs of stuck jobs or something? Btw - what warnings do you see in the logs? Anything repeatedly popping up?
I would also try the following: 1. Take down the node hosting replica 1 (assuming that replica2 is the correct, most current) 2. Manually empty the data folder 3. Take the node up again 4. Verify that a full index recovery happens, and that they get back in sync 5. Run your indexing procedure. 6. Verify that both replicas are still in sync -- Jan Høydahl, search solution architect Cominvent AS - www.cominvent.com > 16. aug. 2016 kl. 06.51 skrev Pranaya Behera <pranaya.beh...@igp.com>: > > Hi, > a.) Yes index is static, not updated live. We index new documents over old > documents by this sequesce, deleteall docs, add 10 freshly fetched from db, > after adding all the docs to cloud instance, commit. Commit happens only once > per collection, > b.) I took one shard and below are the results for the each replica, it has 2 > replica. > Replica - 2 > Last Modified: 33 minutes ago > Num Docs: 127970 > Max Doc: 127970 > Heap Memory Usage: -1 > Deleted Docs: 0 > Version: 14530 > Segment Count: 5 > Optimized: yes > Current: yes > Data: /var/solr/data/product_shard1_replica2/data > Index: /var/solr/data/product_shard1_replica2/data/index.20160816040537452 > Impl: org.apache.solr.core.NRTCachingDirectoryFactory > > Replica - 1 > Last Modified: about 19 hours ago > Num Docs: 234013 > Max Doc: 234013 > Heap Memory Usage: -1 > Deleted Docs: 0 > Version: 14272 > Segment Count: 7 > Optimized: yes > Current: no > Data: /var/solr/data/product_shard1_replica1/data > Index: /var/solr/data/product_shard1_replica1/data/index > Impl: org.apache.solr.core.NRTCachingDirectoryFactory > > c.) With the admin ui: if I query for all, *:* it gives different numFound > each time. > e.g. > 1. > > |{ "responseHeader":{ "zkConnected":true, "status":0, "QTime":7, "params":{ > "q":"*:*", "indent":"on", "wt":"json", "_":"1471322871767"}}, > "response":{"numFound":452300,"start":0,"maxScore":1.0, 2. | > |{ "responseHeader":{ "zkConnected":true, "status":0, "QTime":23, "params":{ > "q":"*:*", "indent":"on", "wt":"json", "_":"1471322871767"}}, > "response":{"numFound":574013,"start":0,"maxScore":1.0, This is queried live > from the solr instances. | > > It happens with any type of queries, if I search in parent document or search > through child documents to get parents. sorting is used in both cases but > with different field, while doingblock join query sortingis on the child > document field, otherwise on the parent document field. > > d.) I dont find any errors in the logs. All warnings only. > > On 14/08/16 02:56, Jan Høydahl wrote: >> Could it be that your cluster is not in sync, so that when Solr picks three >> nodes, results will vary depending on what replica answers? >> >> A few questions: >> >> a) Is your index static, i.e. not being updated live? >> b) Can you try to go directly to the core menu of both replicas for each >> shard, and compare numDocs / maxDocs for each? Both replicas in each shard >> should have same count. >> c) What are you querying on and sorting by? Does it happen with only one >> query and sorting? >> d) Are there any errors in the logs? >> >> If possible, please share some queries, responses, config, screenshots etc. >> >> -- >> Jan Høydahl, search solution architect >> Cominvent AS - www.cominvent.com >> >>> 13. aug. 2016 kl. 12.10 skrev Pranaya Behera <pranaya.beh...@igp.com>: >>> >>> Hi, >>> I am running solr 6.1.0 with solrcloud. We have 3 instance of zookeeper >>> and 3 instance of solrcloud. All three of them are active and up. One >>> collection has 3 shards, each shard has 2 replicas. >>> >>> Everytime query whether from solrj or admin ui, getting inconsistent >>> results. e.g. >>> 1. numFound is always fluctuating. >>> 2. facet count shows the count for a field, filter query on that field gets >>> 0 results. >>> 3. luke requests work(not sure whether gives correct info of all the >>> dynamic field) on per shard not on collection when invoked from curl but >>> doesnt work when called from solrj. >>> 4. admin ui shows expanded results, same query goes from solrj, >>> getExpandedResults() gives 0 docs. >>> >>> What would be cause of all this ? Any pointer to look for an error anything >>> in the logs. >> >