Hi Solr Users, We use SolrCloud 7.2.1 with 2 Solr nodes in AWS. The shard size for these collections does not exceed more than 5G. They have approximately 16 shards with 2 replicas. We do deletes (ByQuery) as well large updates in some of these Solr collections. We are seeing slower filter queries (95% > 10secs) on these collections in production, same collections, and same queries in our lower environment with similar setup and configuration we seeing much better performance (<100ms). These are NRT indexes, with daily batch updates only.
We see a difference however in the lower environment; that we don't see updates or deletes, we see in Segment Info for each of the Solr code there are ZERO delete percentages. Could this be the reason for the faster query response time in our lower environment? in our production environment, we are seeing about 30-32% of deletes in each core shard/replica pair. Does this segment delete % has any correlation with query response time? We do delete by Query in a loop. Also updates. If it is so, do you suggest to try to do Optimize or expungeDelete at the end every day? Do we need to expunge delete after each delete ByQuery or do it once at the end? Regards, Ganesh