I'm running a 4.2 SOLRCloud instance with multiple servers/shards.  As I'm 
indexing data, I review the results of the STATUS commands and note an 
extremely high number of "deletedDocs".  I've combed through the source data to 
verify whether I'm sending duplicate documents ids, but haven't been able to 
find any.  I'm starting to wonder whether the field is a red herring?

Is the deleted document counter an accurate reflection of documents marked 
deleted in the collection?  My assumption is that if I send a document with the 
same document id, Solr will marked the document as deleted and then insert the 
new one.  Then at merge time the deleted documents are purged from the index.  
I've noted the the total deleted document count will go down with the indexes 
are merged.

Chris Rasmussen

Reply via email to