Hello,

We discussed this problem before [1], and we could not fix it until it became 
clear that my collection was rather small. Thanks again.

Another collection, now on 7.1, also shows this problem and has default TMP 
(TieredMergePolicy) settings. This time the size is different: each shard of 
this collection is over 40 GB and has about 50 % deleted documents. Each 
shard's largest segment is just under 20 GB with about 75 % deleted documents. 
After that come a few 5-6 GB segments with just under 50 % deleted documents.
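
To make those numbers concrete, here is a rough sketch of the live 
(non-deleted) data per segment, using the approximate figures above. The 5 GB 
value is TieredMergePolicy's default maximum merged segment size. That the 
policy weighs a segment by its live size and skips segments at or above half 
that maximum is my understanding of the 7.x behaviour, so treat it as an 
assumption. The helper names are made up for illustration.

```python
# Rough arithmetic on the approximate figures quoted above (sizes in GB).
DEFAULT_MAX_MERGED_SEGMENT_GB = 5.0  # TieredMergePolicy default


def live_size_gb(size_gb, deleted_fraction):
    """Segment size pro-rated by live documents (assumes uniform doc size)."""
    return size_gb * (1.0 - deleted_fraction)


def too_big_to_merge(size_gb, deleted_fraction,
                     max_gb=DEFAULT_MAX_MERGED_SEGMENT_GB):
    """Assumed rule: live size at or above half the maximum is skipped."""
    return live_size_gb(size_gb, deleted_fraction) >= max_gb / 2.0


# (label, on-disk size in GB, fraction of deleted documents), all approximate
segments = [
    ("largest", 20.0, 0.75),
    ("mid-sized", 5.5, 0.50),
]

for name, size_gb, deleted in segments:
    print(name, live_size_gb(size_gb, deleted),
          too_big_to_merge(size_gb, deleted))
```

If that assumption holds, both the largest segment and the 5-6 GB ones still 
count as too large to be picked for a natural merge, despite their deletes.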

What do I need to change to make Lucene believe that at least that 20 GB, 
three-month-old segment should be merged away? And what would the expected 
indexing performance penalty be?

Regarding reindexing frequency: each document is reindexed at least once every 
30 days, some more frequently. Updates are indexed every fifteen minutes or so.

Many thanks,
Markus

[1] 
http://lucene.472066.n3.nabble.com/Very-high-number-of-deleted-docs-td4357327.html
