Hello Markus, Ahmet,

Sorry, I forgot to update the thread. Optimizing works: after an optimize, every unique key has a docFreq of 1.
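
For anyone who finds this thread later, here is a toy Python sketch of the behaviour Hoss describes in the quoted reply. It is not Lucene or Solr code: ToyIndex, Doc, add, doc_freq and optimize are hypothetical names made up for illustration, and the model assumes only that re-adding a unique key marks the old copy as deleted and that docFreq counts every posting still held in a segment.

```python
from dataclasses import dataclass


@dataclass
class Doc:
    key: str
    terms: frozenset
    deleted: bool = False


class ToyIndex:
    """Toy model of a segmented index; NOT the real Lucene data structures."""

    def __init__(self):
        self.segments = []  # each segment is a list of Doc

    def add(self, key, terms):
        # Re-adding an existing unique key only marks the old copy as deleted;
        # the old copy stays in its segment until a merge.
        for segment in self.segments:
            for doc in segment:
                if doc.key == key and not doc.deleted:
                    doc.deleted = True
        # Every add goes into a fresh segment (a simplification).
        self.segments.append([Doc(key, frozenset(terms) | {"id:" + key})])

    def doc_freq(self, term):
        # Term stats cover every doc still in a segment, deleted or not.
        return sum(1 for segment in self.segments
                   for doc in segment if term in doc.terms)

    def optimize(self):
        # Merge everything into one segment and drop the deleted docs,
        # which is when the stats get recomputed.
        live = [doc for segment in self.segments
                for doc in segment if not doc.deleted]
        self.segments = [live] if live else []


index = ToyIndex()
for _ in range(3):  # index the same unique key three times
    index.add("doc1", ["word1", "word2"])

before = index.doc_freq("id:doc1")
index.optimize()
after = index.doc_freq("id:doc1")
print(before, after)  # prints "3 1"
```

In this sketch the unique key has a docFreq of 3 before the merge and 1 after it, which matches what I saw once the real index was optimized.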

On Wed, Jun 18, 2014 at 1:58 AM, Chris Hostetter <hossman_luc...@fucit.org> wrote:

> : text in it, query is of the type "keywords:(word1 OR word2 ... OR wordN)".
> : The client is relying on default relevancy based sort returned by solr.
> : Some documents can get penalised because of some other documents which were
> : deleted. Is this functionality correct?
>
> yes, because term stats are over the entire index including deleted
> documents still in segments -- information about deletions isn't "purged"
> from the index until a segment is merged and the stats are recomputed over
> the docs/terms in the new segment.
>
> the only way to get those types of statistics at request time such that
> they were *not* affected by deleted documents would involve scanning every
> doc to compute them -- which would defeat the point of having the inverted
> index.
>
> -Hoss
> http://www.lucidworks.com/

--
Thanks & Regards,
Apoorva