Merge policy would probably affect at how often _some_ of the deleted documents are purged at the cost lower than the full optimization. https://cwiki.apache.org/confluence/display/solr/IndexConfig+in+SolrConfig#IndexConfiginSolrConfig-MergingIndexSegments
But it is still not a 100% solution. Regards, Alex. Personal: http://www.outerthoughts.com/ and @arafalov Solr resources and newsletter: http://www.solr-start.com/ and @solrstart Solr popularizers community: https://www.linkedin.com/groups?gid=6713853 On 28 October 2014 09:42, Shawn Heisey <apa...@elyograg.org> wrote: > On 10/28/2014 7:16 AM, nutchsolruser wrote: >> How can we get exact term frequency with excluding deleted documents term >> frequency, and that is without optimization because optimization is >> expensive in our case ? >> Is there any other way we can get term frequency for entire collection in >> solr? > > > This is not possible except through index optimization. Lucene is > amazingly efficient at computing information across the entire index. > If it were possible to keep that efficiency while also excluding info > from deleted documents, I'm sure it would have already been implemented. > > Thanks, > Shawn >