Hi all,

I have been experimenting with distributed IDF. I debugged and explored the code quite a lot, and it is a nice feature in a shared environment.
I wanted to see how it behaves when we run a distributed query across collections (...&collection=a,b,c). Distributed IDF should work in this scenario as well, and the document frequency calculation should reasonably use a max doc count that is the total count across all the shards of all the collections.

Using the ExactStats cache, the global collection stats are calculated properly (while debugging, I see global stats that are consistent with what I expect). But these stats are then lost, and BM25 ends up using the local stats (with the consequences we know).

I will continue my investigation. Has anyone faced this problem before? The Solr version I am using is 6.3.0.

Cheers

--
View this message in context: http://lucene.472066.n3.nabble.com/Distributed-IDF-in-inter-collections-distributed-queries-tp4317519.html
Sent from the Solr - User mailing list archive at Nabble.com.
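
To make the local-versus-global difference concrete, here is a minimal sketch in Python. It is not Solr code: the shard names and term counts are made up for illustration, and it assumes the global statistics are simply the sums of the per-shard document frequency and document count. The IDF formula is the one used by Lucene's BM25Similarity, ln(1 + (N - df + 0.5) / (df + 0.5)).

```python
import math


def bm25_idf(doc_freq, doc_count):
    """BM25 IDF in the form used by Lucene: ln(1 + (N - df + 0.5) / (df + 0.5))."""
    return math.log(1 + (doc_count - doc_freq + 0.5) / (doc_freq + 0.5))


# Hypothetical per-shard statistics for a single term: (docFreq, docCount).
# One shard per collection a, b, c; the numbers are invented for illustration.
shard_stats = {
    "a_shard1": (5, 1000),
    "b_shard1": (400, 1000),
    "c_shard1": (50, 10000),
}

# Assumed global statistics: sums over every shard of every collection queried.
global_df = sum(df for df, _ in shard_stats.values())
global_count = sum(n for _, n in shard_stats.values())
global_idf = bm25_idf(global_df, global_count)

# What each shard computes if it falls back to its own local statistics.
local_idfs = {name: bm25_idf(df, n) for name, (df, n) in shard_stats.items()}

for name, idf in local_idfs.items():
    print(f"{name}: local idf = {idf:.3f}, global idf = {global_idf:.3f}")
```

With local stats, the same term gets a different IDF on each shard, so scores coming from different collections are not computed on the same scale. With the global stats, every shard would use one shared value.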