Hi all,
I have been experimenting with distributed IDF. I debugged and explored the
code quite a bit, and it is a nice feature in a sharded environment.

I wanted to see how it behaves when we run a distributed query across
collections (...&collection=a,b,c).

Distributed IDF should work in this scenario as well, and the document
frequency calculation should reasonably use a max doc count equal to the
total count across all the shards of all the collections.
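
To make that expectation concrete, here is a minimal sketch in plain Java
(not Solr code). The class GlobalIdfSketch, the ShardStats holder, the merge
helper and all the numbers are hypothetical, made up for illustration only.
The idf formula is the BM25 one from Lucene's BM25Similarity. The sketch sums
the doc count and doc frequency over every shard of every collection and
compares the resulting global IDF with the per-shard local values.

```java
import java.util.Arrays;
import java.util.List;

public class GlobalIdfSketch {

    /** Hypothetical per-shard statistics for one term in one field. */
    public static final class ShardStats {
        public final String collection;
        public final long docCount; // documents in the shard that have the field
        public final long docFreq;  // documents in the shard that contain the term

        public ShardStats(String collection, long docCount, long docFreq) {
            this.collection = collection;
            this.docCount = docCount;
            this.docFreq = docFreq;
        }
    }

    /** BM25 IDF: ln(1 + (docCount - docFreq + 0.5) / (docFreq + 0.5)). */
    public static double idf(long docFreq, long docCount) {
        return Math.log(1 + (docCount - docFreq + 0.5) / (docFreq + 0.5));
    }

    /** Sums docCount and docFreq over all shards of all collections. */
    public static ShardStats merge(List<ShardStats> shards) {
        long docCount = 0;
        long docFreq = 0;
        for (ShardStats s : shards) {
            docCount += s.docCount;
            docFreq += s.docFreq;
        }
        return new ShardStats("global", docCount, docFreq);
    }

    public static void main(String[] args) {
        // Made-up numbers: one shard per collection a, b and c.
        List<ShardStats> shards = Arrays.asList(
            new ShardStats("a", 1_000, 10),
            new ShardStats("b", 1_000, 500),
            new ShardStats("c", 100, 1));

        for (ShardStats s : shards) {
            System.out.printf("%s local idf=%.3f%n",
                s.collection, idf(s.docFreq, s.docCount));
        }
        ShardStats global = merge(shards);
        System.out.printf("global idf=%.3f%n",
            idf(global.docFreq, global.docCount));
    }
}
```

With numbers like these, the same term gets a noticeably different IDF on
each shard, while the merged stats give a single value that every shard
could score with.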

Using the ExactStatsCache, the global collection stats are properly
calculated (while debugging I see global stats that are consistent with what
I expect).

But these stats are then lost, and BM25 uses the local stats (with the
consequences we know).

I will continue my investigation. Has anyone faced this problem before?

The Solr version I am using is 6.3.0.


Cheers



--
View this message in context: 
http://lucene.472066.n3.nabble.com/Distributed-IDF-in-inter-collections-distributed-queries-tp4317519.html
Sent from the Solr - User mailing list archive at Nabble.com.
