Use global IDF if the data is not evenly distributed across the collections.
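
A rough sketch of why this matters, assuming the classic Lucene-style formula idf = 1 + ln(N / (df + 1)) with docCount used as N. The collection names and the numbers are made up for illustration; they are not taken from any real index.

```python
import math

def idf(doc_freq, doc_count):
    # Classic Lucene-style IDF: 1 + ln(N / (df + 1)), with N = docCount.
    return 1.0 + math.log(doc_count / (doc_freq + 1))

# Hypothetical per-collection statistics for a single term: (docFreq, docCount).
# The term is rare in collection_a and common in collection_b (uneven data).
stats = {
    "collection_a": (10, 1000),
    "collection_b": (400, 1000),
}

# Local IDF: each collection scores the term from its own statistics only.
local_idf = {name: idf(df, n) for name, (df, n) in stats.items()}

# Global IDF: sum the statistics over both collections, then compute once.
global_df = sum(df for df, _ in stats.values())
global_n = sum(n for _, n in stats.values())
global_idf = idf(global_df, global_n)

for name, value in local_idf.items():
    print(f"{name}: local idf = {value:.3f}")
print(f"global idf = {global_idf:.3f}")
```

With local statistics the same term gets a much higher weight in collection_a than in collection_b, so scores from the two collections are not directly comparable when merged behind the alias. With global statistics both sides use the same weight.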

On Tuesday, October 27, 2015, Markus Jelsma <markus.jel...@openindex.io>
wrote:

> Hello - regarding fairly random/smooth distribution, you will notice it
> for sure. A solution there is to use distributed collection statistics. On
> top of that you might want to rely on docCount, not maxDoc inside your
> similarity implementation, because docCount should be identical in both
> collections. maxDoc is not really deterministic, it seems, since identical
> replicas do not merge segments at the same time.
>
> Markus
>
>
> -----Original message-----
> > From: Scott Stults <sstu...@opensourceconnections.com>
> > Sent: Tuesday 27th October 2015 21:18
> > To: solr-user@lucene.apache.org
> > Subject: Re: Solr collection alias - how rank is affected
> >
> > Collection statistics aren't shared between collections, so there's going
> > to be a difference. However, if the distribution is fairly random you won't
> > notice.
> >
> > On Tue, Oct 27, 2015 at 3:21 PM, SolrUser1543 <osta...@gmail.com> wrote:
> >
> > > How is document ranking affected when using a collection alias for
> > > searching on two collections with the same schema? Is it affected at all?
> > >
> > >
> > >
> > > --
> > > View this message in context:
> > > http://lucene.472066.n3.nabble.com/Solr-collection-alias-how-rank-is-affected-tp4236776.html
> > > Sent from the Solr - User mailing list archive at Nabble.com.
> > >
> >
> >
> >
> > --
> > Scott Stults | Founder & Solutions Architect | OpenSource Connections, LLC
> > | 434.409.2780
> > http://www.opensourceconnections.com
> >
>
