Hi Joel,

Thanks for the pointer. Can you point me to any example implementation.


Parvesh Garg,
Founding Architect

http://www.zettata.com

On Tue, Oct 21, 2014 at 9:32 PM, Joel Bernstein <joels...@gmail.com> wrote:

> The RankQuery cannot be used as filter. It is designed for custom
> ordering/ranking of results only. If it's used as filter the facet counts
> will not match up. If you need a filter collector then you need to use a
> PostFilter.
>
> Joel Bernstein
> Search Engineer at Heliosearch
>
> On Tue, Oct 21, 2014 at 10:50 AM, Erick Erickson <erickerick...@gmail.com>
> wrote:
>
> > I _very strongly_ recommend that you do _not_ do this.
> >
> > First, the "problem" of having documents in the results
> > list with, say, scores < 20% of the max takes care of itself;
> > users stop paging pretty quickly. You're arbitrarily
> > denying the users any chance of finding some documents
> > that _do_ match their query. A user may know that a
> > doc is in the corpus but be unable to find it. Very bad from
> > a confidence-building standpoint.
> >
> > I've seen people put, say, 1-5 stars next to docs in the result
> > to give the user some visual cue that they're getting into "less
> > good" matches, but even that is of very limited value IMO. The
> > stars represent quintiles, 5 stars for docs > 80% of max, 4
> > stars between 60% and 80% etc.
> >
> > If you insist on this, then you'll need to run two passes
> > across the data, the first will get the max score and the second
> > will have a custom collector that somehow gets this number
> > and rejects any docs below the threshold.
> >
> > Bet,
> > Erick
> >
> > On Tue, Oct 21, 2014 at 3:09 AM, Parvesh Garg <parv...@zettata.com>
> wrote:
> > > Hi All,
> > >
> > > We have written a RankQuery plugin with a custom TopDocsCollector to
> > > suppress documents below a certain threshold w.r.t. to the maxScore for
> > > that query. It works fine and is reflected well with numFound and start
> > > parameters.
> > >
> > > Our problem lies with facet counts. Even though the docs numFound
> gives a
> > > very less number, the facet counts are still coming from unsuppressed
> > query
> > > results.
> > >
> > > E.g. in a test with a threshold of 20% , we reduced the totalDocs from
> > > 46030 to 6080 but the top facet count on a field is still 20500
> > >
> > > The query parameter we are using looks like rq={!threshold value=0.2}
> > >
> > > Is there a way propagate the suppression of results to FacetsComponent
> as
> > > well? Can we send the same rq to FacetsComponent ?
> > >
> > >
> > >
> > > Regards,
> > > Parvesh Garg,
> > >
> > > http://www.zettata.com
> >
>

Reply via email to