Thank you Emir, we will give this a try.

Russ


On Thu, May 7, 2020 at 12:55 AM Emir Arnautović <
emir.arnauto...@sematext.com> wrote:

> Hi Russel,
> You are right about mm - it is about min term matches. Frequencies are
> usually used to determine score. But you can also filter on number of
> matches using function queries:
> fq={!frange l=3}sum(termfreq(field, ‘barker’), termfreq(field, ‘jones’),
> termfreq(field, ‘baker’))
>
> It is not perfect and you will need to handle phrases at index time to be
> able to match phrases. Or you can combine it with some other query to
> filter out unwanted results and use this approach to make sure frequencies
> match.
>
> HTH,
> Emir
> --
> Monitoring - Log Management - Alerting - Anomaly Detection
> Solr & Elasticsearch Consulting Support Training - http://sematext.com/
>
>
>
> > On 7 May 2020, at 03:12, Russell Bahr <r...@manzama.com> wrote:
> >
> > Hi Atita,
> > We actually looked into that and it does not appear to match based on a
> > single phrase, but says that it must match a certain percentage of the
> > listed phrases.  What we need is something that would match based on a
> > single phrase appearing a minimum number of times i.e. "Barker" minimum
> > number of matches =3 where "Barker" showed up in a document 3 or more
> times.
> >
> > Am I missing something there or am I reading this wrong?
> > The mm (Minimum Should Match) Parameter When processing queries,
> > Lucene/Solr recognizes three types of clauses: mandatory, prohibited, and
> > "optional" (also known as "should" clauses). By default, all words or
> > phrases specified in the q parameter are treated as "optional" clauses
> > unless they are preceded by a "+" or a "-". When dealing with these
> > "optional" clauses, the mm parameter makes it possible to say that a
> > certain minimum number of those clauses must match. The DisMax query
> parser
> > offers great flexibility in how the minimum number can be specified.
> >
> > We did try doing a query and the results that came back were reflective
> > only of minimum number of phrases matching as opposed to a phrase being
> > mentioned a minimum number of times.
> >
> > For example, If I say query for “Google” with mm=100 it doesn’t find
> > Articles with 100 mentions of Google.  It is used for multiple phrase
> > queries.  Example against our servers:
> >
> > query = "Barker" OR "Jones" OR “Baker” mm=1 103,896 results
> > query = "Barker" OR "Jones" OR “Baker” mm=2 1200 results
> > query = "Barker" OR "Jones" OR “Baker” mm=3 16 results
> >
> > Please let me know.
> > Thank you,
> > Russ
> >
> >
> >
> > On Wed, May 6, 2020 at 10:13 AM Atita Arora <atitaar...@gmail.com>
> wrote:
> >
> >> Hi,
> >>
> >> Did you happen to look into :
> >>
> >>
> >>
> https://lucene.apache.org/solr/guide/6_6/the-dismax-query-parser.html#TheDisMaxQueryParser-Themm_MinimumShouldMatch_Parameter
> >>
> >> I believe 6.5.1 has it too.
> >>
> >> I hope it should help.
> >>
> >>
> >> On Wed, May 6, 2020 at 6:46 PM Russell Bahr <rb...@diligent.com> wrote:
> >>
> >>> Hi SOLR team,
> >>> I have been asked if there is a way to return results only if those
> >>> results match a minimum number of times present in the query.
> >>> ( queries looking for a minimum amount of mentions for a particular
> >>> term/phrase. Ie must be mentioned 'x' amount of times to return
> results).
> >>> Is this something that is possible using SOLR 6.5.1?  Is this something
> >>> that would require a newer version of SOLR?
> >>> Any help on this would be appreciated.
> >>> Thank you,
> >>> Russ
> >>>
> >>
>
>

Reply via email to