Re: Extracting important multi term phrases from the text

2018-11-20 Thread Pratik Patel
browse/LUCENE-4065 > > > > > > > > > > For now, I am trying to find a workaround using > > > > PatternReplaceFilterFactory. > > > > > > > > > > Regards, > > > > > Pratik > > > > > > > > > > On Thu, Nov

Re: Extracting important multi term phrases from the text

2018-11-16 Thread Alexandre Rafalovitch
> Pratik > > > > > > > > On Thu, Nov 15, 2018 at 4:15 PM Markus Jelsma < > > > markus.jel...@openindex.io> > > > > wrote: > > > > > > > > > Hello Pratik, > > > > > > > > > > We would use ShingleFilter for this indee

Re: Extracting important multi term phrases from the text

2018-11-16 Thread David Hastings
, > > > > Pratik > > > > > > > > On Thu, Nov 15, 2018 at 4:15 PM Markus Jelsma < > > > markus.jel...@openindex.io> > > > > wrote: > > > > > > > > > Hello Pratik, > > > > > > > > > > We would use ShingleFilt

Re: Extracting important multi term phrases from the text

2018-11-16 Thread Pratik Patel
lter for this indeed. If you only want > > > > bigrams/shingles, don't forget to disable outputUnigrams and set both > > > > shinle size limits to 2. > > > > > > > > Regards, > > > > Markus > > > > > > > > -Or

Re: Extracting important multi term phrases from the text

2018-11-16 Thread David Hastings
ze limits to 2. > > > > > > Regards, > > > Markus > > > > > > -Original message- > > > > From:Pratik Patel > > > > Sent: Thursday 15th November 2018 17:00 > > > > To: solr-user@lucene.apache.org > > > >

Re: Extracting important multi term phrases from the text

2018-11-15 Thread Alexandre Rafalovitch
> > Regards, > > Markus > > > > -Original message----- > > > From:Pratik Patel > > > Sent: Thursday 15th November 2018 17:00 > > > To: solr-user@lucene.apache.org > > > Subject: Extracting important multi term phrases from the text > >

Re: Extracting important multi term phrases from the text

2018-11-15 Thread Walter Underwood
M Markus Jelsma >> wrote: >> >>> Hello Pratik, >>> >>> We would use ShingleFilter for this indeed. If you only want >>> bigrams/shingles, don't forget to disable outputUnigrams and set both >>> shinle size limits to 2. >>> &

RE: Extracting important multi term phrases from the text

2018-11-15 Thread Markus Jelsma
works for you. Rergards, Markus -Original message- > From:Pratik Patel > Sent: Thursday 15th November 2018 23:16 > To: solr-user@lucene.apache.org > Subject: Re: Extracting important multi term phrases from the text > > Hi Markus, > > Thanks for

Re: Extracting important multi term phrases from the text

2018-11-15 Thread Pratik Patel
s > > -Original message- > > From:Pratik Patel > > Sent: Thursday 15th November 2018 17:00 > > To: solr-user@lucene.apache.org > > Subject: Extracting important multi term phrases from the text > > > > Hello Everyone, > > > > Standard way of t

RE: Extracting important multi term phrases from the text

2018-11-15 Thread Markus Jelsma
To: solr-user@lucene.apache.org > Subject: Extracting important multi term phrases from the text > > Hello Everyone, > > Standard way of tokenizing in solr would divide the text by white space in > solr. > > Is there a way by which we can index multi-term phrases like &q

Extracting important multi term phrases from the text

2018-11-15 Thread Pratik Patel
Hello Everyone, Standard way of tokenizing in solr would divide the text by white space in solr. Is there a way by which we can index multi-term phrases like "Machine Learning" instead of "Machine", "Learning"? Is it possible to create a specific field type for such phrases which has its own inde