oh i see what you mean, sorry, i explained it incorrectly.
those sentences are what would be in the index, and a general search for
'rush limbaugh' would rank results where he is a tagged entity higher
than results where it is just two words in a sentence
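
A minimal sketch of the query-time boost described above, assuming the standard Solr edismax parser and its `qf` (query fields) parameter. The field names `text` and `person_entities`, the boost value, and the helper `build_boosted_query` are hypothetical and only for illustration; the thread does not say how the actual schema or request is set up.

```python
from urllib.parse import urlencode


def build_boosted_query(user_query, text_field="text",
                        entity_field="person_entities", entity_boost=5.0):
    """Build edismax parameters that weight an entity field above the body text.

    Field names and the boost factor are assumptions, not taken from the thread.
    """
    # Escape embedded double quotes so the phrase query stays well-formed.
    phrase = user_query.replace('"', '\\"')
    return {
        "defType": "edismax",
        # Search the terms as a phrase in every field listed in qf.
        "q": f'"{phrase}"',
        # A match in the entity field counts entity_boost times as much
        # as a match in the plain text field.
        "qf": f"{text_field} {entity_field}^{entity_boost}",
    }


params = build_boosted_query("rush limbaugh")
# Query string that could be appended to a Solr /select URL.
query_string = urlencode(params)
print(query_string)
```

With parameters like these, a document whose entity field holds 'rush limbaugh' (filled in by the NLP step at index time) should score above one where the two words only occur in the body text, without running any tagger at query time.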

On Fri, Oct 25, 2019 at 12:12 PM David Hastings <
hastings.recurs...@gmail.com> wrote:

> nope, i boost the fields already tagged at query time against the query
>
> On Fri, Oct 25, 2019 at 12:11 PM Audrey Lorberfeld -
> audrey.lorberf...@ibm.com <audrey.lorberf...@ibm.com> wrote:
>
>> So then you do run your POS tagger at query-time, Dave?
>>
>> --
>> Audrey Lorberfeld
>> Data Scientist, w3 Search
>> IBM
>> audrey.lorberf...@ibm.com
>>
>>
>> On 10/25/19, 12:06 PM, "David Hastings" <hastings.recurs...@gmail.com>
>> wrote:
>>
>>     I use them for query boosting, so if someone searches for:
>>
>>     i dont want to rush limbaugh out the door
>>     vs
>>     i talked to rush limbaugh through the door
>>
>>     my documents where 'rush limbaugh' is a known entity (noun) and a
>>     person (look at the sentence, it's obviously a person and the nlp
>>     finds that) have 'rush limbaugh' stored in a field, which is boosted
>>     on queries.  this makes sure results from the second query with him
>>     as a person will be boosted above those from the first query
>>
>>     On Fri, Oct 25, 2019 at 11:57 AM Nicolas Paris <nicolas.pa...@riseup.net>
>>     wrote:
>>
>>     > Also we are using stanford POS tagger for french. The processing
>>     > time is mitigated by the spark-corenlp package, which distributes
>>     > the process over multiple nodes.
>>     >
>>     > Also I am interested in the way you use POS information within solr
>>     > queries, or solr fields.
>>     >
>>     > Thanks,
>>     > On Fri, Oct 25, 2019 at 10:42:43AM -0400, David Hastings wrote:
>>     > > ah, yeah it's not the fastest but it proved to be the best for my
>>     > > purposes, I use it to pre-process data before indexing, to apply
>>     > > more metadata to the documents in a separate field(s)
>>     > >
>>     > > On Fri, Oct 25, 2019 at 10:40 AM Audrey Lorberfeld -
>>     > > audrey.lorberf...@ibm.com <audrey.lorberf...@ibm.com> wrote:
>>     > >
>>     > > > No, I meant for part-of-speech tagging. But that's interesting
>>     > > > that you use StanfordNLP. I've read that it's very slow, so we
>>     > > > are concerned that it might not work for us at query-time. Do
>>     > > > you use it at query-time, or just index-time?
>>     > > >
>>     > > > --
>>     > > > Audrey Lorberfeld
>>     > > > Data Scientist, w3 Search
>>     > > > IBM
>>     > > > audrey.lorberf...@ibm.com
>>     > > >
>>     > > >
>>     > > > On 10/25/19, 10:30 AM, "David Hastings" <hastings.recurs...@gmail.com>
>>     > > > wrote:
>>     > > >
>>     > > >     Do you mean for entity extraction?
>>     > > >     I make a LOT of use of the stanford nlp project, and get
>>     > > >     out the entities and use them for different purposes in solr
>>     > > >     -Dave
>>     > > >
>>     > > >     On Fri, Oct 25, 2019 at 10:16 AM Audrey Lorberfeld -
>>     > > >     audrey.lorberf...@ibm.com <audrey.lorberf...@ibm.com> wrote:
>>     > > >
>>     > > >     > Hi All,
>>     > > >     >
>>     > > >     > Does anyone use a POS tagger with their Solr instance
>>     > > >     > other than OpenNLP’s? We are considering OpenNLP, SpaCy,
>>     > > >     > and Watson.
>>     > > >     >
>>     > > >     > Thanks!
>>     > > >     >
>>     > > >     > --
>>     > > >     > Audrey Lorberfeld
>>     > > >     > Data Scientist, w3 Search
>>     > > >     > IBM
>>     > > >     > audrey.lorberf...@ibm.com
>>     > > >     >
>>     > > >     >
>>     > > >
>>     > > >
>>     > > >
>>     >
>>     > --
>>     > nicolas
>>     >
>>
>>
>>
