So then you do run your POS tagger at query-time, Dave?

-- 
Audrey Lorberfeld
Data Scientist, w3 Search
IBM
audrey.lorberf...@ibm.com
 

On 10/25/19, 12:06 PM, "David Hastings" <hastings.recurs...@gmail.com> wrote:

    I use them for query boosting, so if someone searches for:
    
    i dont want to rush limbaugh out the door
    vs
    i talked to rush limbaugh through the door
    
    my documents where 'rush limbaugh' is a known entity (noun) and a person
    (look at the sentence, its obviously a person and the nlp finds that) have
    'rush limbaugh' stored in a field, which is boosted on queries.  this makes
    sure results from the second query with him as a person will be boosted
    above those from the first query
    
    
    
    
    
    
    
    
    
    
    
    
    On Fri, Oct 25, 2019 at 11:57 AM Nicolas Paris <nicolas.pa...@riseup.net>
    wrote:
    
    > Also we are using stanford POS tagger for french. The processing time is
    > mitigated by the spark-corenlp package which distribute the process over
    > multiple node.
    >
    > Also I am interesting in the way you use POS information within solr
    > queries, or solr fields.
    >
    > Thanks,
    > On Fri, Oct 25, 2019 at 10:42:43AM -0400, David Hastings wrote:
    > > ah, yeah its not the fastest but it proved to be the best for my
    > purposes,
    > > I use it to pre-process data before indexing, to apply more metadata to
    > the
    > > documents in a separate field(s)
    > >
    > > On Fri, Oct 25, 2019 at 10:40 AM Audrey Lorberfeld -
    > > audrey.lorberf...@ibm.com <audrey.lorberf...@ibm.com> wrote:
    > >
    > > > No, I meant for part-of-speech tagging __ But that's interesting that
    > you
    > > > use StanfordNLP. I've read that it's very slow, so we are concerned
    > that it
    > > > might not work for us at query-time. Do you use it at query-time, or
    > just
    > > > index-time?
    > > >
    > > > --
    > > > Audrey Lorberfeld
    > > > Data Scientist, w3 Search
    > > > IBM
    > > > audrey.lorberf...@ibm.com
    > > >
    > > >
    > > > On 10/25/19, 10:30 AM, "David Hastings" <hastings.recurs...@gmail.com
    > >
    > > > wrote:
    > > >
    > > >     Do you mean for entity extraction?
    > > >     I make a LOT of use from the stanford nlp project, and get out the
    > > > entities
    > > >     and use them for different purposes in solr
    > > >     -Dave
    > > >
    > > >     On Fri, Oct 25, 2019 at 10:16 AM Audrey Lorberfeld -
    > > >     audrey.lorberf...@ibm.com <audrey.lorberf...@ibm.com> wrote:
    > > >
    > > >     > Hi All,
    > > >     >
    > > >     > Does anyone use a POS tagger with their Solr instance other than
    > > >     > OpenNLP’s? We are considering OpenNLP, SpaCy, and Watson.
    > > >     >
    > > >     > Thanks!
    > > >     >
    > > >     > --
    > > >     > Audrey Lorberfeld
    > > >     > Data Scientist, w3 Search
    > > >     > IBM
    > > >     > audrey.lorberf...@ibm.com
    > > >     >
    > > >     >
    > > >
    > > >
    > > >
    >
    > --
    > nicolas
    >
    

Reply via email to