Oh I see I see -- Audrey Lorberfeld Data Scientist, w3 Search IBM audrey.lorberf...@ibm.com
On 10/25/19, 12:21 PM, "David Hastings" <hastings.recurs...@gmail.com> wrote: oh i see what you mean, sorry, i explained it incorrectly. those sentences are what would be in the index, and a general search for 'rush limbaugh' would come back with results where he is an entity higher than if it was two words in a sentence On Fri, Oct 25, 2019 at 12:12 PM David Hastings < hastings.recurs...@gmail.com> wrote: > nope, i boost the fields already tagged at query time against teh query > > On Fri, Oct 25, 2019 at 12:11 PM Audrey Lorberfeld - > audrey.lorberf...@ibm.com <audrey.lorberf...@ibm.com> wrote: > >> So then you do run your POS tagger at query-time, Dave? >> >> -- >> Audrey Lorberfeld >> Data Scientist, w3 Search >> IBM >> audrey.lorberf...@ibm.com >> >> >> On 10/25/19, 12:06 PM, "David Hastings" <hastings.recurs...@gmail.com> >> wrote: >> >> I use them for query boosting, so if someone searches for: >> >> i dont want to rush limbaugh out the door >> vs >> i talked to rush limbaugh through the door >> >> my documents where 'rush limbaugh' is a known entity (noun) and a >> person >> (look at the sentence, its obviously a person and the nlp finds that) >> have >> 'rush limbaugh' stored in a field, which is boosted on queries. this >> makes >> sure results from the second query with him as a person will be >> boosted >> above those from the first query >> >> >> >> >> >> >> >> >> >> >> >> >> On Fri, Oct 25, 2019 at 11:57 AM Nicolas Paris < >> nicolas.pa...@riseup.net> >> wrote: >> >> > Also we are using stanford POS tagger for french. The processing >> time is >> > mitigated by the spark-corenlp package which distribute the process >> over >> > multiple node. >> > >> > Also I am interesting in the way you use POS information within solr >> > queries, or solr fields. >> > >> > Thanks, >> > On Fri, Oct 25, 2019 at 10:42:43AM -0400, David Hastings wrote: >> > > ah, yeah its not the fastest but it proved to be the best for my >> > purposes, >> > > I use it to pre-process data before indexing, to apply more >> metadata to >> > the >> > > documents in a separate field(s) >> > > >> > > On Fri, Oct 25, 2019 at 10:40 AM Audrey Lorberfeld - >> > > audrey.lorberf...@ibm.com <audrey.lorberf...@ibm.com> wrote: >> > > >> > > > No, I meant for part-of-speech tagging __ But that's >> interesting that >> > you >> > > > use StanfordNLP. I've read that it's very slow, so we are >> concerned >> > that it >> > > > might not work for us at query-time. Do you use it at >> query-time, or >> > just >> > > > index-time? >> > > > >> > > > -- >> > > > Audrey Lorberfeld >> > > > Data Scientist, w3 Search >> > > > IBM >> > > > audrey.lorberf...@ibm.com >> > > > >> > > > >> > > > On 10/25/19, 10:30 AM, "David Hastings" < >> hastings.recurs...@gmail.com >> > > >> > > > wrote: >> > > > >> > > > Do you mean for entity extraction? >> > > > I make a LOT of use from the stanford nlp project, and get >> out the >> > > > entities >> > > > and use them for different purposes in solr >> > > > -Dave >> > > > >> > > > On Fri, Oct 25, 2019 at 10:16 AM Audrey Lorberfeld - >> > > > audrey.lorberf...@ibm.com <audrey.lorberf...@ibm.com> >> wrote: >> > > > >> > > > > Hi All, >> > > > > >> > > > > Does anyone use a POS tagger with their Solr instance >> other than >> > > > > OpenNLP’s? We are considering OpenNLP, SpaCy, and Watson. >> > > > > >> > > > > Thanks! >> > > > > >> > > > > -- >> > > > > Audrey Lorberfeld >> > > > > Data Scientist, w3 Search >> > > > > IBM >> > > > > audrey.lorberf...@ibm.com >> > > > > >> > > > > >> > > > >> > > > >> > > > >> > >> > -- >> > nicolas >> > >> >> >>