Out of curiosity, what would be an example query for your application that
would depend on sentence tokenization, as opposed to simple term
tokenization? I mean, there are no sentence-based query operators in the
Solr query parsers.
-- Jack Krupansky
-----Original Message-----
From: Sandeep B A
Sent: Monday, September 8, 2014 12:24 AM
To: solr-user@lucene.apache.org
Subject: Re: Is there any sentence tokenizers in sold 4.9.0?
Hi Susheel,
Thanks for the information.
I have crawled a few websites, and all I need is sentence tokenization of the
data I have collected.
These websites are English only.
I don't have experience writing custom sentence tokenizers for
Solr. Is there a tutorial that explains how to do it?
Is it possible to integrate NLTK with Solr? If so, how? I ask because I
found sentence tokenizers for English in NLTK.
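To illustrate the kind of sentence splitting I mean, here is a rough sketch using only Python's standard library. It is a naive regex, not NLTK's tokenizer and not anything built into Solr, and the function name split_sentences is just something I made up for the example:

```python
import re

# Naive rule (an assumption for illustration): a sentence ends at '.', '!' or '?'
# when that mark is followed by whitespace and then an uppercase letter.
_SENTENCE_BOUNDARY = re.compile(r"(?<=[.!?])\s+(?=[A-Z])")


def split_sentences(text):
    """Split English text into sentences (hypothetical helper, not a Solr or NLTK API)."""
    return [part.strip() for part in _SENTENCE_BOUNDARY.split(text) if part.strip()]


if __name__ == "__main__":
    sample = "Solr indexes terms. It has no sentence tokenizer by default! Is NLTK an option?"
    for sentence in split_sentences(sample):
        print(sentence)
```

A rule like this splits wrongly on abbreviations such as "Dr. Smith", which is why I was hoping to use a proper tokenizer like the one in NLTK and then send each sentence to Solr.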
Thanks,
Sandeep
On Sep 5, 2014 8:10 PM, "Sandeep B A" <belgavi.sand...@gmail.com> wrote:
Sorry for the typo: it is Solr 4.9.0, not sold 4.9.0.
On Sep 5, 2014 7:48 PM, "Sandeep B A" <belgavi.sand...@gmail.com> wrote:
Hi,
I was looking for a default sentence tokenizer in Solr but
could not find one. Has anyone used one, or integrated a tokenizer from
another language (Python, for example) into Solr? Please let me know.
Thanks and regards,
Sandeep