Hi All,

Sorry for the delayed response. I was out of the office for the last few days and was not able to reply. Thanks for the information.
We have a use case where one sentence is the unit token on which we need to run normalization and semantic analysis. We still need to finalize the type of normalizer and analyzer, but I was trying to see whether Solr has any built-in libraries, so that no cross-language integration is required. I will get back once I know whether something works or not.

@Susheel, thanks, I will try to see if that works.

Thanks,
Sandeep

On Sep 8, 2014 12:54 PM, "Sandeep B A" <belgavi.sand...@gmail.com> wrote:

> Hi Susheel,
> Thanks for the information.
> I have crawled a few websites and all I need is a sentence tokenizer for
> the data I have collected.
> These websites are English only.
>
> Well, I don't have experience in writing custom sentence tokenizers for
> Solr. Is there any tutorial link which tells how to do it?
>
> Is it possible to integrate NLTK with Solr? If yes, how to do it? Because I
> found sentence tokenizers for English in NLTK.
>
> Thanks,
> Sandeep
> On Sep 5, 2014 8:10 PM, "Sandeep B A" <belgavi.sand...@gmail.com> wrote:
>
>> Sorry for the typo, it is solr 4.9.0 instead of sold 4.9.0
>> On Sep 5, 2014 7:48 PM, "Sandeep B A" <belgavi.sand...@gmail.com> wrote:
>>
>>> Hi,
>>>
>>> I was looking at the options for default sentence tokenizers in Solr
>>> but could not find any. Has anyone used one, or integrated a tokenizer
>>> from another language, for example Python, into Solr? Please let me know.
>>>
>>>
>>> Thanks and regards,
>>> Sandeep
>>>
>>
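
P.S. To make the use case at the top of this message concrete, here is a rough sketch of treating one sentence as the unit before indexing. It is only an illustration under assumptions: the regex boundary rule is a naive stand-in for a real sentence tokenizer (such as NLTK's sent_tokenize), the normalize step is a placeholder, and the field names (sentence_s, normalized_s) and the page1 id are made up. Nothing here calls Solr itself.

```python
import re

# Naive sentence boundary: '.', '!' or '?' followed by whitespace and an
# uppercase letter. Illustrative stand-in only; this is not NLTK's Punkt
# model and not a Solr API.
_BOUNDARY = re.compile(r"(?<=[.!?])\s+(?=[A-Z])")


def split_sentences(text):
    """Split English text into sentences using the naive boundary rule."""
    return [s.strip() for s in _BOUNDARY.split(text) if s.strip()]


def normalize(sentence):
    """Hypothetical normalization: lowercase and collapse whitespace runs."""
    return " ".join(sentence.lower().split())


def to_docs(page_id, text):
    """Build one dict per sentence; the field names are assumptions."""
    return [
        {"id": f"{page_id}-{i}", "sentence_s": s, "normalized_s": normalize(s)}
        for i, s in enumerate(split_sentences(text))
    ]


docs = to_docs(
    "page1",
    "Solr is a search server. It is built on Lucene!  Does it split sentences?",
)
for d in docs:
    print(d)
```

Each dict could then be posted to Solr as its own document, so that every sentence becomes a separately searchable unit. A rule this simple will break on abbreviations such as "Dr. Smith", which is the main reason to swap in a trained tokenizer.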