Re: Automatic Language Identification

2016-07-01 Thread William Bell
ge- > From: Markus Jelsma [mailto:markus.jel...@openindex.io] > Sent: Wednesday, June 22, 2016 8:27 AM > To: solr-user@lucene.apache.org; solr-user > Subject: RE: Automatic Language Identification > > Hello, > > I recommend using the langdetect language detector, it supports

RE: Automatic Language Identification

2016-07-01 Thread Allison, Timothy B.
ssage- From: Markus Jelsma [mailto:markus.jel...@openindex.io] Sent: Wednesday, June 22, 2016 8:27 AM To: solr-user@lucene.apache.org; solr-user Subject: RE: Automatic Language Identification Hello, I recommend using the langdetect language detector, it supports many more languages and has

RE: Automatic Language Identification

2016-06-22 Thread Markus Jelsma
Hello, I recommend using the langdetect language detector, it supports many more languages and has much higher precission than Tika's detector. Markus -Original message- > From:Alexandre Rafalovitch > Sent: Wednesday 22nd June 2016 12:32 > To: solr-user > Subje

Re: Automatic Language Identification

2016-06-22 Thread Alexandre Rafalovitch
In both cases, the issues seems to be related to the library not being loaded. For Tika identifier, I believe it is solr-langid-.jar, for the sia.* it is whatever the book recommended. Are you running SolrCloud? Additional libraries are slightly complicated with that, you need to make sure they ar