Hello,

I recommend using the langdetect language detector, it supports many more 
languages and has much higher precission than Tika's detector.

Markus
 
 
-----Original message-----
> From:Alexandre Rafalovitch <arafa...@gmail.com>
> Sent: Wednesday 22nd June 2016 12:32
> To: solr-user <solr-user@lucene.apache.org>
> Subject: Re: Automatic Language Identification
> 
> In both cases, the issues seems to be related to the library not being
> loaded. For Tika identifier, I believe it is
> solr-langid-<version>.jar, for the sia.* it is whatever the book
> recommended.
> 
> Are you running SolrCloud? Additional libraries are slightly
> complicated with that, you need to make sure they are present on all
> the servers this is deployed to.
> 
> Regards,
>    Alex.
> ----
> Newsletter and resources for Solr beginners and intermediates:
> http://www.solr-start.com/
> 
> 
> On 22 June 2016 at 18:54, Hardika Catur S
> <hardika.sa...@solusi247.com.invalid> wrote:
> > Hi,
> >
> > I will make the collection in the collection solrcloud and used for
> > "automatic language identification" but when they failed to make a
> > collection in his process:
> >
> > 1. The automatic language identification
> >
> >
> > ERROR: Failed to create collection 'coba' due to:
> > org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException:Error
> > from server at http://localhost:8983/solr: Error CREATEing SolrCore
> > 'coba_shard1_replica1': Unable to create core [coba_shard1_replica1] Caused
> > by:
> > org.apache.solr.update.processor.TikaLanguageIdentifierUpdateProcessorFactory
> >
> > The following script in solrconfig.xml :
> >
> > <updateRequestProcessorChain name="langid">
> > <processor name="langid"
> > class="org.apache.solr.update.processor.TikaLanguageIdentifierUpdateProcessorFactory">
> >   <lst name="defaults">
> >     <str name="langid.fl">title, content</str>
> >     <str name="langid.langField">language</str>
> >     <str name="langid.langsField">languages</str>
> >   </lst>
> > </processor>
> > </updateRequestProcessorChain>
> >
> > <requestHandler name="/update" class="solr.UpdateRequestHandler">
> >   <lst name="invariants">
> >     <str name="update.chain">langid</str>
> >   </lst>
> > </requestHandler>
> >
> >
> >
> > 2. Multi Language
> >
> > ERROR: Failed to create collection 'coba' due to:
> > org.apache.solr.client.solrj.impl.HttpSolrClient$RemoteSolrException:Error
> > from server at http://localhost:8983/solr: Error CREATEing SolrCore
> > 'coba_shard1_replica1': Unable to create core [coba_shard1_replica1] Caused
> > by: sia.ch14.MultiTextField
> >
> > The following script in schema.xml :
> >
> >   <fieldType name="text"
> >         class="sia.ch14.MultiTextField" sortMissingLast="true"
> >         defaultFieldType="text_id"
> >         fieldMappings="en:text_en"/>
> >
> > please help me,
> > Thanks,
> > Hardika CS.
> >
> 

Reply via email to