Hi all,

I am using Tika language detection. The documentation says that if
"langid.langsField" is set to a multivalued string field, a list of
languages can be stored for the fields specified in "langid.fl".

Below is how I configured the processor in solrconfig.xml. With only "text"
in "langid.fl", the detected result is language_s="zh_tw"; with only
"attr_stream_name", the result is language_s="en". I expected that with
both "text" and "attr_stream_name" listed, the result would look like
language_s="en,zh_tw". However, I did not get that result.


 <updateRequestProcessorChain name="langid">
   <processor
       class="org.apache.solr.update.processor.LangDetectLanguageIdentifierUpdateProcessorFactory">
     <lst name="defaults">
       <str name="langid.fl">text,attr_stream_name</str>
       <str name="langid.langsField" multiValued="true">language_s</str>
       <bool name="langid.map.individual">true</bool>
     </lst>
   </processor>
 </updateRequestProcessorChain>
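
For comparison, here is a minimal sketch of the alternative I am wondering about: declaring the multivalued property on the field definition in schema.xml instead of on the <str> parameter. This is only an assumption on my side and not something I have confirmed; the field name "languages" is hypothetical.

```xml
<!-- schema.xml (sketch): a hypothetical multivalued string field
     that could hold one language code per detected language -->
<field name="languages" type="string" indexed="true" stored="true"
       multiValued="true"/>

<!-- solrconfig.xml (sketch): point langid.langsField at that field,
     without a multiValued attribute on the parameter itself -->
<str name="langid.langsField">languages</str>
```

If this is the right direction, I would expect the codes to come back as separate values of the field rather than as a single comma-separated string.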


I would be grateful if anyone could point out my mistake or give some hints
on how to do this correctly. Thank you.

Best Regards, 
Bing 

--
View this message in context: 
http://lucene.472066.n3.nabble.com/How-to-define-a-multivalued-string-type-langid-langsField-in-solrconfig-xml-tp3779602p3779602.html
Sent from the Solr - User mailing list archive at Nabble.com.