Hi, all, I am using tika language detection. It is said that, if "langid.langsField" is set as multivalued string, and then a list of languages can be stored for the fields specified in "langid.fl".
Following is how I configure the processor in soleconfig.xml. I tried using "text" only, and the detected result is language_s="zh_tw"; for "attr_stream_name", the result is language_s="en". I was expecting, when adding both "text" and "attr_stream_name", the result would look like language_s="en,zh_tw". However, I failed to see the result. <updateRequestProcessorChain name="langid"> <processor class="org.apache.solr.update.processor.LangDetectLanguageIdentifierUpdateProcessorFactory"> <lst name="defaults"> <str name="langid.fl">text,attr_stream_name</str> <str name="langid.langsField" multiValued="true">language_s</str> <bool name="langid.map.individual">true</bool> </lst> </processor> </updateRequestProcessorChain> I will be grateful if anyone can point my mistake or give some hints how to do the correct things. Thank you. Best Regards, Bing -- View this message in context: http://lucene.472066.n3.nabble.com/How-to-define-a-multivalued-string-type-langid-langsField-in-solrconfig-xml-tp3779602p3779602.html Sent from the Solr - User mailing list archive at Nabble.com.