What I am doing right now is to capture all the content under "content_korea" for example, use 'copyField' to duplicate that content to "content_english". "content_korea" gets processed with CJK analyzers, and "content_english" gets processed with usual detailed index/query analyzers, filters, synonyms. Some results do come up, but I have not been able to verify that this approach is yielding better results.
A related question. What does 'copyField' actually do? Does it 'append' content from the source field to the 'target' field? Or does it replace/overwrite it? Thank you. - ashok hossman wrote: > > > : I have documents where text from two languages, e.g. (english & korean) > or > : (english & german) are mixed u p in a fairly intensive way. 20-30% of > the > > if you search the list archives you'll find a lot of results for > "languages" ... it's not something i deal with much but i believe using > separate fields (or dynamic fields) for each language is considered the > best strategy. > > > > > -Hoss > > > -- View this message in context: http://www.nabble.com/More-than-one-language-in-the-same-document-tp22726478p22939331.html Sent from the Solr - User mailing list archive at Nabble.com.