Thanks Rich and Alexandre. I'll probably test out the CJKTokenizer as well. I previously had some issues with Paoding in Solr 5.2.1, but I haven't tested it on 5.3.0 yet.

Regards,
Edwin

On 23 September 2015 at 23:23, Alexandre Rafalovitch <arafa...@gmail.com> wrote:
> You may find the following articles interesting:
>
> http://discovery-grindstone.blogspot.ca/2014/01/searching-in-solr-analyzing-results-and.html
> (a whole epic journey)
> https://dzone.com/articles/indexing-chinese-solr
>
> Regards,
> Alex.
> ----
> Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter:
> http://www.solr-start.com/
>
>
> On 23 September 2015 at 10:41, Zheng Lin Edwin Yeo <edwinye...@gmail.com>
> wrote:
> > Hi,
> >
> > I would like to check: will StandardTokenizerFactory work well for
> > indexing both English and Chinese (bilingual) documents, or do we need
> > tokenizers that are customised for Chinese (e.g. HMMChineseTokenizerFactory)?
> >
> >
> > Regards,
> > Edwin