Re: Can StandardTokenizerFactory work well for Chinese and English (Bilingual)?

Zheng Lin Edwin Yeo Wed, 23 Sep 2015 09:33:00 -0700

Thanks Rich and Alexandre,

I'll probably test out the CJKTokenizer as well.
Previously I had some issues with Paoding in Solr 5.2.1, but I haven't
tested it on 5.3.0 yet.


Regards,
Edwin


On 23 September 2015 at 23:23, Alexandre Rafalovitch <arafa...@gmail.com>
wrote:

> You may find the following articles interesting:
>
> http://discovery-grindstone.blogspot.ca/2014/01/searching-in-solr-analyzing-results-and.html
> (a whole epic journey)
> https://dzone.com/articles/indexing-chinese-solr
>
> Regards,
>    Alex.
> ----
> Solr Analyzers, Tokenizers, Filters, URPs and even a newsletter:
> http://www.solr-start.com/
>
>
> On 23 September 2015 at 10:41, Zheng Lin Edwin Yeo <edwinye...@gmail.com>
> wrote:
> > Hi,
> >
> > I would like to check: will StandardTokenizerFactory work well for indexing
> > both English and Chinese (bilingual) documents, or do we need tokenizers
> > that are customised for Chinese (e.g. HMMChineseTokenizerFactory)?
> >
> >
> > Regards,
> > Edwin
>
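
For readers following the thread, here is a rough Python sketch of the trade-off behind the question. It is not Lucene or Solr code. It only imitates two behaviours: StandardTokenizer emitting each Han character as its own token, and CJK-style bigramming, which emits overlapping pairs of adjacent Han characters. The function names and the restriction to the basic CJK Unified Ideographs block are assumptions made for illustration.

```python
import re

# Assumption: only the basic CJK Unified Ideographs block counts as "Han" here.
HAN = "\u4e00-\u9fff"
# A run is either consecutive Han characters or consecutive ASCII letters/digits.
RUN_RE = re.compile(f"[{HAN}]+|[A-Za-z0-9]+")


def is_han_run(run):
    """True if the run consists of Han characters (checking the first is enough)."""
    return "\u4e00" <= run[0] <= "\u9fff"


def unigram_tokens(text):
    """Imitate StandardTokenizer: whole Latin words, one token per Han character."""
    tokens = []
    for run in RUN_RE.findall(text):
        if is_han_run(run):
            tokens.extend(run)  # each character becomes its own token
        else:
            tokens.append(run)
    return tokens


def bigram_tokens(text):
    """Imitate CJK bigramming: overlapping character pairs within each Han run."""
    tokens = []
    for run in RUN_RE.findall(text):
        if is_han_run(run) and len(run) > 1:
            tokens.extend(run[i:i + 2] for i in range(len(run) - 1))
        else:
            tokens.append(run)  # Latin word, or a lone Han character
    return tokens


if __name__ == "__main__":
    sample = "Solr 支持中文搜索"
    print(unigram_tokens(sample))  # ['Solr', '支', '持', '中', '文', '搜', '索']
    print(bigram_tokens(sample))   # ['Solr', '支持', '持中', '中文', '文搜', '搜索']
```

Neither output is a dictionary-based word segmentation. A segmenter such as the HMMChineseTokenizerFactory mentioned above aims to emit actual words, so which approach suits a bilingual index is something to test against real queries.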
