If you use CharFilter, you should use "CharStream aware" Tokenizer to
correct terms offsets.
There are two CharStreamAware*Tokenizer in trunk/Solr 1.4.
Probably you want to use CharStreamAwareCJKTokenizer(Factory).
Koji
Ashish P wrote:
After this should I be using same cjkAnalyzer or use charFilter??
Thanks,
Ashish
Koji Sekiguchi-2 wrote:
Ashish P wrote:
I want to convert half width katakana to full width katakana. I tried
using
cjk analyzer but not working.
Does cjkAnalyzer do it or is there any other way??
CharFilter which comes with trunk/Solr 1.4 just covers this type of
problem.
If you are using Solr 1.3, try the patch attached below:
https://issues.apache.org/jira/browse/SOLR-822
Koji