I want to analyze a text based on pattern ";" and separate on whitespace and
it is a Japanese text so use CJKAnalyzer + tokenizer also.
in short I want to do:
<analyzer
class="org.apache.lucene.analysis.cjk.CJKAnalyzer">
<tokenizer class="solr.PatternTokenizerFactory"
pattern=";" />
<tokenizer
class="solr.WhitespaceTokenizerFactory" />
<tokenizer
class="org.apache.lucene.analysis.cjk.CJKTokenizer" />
</analyzer>
Can anyone please tell me how to achieve this?? Because the above syntax is
not at all possible.
--
View this message in context:
http://www.nabble.com/multiple-tokenizers-needed-tp22982382p22982382.html
Sent from the Solr - User mailing list archive at Nabble.com.