: I got the error below after adding CJKTokenizer to schema.xml. I : checked the constructor of CJKTokenizer, it requires a Reader parameter, : I guess that's why I get this error, I searched the email archive, it : seems working for other users. Does anyone know what is the problem?
You can use any Lucene "Analyzers" that has a default constructor as is by declaring it in the <analyzer> declaration (the example schema.xml shows this using the GreekAnalyzer) os you could use the CJKAnalyzer directly ... if you want to use a Lucene "Tokenizer" you need a simple Solr "TokenizerFactory" to generate instances of it. writting a TokenizerFactory is easy, they can be simple -- really, REALLY simple ... most of the ones in the Solr code base have more lines of License text then they do of code... http://svn.apache.org/viewvc/lucene/solr/trunk/src/java/org/apache/solr/analysis/LowerCaseTokenizerFactory.java?view=markup http://wiki.apache.org/solr/SolrPlugins#head-718653697f60b44092280c8c506077e0933e3668 http://lucene.apache.org/solr/api/org/apache/solr/analysis/TokenizerFactory.html -Hoss