The package Javadoc for Solr analysis is a good start:
http://lucene.apache.org/solr/4_1_0/solr-core/org/apache/solr/analysis/package-tree.html
Especially the AbstractAnalysisFactory:
http://lucene.apache.org/core/4_1_0/analyzers-common/org/apache/lucene/analysis/util/AbstractAnalysisFactory.html
Also, look at the various "factories" in solrconfig.xml for other Solr
extension points.
Including search components, spellcheckers, etc.
-- Jack Krupansky
-----Original Message-----
From: Alexandre Rafalovitch
Sent: Thursday, February 28, 2013 10:32 PM
To: solr-user@lucene.apache.org
Subject: What makes an Analyzer/Tokenizer/CharFilter/etc suitable for Solr?
Hello,
I want to have a unified reference of all different processors one could
use in Solr in various extension points.
I have written a small tool to extract all implementations
of UpdateRequestProcessorFactory, Analyzer, CharFilterFactory, etc
(actually of any root class).
However, I assume not all Lucene Analyzer derivatives can be just plugged
into Solr.
Is it fair to say that the class must:
*) Derive from appropriate root (is there a list of ALL the roots?)
*) Be public and not abstract (though a common sub-root could be)
*) Have a default empty constructor
My preliminary tests seem to indicate this is the case. Am I missing
anything.
Regards,
Alex.
Personal blog: http://blog.outerthoughts.com/
LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch
- Time is the quality of nature that keeps events from happening all at
once. Lately, it doesn't seem to be working. (Anonymous - via GTD book)