Hi Jakub,
I have ported the KStemmer for use with the most recent Solr trunk version.
The stemmer jar is located in Solr's lib directory, "solr/lib/KStemmer-2.00.jar",
because it belongs to Solr.
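If you would rather not copy the jar into that directory, a <lib> directive in solrconfig.xml is another way to put it on the classpath. This is only a sketch and not my actual setup: the paths and the jar name below are placeholders, and you should check that your Solr version supports the directive.

```xml
<!-- solrconfig.xml: load every jar found in a directory (placeholder path) -->
<lib dir="/path/to/extra/jars" />
<!-- or load one specific jar (placeholder file name) -->
<lib path="/path/to/extra/jars/my-stemmer.jar" />
```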
Write your stemmer as a FilterFactory and use it as a filter, like this:
<filter class="de.ubbielefeld.solr.analysis.KStemFilterFactory"
        protected="protwords.txt" />
This is what my fieldType looks like:
<fieldType name="text_kstem" class="solr.TextField"
           positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.WhitespaceTokenizerFactory" />
    <filter class="solr.StopFilterFactory" ignoreCase="true"
            words="stopwords.txt" enablePositionIncrements="false" />
    <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
            generateNumberParts="1" catenateWords="1" catenateNumbers="1"
            catenateAll="0" splitOnCaseChange="1" />
    <filter class="solr.LowerCaseFilterFactory" />
    <filter class="de.ubbielefeld.solr.analysis.KStemFilterFactory"
            protected="protwords.txt" />
    <filter class="solr.RemoveDuplicatesTokenFilterFactory" />
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.WhitespaceTokenizerFactory" />
    <filter class="solr.StopFilterFactory" ignoreCase="true"
            words="stopwords.txt" />
    <filter class="solr.WordDelimiterFilterFactory" generateWordParts="1"
            generateNumberParts="1" catenateWords="0" catenateNumbers="0"
            catenateAll="0" splitOnCaseChange="1" />
    <filter class="solr.LowerCaseFilterFactory" />
    <filter class="de.ubbielefeld.solr.analysis.KStemFilterFactory"
            protected="protwords.txt" />
    <filter class="solr.RemoveDuplicatesTokenFilterFactory" />
  </analyzer>
</fieldType>
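Applied to your Polish stemmer, the same pattern might look roughly like the sketch below. Note that the analyzer element carries a tokenizer and filters, rather than pointing its class attribute at a filter factory. The factory class name is hypothetical (I copied it from your mail). I do not know whether the Stempel jar ships such a factory, so you may have to write it yourself.

```xml
<fieldType name="text_pl" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <tokenizer class="solr.WhitespaceTokenizerFactory" />
    <filter class="solr.LowerCaseFilterFactory" />
    <!-- hypothetical class name: replace it with the factory you really have on the classpath -->
    <filter class="org.geoopt.solr.analysis.StempelTokenFilterFactory" />
  </analyzer>
</fieldType>
```

Without type="index" or type="query", the one analyzer is used for both indexing and querying.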
Regards,
Bernd
On 28.10.2010 14:56, Jakub Godawa wrote:
> Hi!
> There is a Polish stemmer, http://www.getopt.org/stempel/, and I have
> problems connecting it with Solr 1.4.1.
> Questions:
>
> 1. Where EXACTLY do I put the "stempel-1.0.jar" file?
> 2. How do I register the file, so I can build a fieldType like:
>
> <fieldType name="text_pl" class="solr.TextField">
> <analyzer class="org.geoopt.solr.analysis.StempelTokenFilterFactory"/>
> </fieldType>
>
> 3. Is that the right approach to make it work?
>
> Thanks for a detailed explanation,
> Jakub.