Message written on 2008-09-12 at 17:58 by Andrzej Bialecki:
ok .. that?
<fieldType name="text_po" class="solr.TextField">
  <analyzer class="com.redefine.solr.analysis.StempelTokenFilterFactory"/>
</fieldType>
I recommend using Stempelator (or Morfologik) for Polish stemming
and lemmatization. It provides a superset of Stempel's features:
in addition to algorithmic stemming it provides dictionary-based
stemming, and the two methods complement each other nicely.
I'm not familiar enough with Java to do anything more complicated than
write a wrapping factory class. Stempel seems to have classes that can
be wrapped, but I did not find any Lucene analyzer that uses Morfologik.
Or am I completely wrong, and should it be plugged into Solr in a
completely different way?
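
One possible shape, as a sketch only: Solr's schema.xml also lets an
analyzer be declared as a chain of one tokenizer followed by filters, so
a wrapping factory could be referenced from a <filter> element instead of
the analyzer's class attribute. The class
com.redefine.solr.analysis.StempelTokenFilterFactory is the hypothetical
wrapper from the fieldType above and is assumed to implement Solr's token
filter factory interface and to be on Solr's classpath; the whitespace
tokenizer and lower-case filter are illustrative choices, not a
recommendation.

```xml
<fieldType name="text_po" class="solr.TextField">
  <analyzer>
    <!-- Split the input into tokens first; the tokenizer choice is an assumption. -->
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <!-- Normalize case before stemming (illustrative, optional). -->
    <filter class="solr.LowerCaseFilterFactory"/>
    <!-- Hypothetical wrapper factory around the Stempel stemmer. -->
    <filter class="com.redefine.solr.analysis.StempelTokenFilterFactory"/>
  </analyzer>
</fieldType>
```

If that is roughly right, a Morfologik-based filter would presumably slot
into the same chain through a similar wrapper factory.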
--
We read Knuth so you don't have to. - Tim Peters
Jarek Zgoda, R&D, Redefine
[EMAIL PROTECTED]