Wiadomość napisana w dniu 2008-09-12, o godz. 17:58, przez Andrzej Bialecki:

ok .. that?

 <fieldType name="text_po" class="solr.TextField">
    <analyzer
class="com.redefine.solr.analysis.StempelTokenFilterFactoryr"/>
  </fieldType>



I recommend using Stempelator (or Morfologik) for Polish stemming and lemmatization. It provides a superset of Stempel features, namely in addition to the algorithmic stemming it provides a dictionary-based stemming, and these two methods nicely complement each other.


I'm not familiar with Java enough to do anything more complicated than write some wrapping factory class. Stempel seems to have such classes to wrap, but I did not found any Lucene analyzer that uses Morfologik. Or am I completely wrong and it should be plugged into Solr in completely different way?

--
We read Knuth so you don't have to. - Tim Peters

Jarek Zgoda, R&D, Redefine
[EMAIL PROTECTED]

Reply via email to