Mathieu,
It's not my Kstem. It was written by someone at Umass, Amherst. More info here: 
http://ciir.cs.umass.edu/cgi-bin/downloads/downloads.cgi 

Someone else had already ported it to Lucene. I simply modified that wrapper to 
work with Solr. I'll open an issue for it so that it can (hopefully) be 
integrated into the project.

Cheers... harry

-----Original Message-----
From: Mathieu Lecarme [mailto:[EMAIL PROTECTED] 
Sent: Tuesday, April 22, 2008 3:57 AM
To: solr-user@lucene.apache.org
Subject: Re: better stemming engine than Porter?

Porter stemmer is not only agressive, it is ugly, too. The generated 
code is too old, too  few object centric and should be too slow.
If your kstem compile with java 1.4, why don't you suggest it to lucene 
core?

M.

Wagner,Harry a écrit :
> Hi HH,
> Here's a note I sent Solr-dev a while back:
>
> ---
> I've implemented a Solr plug-in that wraps KStem for Solr use (someone
> else had already written a Lucene wrapper for it).  KStem is considered
> to be more appropriate for library usage since it is much less
> aggressive than Porter (i.e., searches for organization do NOT match on
> organ!). If there is any interest in feeding this back into Solr I would
> be happy to contribute it.
> ---
>
> I believe there was interest in it, but I never opened an issue for it
> and I don't know if it was ever followed-up on. I'd be happy to do that
> now. Can someone on the Solr-dev team point me in the right direction
> for opening an issue?
>
> Thanks... harry
>
>
> -----Original Message-----
> From: Hung Huynh [mailto:[EMAIL PROTECTED] 
> Sent: Monday, April 21, 2008 11:59 AM
> To: solr-user@lucene.apache.org
> Subject: better stemming engine than Porter?
>
> I recall I've read some where in one of the mailing-list archives that
> some
> one had developed a better stemming algo for Solr than the built-in
> Porter
> stemming. Does anyone have link to that stemming module? 
>
> Thanks,
>
> HH 
>
>
>
>
>   



Reply via email to