Re: umlaut index ö == o == oe Poss ible?

Stephen Weiss Tue, 16 Dec 2008 16:28:09 -0800

I believe the german porter stemmer should handle this. I haven'tused it with SOLR but I've used it with other projects, and basically,when the word is parsed, the umlauts and also accented vowels areconverted to plain vowels. I guess with SOLR you usesolr.SnowballPorterFilterFactory:


http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#head-b80fb581f4e078142c694014f1a8f60c0935e080


with the German option (like in their example).

You probably want to apply this both at index and query time.

--
Steve

On Dec 16, 2008, at 6:02 PM, Julian Davchev wrote:

Hi,
I am just going through

http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters andmaillist

archive
but somehow can't find the solution. Is it possible that I treat
'möchten' , 'mochten' and  'moechten' the same way.
Of course not hardcoding this but rather work for any umlaut.
Cheers

Re: umlaut index ö == o == oe Poss ible?

Reply via email to