Try the SnowballPorterFilterFactory described here:
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters
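Use the German2 variant: it treats ae, oe and ue as ä, ö and ü (ue is
left alone after q), maps ß to ss, and finally strips the umlauts, so
ä and ae both end up as a, ö and oe as o, and so on. More details: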
http://snowball.tartarus.org/algorithms/german2/stemmer.html
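As a rough sketch of how this could look in schema.xml (the type name
text_de and the choice of tokenizer and lowercase filter are my
assumptions for illustration, adjust them to your schema):

```xml
<!-- Hypothetical field type; only the Snowball filter line is the point here. -->
<fieldType name="text_de" class="solr.TextField" positionIncrementGap="100">
  <analyzer>
    <!-- Assumed tokenizer; any tokenizer that suits your data will do. -->
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <!-- Lowercase before stemming, since the stemmer expects lowercase input. -->
    <filter class="solr.LowerCaseFilterFactory"/>
    <!-- German2 folds ae/oe/ue and the umlauts to the same stem. -->
    <filter class="solr.SnowballPorterFilterFactory" language="German2"/>
  </analyzer>
</fieldType>
```

Because a single analyzer element applies at both index and query time,
a query for "Mueller" and one for "Müller" should be reduced to the same
term as the indexed text.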
Every document in Solr can have any number of fields. Several fields
can share the same source but use different field types, so the same
text is handled differently in each (stored as is, analyzed in
different ways, ...). Use copyField in your schema.xml to feed your data
into multiple fields. At search time you decide which fields to search
on (usually the analyzed ones) and which to retrieve when getting the
document back.
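A minimal sketch of such a setup, assuming a hypothetical source field
named title and an analyzed copy named title_de, whose type text_de is
a German-stemmed text type defined elsewhere in your schema (all three
names are made up for this example):

```xml
<!-- Original value: returned to the client unchanged, not searched. -->
<field name="title" type="string" indexed="false" stored="true"/>

<!-- Analyzed copy: searched, but never returned. -->
<field name="title_de" type="text_de" indexed="true" stored="false"/>

<!-- Feed the raw title into the analyzed field at index time. -->
<copyField source="title" dest="title_de"/>
```

With fields like these you would search on the analyzed field and ask
for the stored one, e.g. q=title_de:mueller&fl=title, and get the
unmodified title back.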
Tom
Matthias Eireiner wrote:
Dear list,
I have two questions regarding German special characters (umlauts).
Is there an analyzer which automatically converts all German special
characters to their expanded form, such as ü to ue and ä to ae?
I would also like the search to always run against the expanded data,
while the results return the original, unmodified data.
Does Lucene's GermanAnalyzer do this job? I ran across it, but I could
not figure out from the documentation whether it does or not.
Thanks a lot in advance.
Matthias