Hi. Did you get this working?

I think this is a problem on many German languages. Using "brute force
dictionary splitting" of compound words will generate to many words.

There is some words about using myspell dictionaries in the tsearch
project here:
http://www.sai.msu.su/~megera/wiki/Tsearch_V2_compound_words

In this approach a word must be marked with a special flag to be able
to participate in a compound word. Did anyone tried to implement a
this in solr/lucene? Is there a good reason to not do this?

To get this functionality I think there must be implemented a new
CompoundWordTokenFilter which “eat” ispell dictionaries. Has anyone
looked at this?

Best regards

Kjell.

Reply via email to