Hello,

According to the wiki (http://wiki.apache.org/solr/LanguageAnalysis),
the light stemmers for French (solr.FrenchLightStemFilterFactory and
solr.FrenchMinimalStemFilterFactory) are only available as of Solr 3.1.

Is there a way to make them work with 1.4.1?

- - -

Additionally, there is an "official" list of inflected word forms for
the French language, produced by a government agency (this being
France...). It is called "Morphalou":
http://www.cnrtl.fr/lexiques/morphalou/ and it contains over 540,000
inflected forms.

It's a 162 MB XML file; it would not be very hard to transform it into
the format used for Solr synonyms files, but the result would be a
rather huge text file (probably smaller than the original XML, but
still around 100 MB). How large can a synonyms file be? Does the limit
depend on the Java heap size?
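
To make the transformation concrete, here is a rough sketch in Python of
what I have in mind. It streams the XML and writes one explicit mapping
line per entry in the Solr synonyms format ("form1, form2 => lemma"). The
element names (lexicalEntry, formSet, lemmatizedForm, inflectedForm,
orthography) are my assumption about the Morphalou layout and should be
checked against the real file; the function name is just illustrative.

```python
import io
import xml.etree.ElementTree as ET


def morphalou_to_synonyms(xml_source, out):
    """Write one 'form1, form2 => lemma' line per lexical entry.

    xml_source is a file name or a binary file object; out is a text
    file object. Element names below are ASSUMED, not verified against
    the actual Morphalou schema. Returns the number of lines written.
    """
    written = 0
    # iterparse streams the document, so the whole file is never
    # held in memory as a single tree.
    for _event, elem in ET.iterparse(xml_source, events=("end",)):
        if elem.tag != "lexicalEntry":
            continue
        lemma = (elem.findtext("formSet/lemmatizedForm/orthography") or "").strip()
        forms = []
        for node in elem.iterfind("formSet/inflectedForm/orthography"):
            form = (node.text or "").strip()
            # Skip empty forms, duplicates, and forms identical to the lemma.
            if form and form != lemma and form not in forms:
                forms.append(form)
        if lemma and forms:
            out.write(", ".join(forms) + " => " + lemma + "\n")
            written += 1
        # Drop the children of the processed entry to keep memory low.
        elem.clear()
    return written


if __name__ == "__main__":
    # Tiny hand-made sample in the assumed layout, not real Morphalou data.
    sample = (
        b"<lexicon><lexicalEntry><formSet>"
        b"<lemmatizedForm><orthography>cheval</orthography></lemmatizedForm>"
        b"<inflectedForm><orthography>cheval</orthography></inflectedForm>"
        b"<inflectedForm><orthography>chevaux</orthography></inflectedForm>"
        b"</formSet></lexicalEntry></lexicon>"
    )
    buffer = io.StringIO()
    morphalou_to_synonyms(io.BytesIO(sample), buffer)
    print(buffer.getvalue(), end="")
```

Forms that contain a comma or "=>" would presumably need escaping or
filtering before being written, which this sketch does not handle.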

Or is there a better way to use such a list than a synonyms file?

Thanks,
Regards,
EB
