[ https://issues.apache.org/jira/browse/LUCENE-9410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17217670#comment-17217670 ]
Mike Drob commented on LUCENE-9410:
-----------------------------------

Thank you for your contribution!

For the French part of this, is that the same as https://github.com/snowballstem/snowball/issues/91 ?

I don't see a comparable German case reported; maybe you can open a new issue with Snowball at the same repo. Once they release an update, we can consume it. Typically we don't make changes on top of what Snowball provides.

> German/French stemmers fail for common forms maux, gegrüßt, grüßend, schlummert
> -------------------------------------------------------------------------------
>
>                 Key: LUCENE-9410
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9410
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: modules/analysis
>    Affects Versions: 8.5
>         Environment: Elasticsearch 7.7.1 running on cloud.elastic.co
>            Reporter: Ben Kazez
>            Priority: Critical
>              Labels: french, german, stemmer, stemming
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> I'm using Lucene via Elasticsearch 7.7.1. German and French stemmers (either
> via the Snowball analyzer, or the "light" or "heavy" stemming analyzers) are
> failing to understand some common forms:
> French:
>  - "maux" (plural) should match "mal" (singular) but instead "maux" is
> unchanged
> German:
>  - "schlummert" should match "schlummern" (infinitive) but instead is
> unchanged
>  - "grüßend" should match "grüßen" (infinitive) but instead yields "grussend"
>  - "gegrüßt" should match "grüßen" (infinitive) but instead yields
> "gegrusst"
> The Elasticsearch folks
> [said|https://discuss.elastic.co/t/better-french-and-german-stemming/236283]
> I should file a bug with Lucene.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)
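
For anyone filing the German cases upstream, the "should match" expectations in the quoted report can be written down as a small conflation check. This is only a sketch: `unconflated`, `identity_stem`, and `REPORTED_PAIRS` are hypothetical names, and `identity_stem` is a stand-in that does no stemming at all. The idea is to pass in a real stemmer (for example, a thin wrapper around whichever Snowball binding is at hand) as the `stem` callable.

```python
from typing import Callable, Iterable, List, Tuple

# (inflected form, base form) pairs copied from the issue description.
REPORTED_PAIRS: List[Tuple[str, str]] = [
    ("maux", "mal"),
    ("schlummert", "schlummern"),
    ("grüßend", "grüßen"),
    ("gegrüßt", "grüßen"),
]


def unconflated(
    stem: Callable[[str], str],
    pairs: Iterable[Tuple[str, str]],
) -> List[Tuple[str, str, str, str]]:
    """Return (form, base, stem(form), stem(base)) for each pair whose stems differ."""
    failures = []
    for form, base in pairs:
        stemmed_form, stemmed_base = stem(form), stem(base)
        if stemmed_form != stemmed_base:
            failures.append((form, base, stemmed_form, stemmed_base))
    return failures


def identity_stem(token: str) -> str:
    # Stand-in only, NOT a real stemmer: returns the token unchanged so the
    # harness can be exercised without any stemming library installed.
    return token


if __name__ == "__main__":
    for form, base, s_form, s_base in unconflated(identity_stem, REPORTED_PAIRS):
        print(f"{form!r} -> {s_form!r} does not match {base!r} -> {s_base!r}")
```

Comparing the stem of the inflected form against the stem of the base form, rather than against the base form itself, reflects how matching works at search time: two words conflate when their stems are equal, whatever that stem happens to be.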