[ 
https://issues.apache.org/jira/browse/LUCENE-9410?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17217670#comment-17217670
 ] 

Mike Drob commented on LUCENE-9410:
-----------------------------------

Thank you for your contribution!

For the French part of this, is that the same as 
https://github.com/snowballstem/snowball/issues/91 ?

I don't see a comparable German case reported; maybe you can open a new issue 
with Snowball in the same repo.


Once they release an update, we can consume it. Typically we don't make changes 
on top of what Snowball provides.

> German/French stemmers fail for common forms maux, gegrüßt, grüßend, 
> schlummert
> -------------------------------------------------------------------------------
>
>                 Key: LUCENE-9410
>                 URL: https://issues.apache.org/jira/browse/LUCENE-9410
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: modules/analysis
>    Affects Versions: 8.5
>         Environment: Elasticsearch 7.7.1 running on cloud.elastic.co
>            Reporter: Ben Kazez
>            Priority: Critical
>              Labels: french, german, stemmer, stemming
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> I'm using Lucene via Elasticsearch 7.7.1. German and French stemmers (either 
> via the Snowball analyzer, or the "light" or "heavy" stemming analyzers) are 
> failing to understand some common forms:
> French:
>   - "maux" (plural) should match "mal" (singular) but instead "maux" is 
> unchanged
> German:
>   - "schlummert" should match "schlummern" (infinitive) but instead is 
> unchanged
>   - "grüßend" should match "grüßen" (infinitive) but instead yields "grussend"
>   - "gegrüßt"  should match "grüßen" (infinitive) but instead yields 
> "gegrusst"
> The Elasticsearch folks 
> [said|https://discuss.elastic.co/t/better-french-and-german-stemming/236283] 
> I should file a bug with Lucene.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org
