stefanvodita commented on code in PR #12172:
URL: https://github.com/apache/lucene/pull/12172#discussion_r1537416729
##########
lucene/analysis/common/src/resources/org/apache/lucene/analysis/ro/stopwords.txt:
##########
@@ -190,27 +207,34 @@ sale
 sau
 său
 se
+și
 şi
 sînt
 sîntem
+sînteți

Review Comment:
   I think your point is that the `î` spellings for "a fi" were used officially at some point, while the `â` spellings never were. It's tricky to talk about what is correct in a language, though. Have people written text using the `â` spellings? Of course, switching `î` and `â` is a common pattern, which is "correct" for other words. Should we handle stemming for the `â` spellings then? I don't know; maybe it depends on how frequent the `â` spelling is in practice. I don't feel strongly about this one way or the other.