rmuir commented on PR #14075: URL: https://github.com/apache/lucene/pull/14075#issuecomment-2549123863
I'm happy to try to debug this but it might be a few days. Issue may be with REP rules in the referenced commit. the way these rules work are: ``` REP 3619 REP a а REP c с ... (3617 more times) ``` if the count (3619) is incorrect, then parser might try to parse a rule (such as `REP ээв$ ээ_вэ`) as a another count, leading to the NumberFormatException. maybe the C hunspell parser is lenient, so nobody has yet noticed. In my experiences the dictionaries often have such problems, hence the testing here. It could also be some other bug in the code, this is just what it looks like to me. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org