rmuir commented on PR #14075:
URL: https://github.com/apache/lucene/pull/14075#issuecomment-2549123863

   I'm happy to try to debug this but it might be a few days. Issue may be with 
REP rules in the referenced commit. 
   the way these rules work are:
   ```
   REP 3619
   REP a а
   REP c с
   ... (3617 more times)
   ```
   
   if the count (3619) is incorrect, then parser might try to parse a rule 
(such as `REP ээв$ ээ_вэ`) as a another count, leading to the 
NumberFormatException. maybe the C hunspell parser is lenient, so nobody has 
yet noticed. In my experiences the dictionaries often have such problems, hence 
the testing here.
   
   It could also be some other bug in the code, this is just what it looks like 
to me.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to