rmuir commented on pull request #15:
URL: https://github.com/apache/lucene/pull/15#issuecomment-800745487


   I looked, I don't think the normalization optimizations here are safe. Many 
rulesets assume a certain form for a reason, because the rules work that way. 
For example, Hangul-Latin works by decomposing into Jamo. Rules are only there 
for Jamo and then it composes back. If you remove the normalization rules, 
Hangul won't get transliterated.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to