[ https://issues.apache.org/jira/browse/LUCENE-10239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17445559#comment-17445559 ]
Robert Muir commented on LUCENE-10239:
--------------------------------------

I'd like to do a "minimal" upgrade for this issue, and follow up separately with issues such as incrementing Unicode versions and using the new emoji properties.

> upgrade jflex (1.7.0 -> 1.8.2)
> ------------------------------
>
>                 Key: LUCENE-10239
>                 URL: https://issues.apache.org/jira/browse/LUCENE-10239
>             Project: Lucene - Core
>          Issue Type: Task
>            Reporter: Robert Muir
>            Priority: Major
>
> When reviewing LUCENE-10238, I noticed we still had Unicode 9.0 data specified for our jflex tokenizers.
> According to the changelog, I see some key benefits from upgrading to jflex 1.8.2:
> * Unicode 9 -> Unicode 12.1
> * remove our custom emoji regeneration via ICU, as jflex supports emoji properties directly now.
> * Less RAM at runtime for users (two-stage tables): https://github.com/jflex-de/jflex/pull/697
>
> https://www.jflex.de/changelog.html