[ 
https://issues.apache.org/jira/browse/LUCENE-10243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17446161#comment-17446161
 ] 

Robert Muir commented on LUCENE-10243:
--------------------------------------

I tried this out on top of the LUCENE-10239 branch and also ran the Perl 
scripts to regenerate the tests for Unicode 12.1, but some conformance tests 
fail.

I haven't debugged it yet, but my guess is that UAX#29 changed between 
Unicode 9 and 12.1. I will look into it.

> increase unicode versions of tokenizers to unicode 12.1
> -------------------------------------------------------
>
>                 Key: LUCENE-10243
>                 URL: https://issues.apache.org/jira/browse/LUCENE-10243
>             Project: Lucene - Core
>          Issue Type: Task
>            Reporter: Robert Muir
>            Priority: Major
>
> Followup from LUCENE-10239
> Bump the Unicode version of these tokenizers from Unicode 9 to 12.1, which is 
> the most recent supported by the jflex release.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org
