msfroh commented on issue #14029: URL: https://github.com/apache/lucene/issues/14029#issuecomment-2563247728
I was looking into this (trying to upgrade to 2.5.1) and initially ran into some failing test cases. It looks like they were all related to the switch of the default POSTagFormat from Penn to UD. I was able to get all the tests passing by changing this line: https://github.com/apache/lucene/blob/9a88d8ad25d2568955b26b09b3f18b3c8ad01ba2/lucene/analysis/opennlp/src/java/org/apache/lucene/analysis/opennlp/tools/NLPPOSTaggerOp.java#L33 to ``` tagger = new POSTaggerME(model, POSTagFormat.PENN); ``` (I assume that we should support UD-style tags eventually too, but this at least keeps the existing functionality the same.) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org