kotman12 opened a new pull request, #11802:
URL: https://github.com/apache/lucene/pull/11802

   Fix sentence boundary detection bug in case of repeating tokens (i.e. while 
using OpenNLP analysis chain in conjunction with a KeywordRepeatFilter) by 
keeping track of the sentence index and looking ahead one token. Move inner 
sentence iteration to a component to be shared by the sentence-aware OpenNLP 
filters.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to