[
https://issues.apache.org/jira/browse/OPENNLP-1584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17862521#comment-17862521
]
ASF GitHub Bot commented on OPENNLP-1584:
-----------------------------------------
rzo1 merged PR #628:
URL: https://github.com/apache/opennlp/pull/628
> FeatureGeneratorUtil shall detect German umlauts with dot as 'cp'
> -----------------------------------------------------------------
>
> Key: OPENNLP-1584
> URL: https://issues.apache.org/jira/browse/OPENNLP-1584
> Project: OpenNLP
> Issue Type: Improvement
> Components: Name Finder
> Affects Versions: 2.3.3
> Reporter: Martin Wiesner
> Assignee: Martin Wiesner
> Priority: Minor
> Fix For: 2.3.4
>
> Original Estimate: 0.25h
> Remaining Estimate: 0.25h
>
> German names such as Änne, Özlem, or Ümit should be recognized in their
> abbreviated form (Ä., Ö., Ü.) by the FeatureGeneratorUtil class.
> Currently, recognition fails because the Pattern "capPeriod" only takes
> regular (non-umlaut) capital letters into account. This can be fixed easily.
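
The difference described above can be sketched as follows. This is a minimal
illustration, not the code merged in PR #628: the class name CapPeriodSketch,
both pattern constants, and the helper methods are hypothetical, and the exact
regular expressions used by FeatureGeneratorUtil before and after the fix are
assumptions, since the issue text does not quote them. The idea shown is to
replace an ASCII-only character class with the Unicode uppercase category
\p{Lu}, which java.util.regex supports.

```java
import java.util.regex.Pattern;

public class CapPeriodSketch {

    // Assumed shape of the old check: one ASCII capital letter plus a period.
    static final Pattern ASCII_CAP_PERIOD = Pattern.compile("^[A-Z]\\.$");

    // One possible widening: \p{Lu} matches any Unicode uppercase letter,
    // which includes the precomposed German umlauts.
    static final Pattern UNICODE_CAP_PERIOD = Pattern.compile("^\\p{Lu}\\.$");

    static boolean matchesAscii(String token) {
        return ASCII_CAP_PERIOD.matcher(token).find();
    }

    static boolean matchesUnicode(String token) {
        return UNICODE_CAP_PERIOD.matcher(token).find();
    }

    public static void main(String[] args) {
        // Unicode escapes keep this source file ASCII-only:
        // \u00C4 = A-umlaut, \u00D6 = O-umlaut, \u00DC = U-umlaut,
        // \u00E4 = lowercase a-umlaut.
        String[] tokens = {"A.", "\u00C4.", "\u00D6.", "\u00DC.", "\u00E4.", "\u00C4nne"};
        for (String token : tokens) {
            System.out.println(token
                + " ascii=" + matchesAscii(token)
                + " unicode=" + matchesUnicode(token));
        }
    }
}
```

In this sketch the ASCII pattern accepts only "A.", while the Unicode pattern
also accepts the three umlaut abbreviations. Both reject the lowercase form and
the full name. A side effect of \p{Lu} is that it also accepts uppercase
letters from other scripts. Whether that is desirable for the 'cp' feature is a
design decision this sketch does not settle.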
--
This message was sent by Atlassian Jira
(v8.20.10#820010)