[ 
https://issues.apache.org/jira/browse/OPENNLP-1584?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17862521#comment-17862521
 ] 

ASF GitHub Bot commented on OPENNLP-1584:
-----------------------------------------

rzo1 merged PR #628:
URL: https://github.com/apache/opennlp/pull/628




> FeatureGeneratorUtil shall detect German umlauts with dot as 'cp'
> -----------------------------------------------------------------
>
>                 Key: OPENNLP-1584
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-1584
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Name Finder
>    Affects Versions: 2.3.3
>            Reporter: Martin Wiesner
>            Assignee: Martin Wiesner
>            Priority: Minor
>             Fix For: 2.3.4
>
>   Original Estimate: 0.25h
>  Remaining Estimate: 0.25h
>
> German names such as Änne, Özlem, or Ümit should be recognized in their 
> abbreviated short form (Ä., Ö., Ü.) by the FeatureGeneratorUtil class. 
> At the moment, recognition fails because the Pattern "capPeriod" only takes 
> regular capital letters into account. This can be fixed easily.
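
A minimal sketch of the kind of change described above, not the code merged
in PR #628. It assumes the existing pattern is an ASCII-only character class
followed by a period; the class name CapPeriodSketch, both pattern constants,
and the helper methods are hypothetical and chosen only for illustration.

```java
import java.util.regex.Pattern;

// Hypothetical illustration class; not part of OpenNLP.
class CapPeriodSketch {

    // Assumed shape of an ASCII-only pattern: one capital letter A-Z plus a period.
    static final Pattern ASCII_CAP_PERIOD = Pattern.compile("^[A-Z]\\.$");

    // Extended variant that also accepts the German capital umlauts
    // U+00C4 (A with diaeresis), U+00D6 (O with diaeresis), U+00DC (U with diaeresis).
    static final Pattern UMLAUT_CAP_PERIOD =
            Pattern.compile("^[A-Z\u00C4\u00D6\u00DC]\\.$");

    // True if the token is a single ASCII capital letter followed by a period.
    static boolean matchesAscii(String token) {
        return ASCII_CAP_PERIOD.matcher(token).find();
    }

    // True if the token is a single capital letter (A-Z or a capital umlaut)
    // followed by a period.
    static boolean matchesExtended(String token) {
        return UMLAUT_CAP_PERIOD.matcher(token).find();
    }

    public static void main(String[] args) {
        // Tokens: "M.", capital umlaut abbreviations, a lowercase umlaut, a full name.
        String[] tokens = {"M.", "\u00C4.", "\u00D6.", "\u00DC.", "\u00E4.", "\u00C4nne"};
        for (String token : tokens) {
            System.out.println(token + " ascii=" + matchesAscii(token)
                    + " extended=" + matchesExtended(token));
        }
    }
}
```

Listing the three capital umlauts explicitly keeps the change narrow. A
broader alternative would be the Unicode class \p{Lu}, which matches every
uppercase letter and may or may not be desirable for the name finder.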



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
