[ 
https://issues.apache.org/jira/browse/OPENNLP-1599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17868374#comment-17868374
 ] 

ASF GitHub Bot commented on OPENNLP-1599:
-----------------------------------------

rzo1 opened a new pull request, #641:
URL: https://github.com/apache/opennlp/pull/641

   Thank you for contributing to Apache OpenNLP.
   
   In order to streamline the review of the contribution we ask you
   to ensure the following steps have been taken:
   
   ### For all changes:
   - [x] Is there a JIRA ticket associated with this PR? Is it referenced in the commit message?
   
   - [x] Does your PR title start with OPENNLP-XXXX where XXXX is the JIRA number you are trying to resolve? Pay particular attention to the hyphen "-" character.
   
   - [x] Has your PR been rebased against the latest commit within the target branch (typically main)?
   
   - [x] Is your initial contribution a single, squashed commit?
   
   ### For code changes:
   - [x] Have you ensured that the full suite of tests is executed via `mvn clean install` at the root opennlp folder?
   - [x] Have you written or updated unit tests to verify your changes?
   - [ ] If adding new dependencies to the code, are these dependencies licensed in a way that is compatible for inclusion under [ASF 2.0](http://www.apache.org/legal/resolved.html#category-a)?
   - [ ] If applicable, have you updated the LICENSE file, including the main LICENSE file in opennlp folder?
   - [ ] If applicable, have you updated the NOTICE file, including the main NOTICE file found in opennlp folder?
   
   ### For documentation related changes:
   - [ ] Have you ensured that format looks appropriate for the output in which it is rendered?
   
   ### Note:
   
   see https://github.com/apache/opennlp-models/pull/9
   
   The rationale is to avoid something like
   
   ```java
   try {
       final ClassPathModelLoader loader = new ClassPathModelLoader();
       final Set<ClassPathModelEntry> models = new ClassgraphModelFinder().findModels(false);

       for (ClassPathModelEntry entry : models) {
           final ClassPathModel model = loader.load(entry);
           if (model.getModelName().contains(element.getLang())) {
               final TokenizerME tokenizer = new TokenizerME(
                       new TokenizerModel(new ByteArrayInputStream(model.model())));
               tokenizers.put(element.getLang(), tokenizer);
               return tokenizer;
           }
       }
   } catch (IOException e) {
       throw new PipelineExecutionException(e);
   }
   ```
   
   and instead use `model.getLanguage()` directly. From a user's perspective 
this is more reliable, because the model's name might not actually contain the 
language the model was trained for.
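
As a minimal sketch of the lookup this enables, assuming the language is exposed under the property key `model.language` named in the issue title: the example below uses only `java.util.Properties` and does not touch the OpenNLP API. The class `ModelLanguageLookup`, its methods, and the `model.name` values are hypothetical and exist only for illustration.

```java
import java.util.List;
import java.util.Optional;
import java.util.Properties;

// Hypothetical helper, not part of OpenNLP: selects model metadata by language.
final class ModelLanguageLookup {

  // Property key proposed in OPENNLP-1599; assumed to be present in the model's properties.
  static final String LANGUAGE_KEY = "model.language";

  // Returns the declared language, or empty if the key is missing or blank.
  static Optional<String> languageOf(Properties props) {
    return Optional.ofNullable(props.getProperty(LANGUAGE_KEY))
        .map(String::trim)
        .filter(s -> !s.isEmpty());
  }

  // Returns the first candidate whose declared language equals the requested one (case-insensitive).
  static Optional<Properties> findByLanguage(List<Properties> candidates, String lang) {
    return candidates.stream()
        .filter(p -> languageOf(p).filter(lang::equalsIgnoreCase).isPresent())
        .findFirst();
  }

  public static void main(String[] args) {
    // Illustrative metadata: the names deliberately carry no language hint.
    Properties first = new Properties();
    first.setProperty("model.name", "tokenizer-a");
    first.setProperty(LANGUAGE_KEY, "de");

    Properties second = new Properties();
    second.setProperty("model.name", "tokenizer-b");
    second.setProperty(LANGUAGE_KEY, "en");

    String match = findByLanguage(List.of(first, second), "de")
        .map(p -> p.getProperty("model.name"))
        .orElse("none");
    System.out.println(match);
  }
}
```

Matching on a dedicated key rather than on a substring of the name avoids false positives when a short language code happens to appear inside an unrelated model name.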




> Add "model.language" to opennlp-models
> --------------------------------------
>
>                 Key: OPENNLP-1599
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-1599
>             Project: OpenNLP
>          Issue Type: Improvement
>          Components: Models
>            Reporter: Richard Zowalla
>            Assignee: Richard Zowalla
>            Priority: Major
>             Fix For: 2.4.1
>
>
> I just integrated opennlp-models via Maven / classpath in my application. 
> While doing so, I noticed that there is no simple way to get the language of 
> the related model without loading it and checking the model's name.
>  
> Therefore, it would be neat to add a new property key "model.language" to 
> retrieve the bootstrapped language identifier to ease user consumption.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
