YeonghyeonKO opened a new pull request, #13814:
URL: https://github.com/apache/lucene/pull/13814

   Issue: https://github.com/apache/lucene/issues/13802
   
   - Many libraries based on Lucene (source: [Elasticsearch](https://github.com/elastic/elasticsearch/blob/main/modules/analysis-common/src/main/java/org/elasticsearch/analysis/common/CommonAnalysisPlugin.java#L511), [OpenSearch](https://github.com/opensearch-project/OpenSearch/blob/main/modules/analysis-common/src/main/java/org/opensearch/analysis/common/EdgeNGramTokenizerFactory.java#L54)) use NGramTokenizer.DEFAULT_MAX_NGRAM_SIZE (`2`) rather than EdgeNGramTokenizer's own default maximum gram size (`1`) when configuring an **EdgeNGramTokenizer**. They use 
   - For this reason, it is not practical to keep EdgeNGramTokenizer's default maximum gram size at `1`, so this PR changes it to `2`.
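
   To illustrate what the change in default means in practice, here is a minimal sketch in plain Java. It does not call Lucene: `EdgeNGramSketch` and `edgeNGrams` are hypothetical names introduced only for this example, and the helper works on `char` units for simplicity. It assumes `minGram >= 1` and only shows which prefixes fall within a given minimum and maximum gram size.

```java
import java.util.ArrayList;
import java.util.List;

public class EdgeNGramSketch {

    // Hypothetical helper, not part of Lucene: returns every prefix of `text`
    // whose length lies in [minGram, maxGram], shortest first.
    // Assumes minGram >= 1; operates on char units for simplicity.
    static List<String> edgeNGrams(String text, int minGram, int maxGram) {
        List<String> grams = new ArrayList<>();
        int upper = Math.min(maxGram, text.length());
        for (int len = minGram; len <= upper; len++) {
            grams.add(text.substring(0, len));
        }
        return grams;
    }

    public static void main(String[] args) {
        // Maximum gram size 1 (the current default described above).
        System.out.println(edgeNGrams("lucene", 1, 1)); // prints [l]
        // Maximum gram size 2 (the default proposed by this PR).
        System.out.println(edgeNGrams("lucene", 1, 2)); // prints [l, lu]
    }
}
```

   With a maximum of `1`, only the first character of the input is emitted. With `2`, the two-character prefix is emitted as well, which matches the maximum that the downstream projects linked above already pass in.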


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

