mgodwan opened a new issue, #12986:
URL: https://github.com/apache/lucene/issues/12986

   ### Description
   
   Today, 
[BloomFilteringPostingsFormat](https://github.com/apache/lucene/blob/main/lucene/codecs/src/java/org/apache/lucene/codecs/bloom/BloomFilteringPostingsFormat.java)
 in Lucene is marked experimental.
   
   Based on our analysis of the the data structure in OpenSearch for the 
Primary Key field using the `nyc_taxis` workload [[See 
Issue](https://github.com/opensearch-project/OpenSearch/issues/4489#issuecomment-1724998489)],
 we have found that it proves very useful for indexing performance, and also 
certain term queries/Get Document calls on the PK.
   
   We want to introduce this as an opt-in feature in OpenSearch for customers 
so that they can take advantage of the performance improvements, and wanted 
inputs from the community on the following:
   
   1. Why is the BloomFilteringPostingsFormat in experimental status?
   2. Is it possible to contribute and mark it as a core feature with support 
for backward compatibility in Lucene?
   
   We've done few changes in OpenSearch to support  an off-heap implementation 
and introduce certain knobs which may prove useful for Lucene customers and 
would like to see if they can be contributed to Lucene as well 
[opensearch-project/OpenSearch/pull/11027].
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to