jaisonbi opened a new pull request #2213:
URL: https://github.com/apache/lucene-solr/pull/2213


   # Description
   
   Elasticsearch keyword field uses SortedSet DocValues. In our applications, 
“keyword” is the most frequently used field type.
   LUCENE-7081 has done prefix-compression for docvalues terms dict. We can do 
better by adding LZ4 compression to current  prefix-compression. In one of our 
application, the dvd files were ~41% smaller with this change(from 1.95 GB to 
1.15 GB).
   
   This feature is only for the high-cardinality fields. 
   
   # Tests
   
   See Jira LUCENE-9663 for details.
   
   # Checklist
   
   Please review the following and check all that apply:
   
   - [x] I have reviewed the guidelines for [How to 
Contribute](https://wiki.apache.org/solr/HowToContribute) and my code conforms 
to the standards described there to the best of my ability.
   - [x] I have created a Jira issue and added the issue ID to my pull request 
title.
   - [x] I have given Solr maintainers 
[access](https://help.github.com/en/articles/allowing-changes-to-a-pull-request-branch-created-from-a-fork)
 to contribute to my PR branch. (optional but recommended)
   - [x] I have developed this patch against the `master` branch.
   - [x] I have run `./gradlew check`.
   - [x] I have added tests for my changes.
   - [ ] I have added documentation for the [Ref 
Guide](https://github.com/apache/lucene-solr/tree/master/solr/solr-ref-guide) 
(for Solr changes only).
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to