jpountz opened a new pull request, #838:
URL: https://github.com/apache/lucene/pull/838

   Doc values terms dictionaries keep the first term of each block uncompressed
   so that they can somewhat efficiently perform binary searches across blocks.
   Suffixes of the other 63 terms are compressed together using LZ4 to leverage
   redundancy across suffixes. This change improves compression a bit by using
   the first (uncompressed) term of each block as a dictionary when compressing
   suffixes of the 63 other terms. This helps with compressing the first few
   suffixes when there's not much context yet that can be leveraged to find
   duplicates.
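
   To illustrate why a preset dictionary helps with the first few suffixes, here
   is a minimal sketch. It is not the code from this pull request: it swaps in
   the JDK's `java.util.zip.Deflater` preset-dictionary support as a stand-in
   for Lucene's LZ4 implementation, and the class name, method names, and sample
   terms are all hypothetical.

```java
import java.io.ByteArrayOutputStream;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.zip.DataFormatException;
import java.util.zip.Deflater;
import java.util.zip.Inflater;

// Hypothetical sketch: DEFLATE with a preset dictionary stands in for LZ4.
public class BlockDictionarySketch {

  // Made-up sorted terms; the first one plays the role of the uncompressed block head.
  public static final String[] TERMS = {
    "user0001@lucene.apache.org/profile",
    "user0002@lucene.apache.org/profile",
    "user0003@lucene.apache.org/profile",
    "user0004@lucene.apache.org/profile"
  };

  // Concatenates, for every term but the first, the bytes that follow its
  // common prefix with the previous term.
  public static byte[] concatenatedSuffixes(String[] terms) {
    ByteArrayOutputStream out = new ByteArrayOutputStream();
    for (int i = 1; i < terms.length; i++) {
      byte[] previous = terms[i - 1].getBytes(StandardCharsets.UTF_8);
      byte[] term = terms[i].getBytes(StandardCharsets.UTF_8);
      int limit = Math.min(previous.length, term.length);
      int shared = 0;
      while (shared < limit && previous[shared] == term[shared]) {
        shared++;
      }
      out.write(term, shared, term.length - shared);
    }
    return out.toByteArray();
  }

  // Compresses input; a non-null dictionary is installed as a preset dictionary.
  public static byte[] compress(byte[] input, byte[] dictionary) {
    Deflater deflater = new Deflater();
    try {
      if (dictionary != null) {
        deflater.setDictionary(dictionary);
      }
      deflater.setInput(input);
      deflater.finish();
      ByteArrayOutputStream out = new ByteArrayOutputStream();
      byte[] buffer = new byte[256];
      while (!deflater.finished()) {
        int n = deflater.deflate(buffer);
        out.write(buffer, 0, n);
      }
      return out.toByteArray();
    } finally {
      deflater.end();
    }
  }

  // Decompresses, supplying the dictionary when the stream asks for one.
  public static byte[] decompress(byte[] compressed, byte[] dictionary, int originalLength) {
    Inflater inflater = new Inflater();
    try {
      inflater.setInput(compressed);
      byte[] out = new byte[originalLength];
      int written = 0;
      while (written < originalLength) {
        int n = inflater.inflate(out, written, originalLength - written);
        if (n > 0) {
          written += n;
        } else if (inflater.needsDictionary()) {
          if (dictionary == null) {
            throw new IllegalStateException("stream requires a dictionary");
          }
          inflater.setDictionary(dictionary);
        } else {
          break; // finished early or input exhausted
        }
      }
      return Arrays.copyOf(out, written);
    } catch (DataFormatException e) {
      throw new IllegalStateException(e);
    } finally {
      inflater.end();
    }
  }

  public static void main(String[] args) {
    byte[] dictionary = TERMS[0].getBytes(StandardCharsets.UTF_8);
    byte[] suffixes = concatenatedSuffixes(TERMS);
    System.out.println("raw suffix bytes:   " + suffixes.length);
    System.out.println("without dictionary: " + compress(suffixes, null).length);
    System.out.println("with dictionary:    " + compress(suffixes, dictionary).length);
  }
}
```

   In this sketch the first suffix has nothing earlier in the stream to refer
   back to, so without a dictionary it is emitted mostly as literals. With the
   first term installed as the dictionary, the part of that suffix that also
   occurs in the first term can be encoded as a back-reference right away.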
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

