jpountz opened a new pull request, #14155:
URL: https://github.com/apache/lucene/pull/14155

   Currently, blocks of postings get encoded as a bit set instead of packed 
deltas (FOR) whenever the bit set is more storage-efficient. However, the bit 
set approach is quite more CPU-efficient at search time, so this PR introduces 
a small bias towards the bit set encoding by using it as soon as it's more 
storage-efficient than FOR with the next number of bits per value.
   
   The impact on storage efficiency of the Wikipedia dataset is negligible 
(+0.15% on `.doc` files, while `.doc` files don't dominate storage 
requirements, positions do) while some queries get a good speedup.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to