[PR] New int4 scalar quantization [lucene]

via GitHub Thu, 21 Mar 2024 13:05:34 -0700


benwtrent opened a new pull request, #13197:
URL: https://github.com/apache/lucene/pull/13197


   So, there are a handful of new and interesting things that this PR adds:
   
    - Confidence interval optimizations, unlocked even smaller quantization 
bytes
    - New int4 codec which is 8x smaller than float32 with exceptionally good 
recall & faster search
   
   
   But, its not complete yet. While I was finishing up the code here, it came 
to my mind, why is this its own codec? Why shouldn't we add a new option to 
`Lucene99ScalarQuantizedVectorsFormat` (really it would be 
`Lucene911ScalarQuantizedVectorsFormat`) that allows the quantization `bits` to 
be set. I know we don't want a ton of configuration for each codec, but to me 
this seemed reasonable.
   
   What do you think @jpountz?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[PR] New int4 scalar quantization [lucene]

Reply via email to