Re: [I] Significant drop in recall for 8 bit Scalar Quantizer [lucene]

via GitHub Tue, 02 Jul 2024 10:12:30 -0700


benwtrent commented on issue #13519:
URL: https://github.com/apache/lucene/issues/13519#issuecomment-2203125856


   The only way to find out is to test it. I don't see how your suggestion 
would work without trying it out.
   
   Its better to think about what it would be in the unsigned `byte` case. A 
scale from `0-255` is what `int8` quantization naturally results in. Then to 
make it signed, you have a change to  `-127-127`.
   
   I don't immediately see how to adjust the correction to handle the switch 
from `0 * 255` to `-127 * 127`.
   
   @naveentatikonda any solution should support max-inner-product, any testing 
with normalized vectors & dot-product might result in false assurance. If you 
are using Lucene util, I would recommend using some older cohere embeddings 
(with their model from before CohereV3), and using the `-metric` `mip`.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Re: [I] Significant drop in recall for 8 bit Scalar Quantizer [lucene]

Reply via email to