Re: [PR] Add TurboQuant rotation-based vector quantization codec to sandbox [lucene]

via GitHub Tue, 31 Mar 2026 13:48:40 -0700


xande commented on PR #15903:
URL: https://github.com/apache/lucene/pull/15903#issuecomment-4165138163


   > ```
   > recall  latency(ms)  netCPU  avgCpuCount     nDoc  topK  fanout  maxConn  
beamWidth  quantized  index(s)  index_docs/s  force_merge(s)  num_segments  
index_size(MB)  vec_disk(MB)  vec_RAM(MB)  indexType
   >  0.875        0.858   0.855        0.996  1000000    10     100       32   
     250    -4 bits    203.69       4909.49          168.11             1       
  3349.44      3311.157      381.470       HNSW
   > ```
   > 
   > Thats the last run @mccullocht did for Lucene's OSQ technique (1M vectors, 
would need to do the exact same data set for Apples to apples).
   > 
   > I realize performance apples to apples will take way more work (panama 
vector APIs, etc.). I am more concerned about recall, and I am not sure TQ will 
provide any significant recall improvement itself.
   > 
   > The main thing I think that OSQ might be missing is some random rotation 
for non-guassian component vectors (which are an anomaly for the modern 
models). But, adding that to the existing OSQ for Lucene would be a snap 
(though careful thought would be needed as that could be a significant 
performance burden with very little benefit for many users).
   > 
   > It would be good to just do a "flat" index to remove any HNSW noise.
   
   Interestingly enough, at Amazon we were putting a lot of bets on OSQ, though 
on internal datasets we did not see meaningful recall improvement vs non-
   OSQ - low enough for us not to use it. I am planning to run more benchmarks 
to compare and see how TQ compares.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [PR] Add TurboQuant rotation-based vector quantization codec to sandbox [lucene]

Reply via email to