Re: [I] Add refinement of quantized vector scores with fp distance calculations [lucene]

via GitHub Thu, 14 Nov 2024 00:51:18 -0800


dungba88 commented on issue #13564:
URL: https://github.com/apache/lucene/issues/13564#issuecomment-2475738069


   Hi all, I was looking at this idea for some experiments of my own (I don't 
mean to intrude on the ongoing effort).
   
   If the full-sized vectors are exposed through `getFloatVectorValues`, then it 
seems we can just call exactSearch over the results of approximateSearch, but 
this would inefficiently re-rank `k * oversample * num_segments` candidates 
instead of just `k * oversample` (perhaps with some improvement in recall).
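   
   To make the cost concrete, here is a minimal sketch in plain Java of 
re-ranking the merged per-segment candidates with exact scores. It uses no 
Lucene types: `Candidate`, `Scored`, `rerank` and `dotProduct` are hypothetical 
names for illustration, and dot product is only an assumed similarity. Every 
candidate from every segment gets one exact score, so the inner loop runs 
`k * oversample * num_segments` times when each segment returns 
`k * oversample` hits.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

final class RerankSketch {
  /** Hypothetical holder for a doc id and its full-precision vector (not a Lucene type). */
  static final class Candidate {
    final int docId;
    final float[] vector;

    Candidate(int docId, float[] vector) {
      this.docId = docId;
      this.vector = vector;
    }
  }

  /** Hypothetical holder for a doc id and its exact score. */
  static final class Scored {
    final int docId;
    final float score;

    Scored(int docId, float score) {
      this.docId = docId;
      this.score = score;
    }
  }

  /** Exact similarity; dot product is an assumption made for this sketch. */
  static float dotProduct(float[] a, float[] b) {
    float sum = 0f;
    for (int i = 0; i < a.length; i++) {
      sum += a[i] * b[i];
    }
    return sum;
  }

  /** Scores every candidate of every segment exactly and returns the k best doc ids. */
  static List<Integer> rerank(List<List<Candidate>> perSegment, float[] query, int k) {
    List<Scored> scored = new ArrayList<>();
    for (List<Candidate> segment : perSegment) {
      for (Candidate c : segment) {
        // One full-precision distance calculation per candidate, per segment.
        scored.add(new Scored(c.docId, dotProduct(query, c.vector)));
      }
    }
    // Highest exact score first.
    scored.sort(Comparator.comparingDouble((Scored s) -> s.score).reversed());
    List<Integer> top = new ArrayList<>();
    for (int i = 0; i < Math.min(k, scored.size()); i++) {
      top.add(scored.get(i).docId);
    }
    return top;
  }
}
```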
   
   However, I don't know how to get the full-sized vectors without a 
`LeafReaderContext`. Do you have an idea how we can achieve that?
   
   Or maybe we can just reduce oversample to account for the number of 
segments, e.g. `oversample = oversample / leafReaderContexts.size()` (which 
may differ)?
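   
   As a rough illustration of that last option, the sketch below splits the 
oversample budget evenly across segments so that the total number of 
candidates stays close to `k * oversample`. The class and method names are 
hypothetical, and rounding up with a floor of one candidate per segment is my 
own assumption. One caveat I'd expect: if the best matches are concentrated in 
a single segment, a smaller per-segment budget could miss some of them.

```java
final class OversampleBudget {
  /**
   * Candidates to collect from each segment when the oversample factor is
   * divided by the number of segments (hypothetical helper, not a Lucene API).
   */
  static int perSegmentCandidates(int k, float oversample, int numSegments) {
    if (numSegments <= 0) {
      throw new IllegalArgumentException("numSegments must be positive");
    }
    double perSegment = (double) k * oversample / numSegments;
    // Round up, and always ask each segment for at least one candidate.
    return Math.max(1, (int) Math.ceil(perSegment));
  }

  /** Total exact re-scoring work across all segments under this budget. */
  static int totalCandidates(int k, float oversample, int numSegments) {
    return perSegmentCandidates(k, oversample, numSegments) * numSegments;
  }
}
```

   For example, with `k = 10`, `oversample = 3` and 4 segments, each segment 
would contribute 8 candidates (32 in total) instead of 30 each (120 in total).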


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


