benwtrent commented on PR #14792:
URL: https://github.com/apache/lucene/pull/14792#issuecomment-2977917618

   > Returning the vectors in search results
   
   Why would you need to do this?
   
   Generally, I would assume that any access to the vector would be "Give me 
what I gave you", and the best we can do with quantized vectors is the 
dequantized vector. 
   
   I don't fully understand how serializing a read-only segment that is missing 
files (e.g. missing the "vec" file) would work, but the format should do the 
right thing: see that the file isn't there and provide an approximate view of 
the floating point vectors.
   
   > Vector counting for metrics
   
   I don't understand what this means really. Just counting how many vectors 
there are? This should be doable via the `FloatVectorValues` interface.
   
   > but need to switch to quantizedVectorValues since searchers won't have 
float vectors anymore and we are okay in accepting the accuracy loss from 
float-to-byte quantization.
   
   Again, I think we should do the nice thing: de-quantize the vectors as the 
user asks for them. 
   
   It should fully satisfy the `FloatVectorValues` API, de-quantizing the 
vectors and indicating that the vector returned is an approximation.
   
   Getting access to the raw quantized bytes is basically useless without all 
the other parameters that were used to quantize the vector.
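
To illustrate the last point, here is a minimal, self-contained sketch of a plain min/max scalar quantizer. It is an assumption for illustration only and is not Lucene's actual quantization scheme or API; the class `ScalarQuantSketch` and its `quantize`/`dequantize` methods are hypothetical names. It shows that the same bytes decode to a close approximation with the original parameters and to unrelated values with any other parameters.

```java
import java.util.Arrays;

// Hypothetical illustration: simple min/max scalar quantization to 8 bits.
// This is NOT Lucene's quantizer; names and scheme are assumptions.
public final class ScalarQuantSketch {
  private static final int LEVELS = 255; // highest unsigned 8-bit bucket

  // Map each float in [min, max] to an unsigned 8-bit bucket stored in a byte.
  public static byte[] quantize(float[] v, float min, float max) {
    float scale = LEVELS / (max - min);
    byte[] out = new byte[v.length];
    for (int i = 0; i < v.length; i++) {
      float clamped = Math.max(min, Math.min(max, v[i]));
      out[i] = (byte) Math.round((clamped - min) * scale);
    }
    return out;
  }

  // Recover an approximation; only meaningful with the same min/max
  // that were used when quantizing.
  public static float[] dequantize(byte[] q, float min, float max) {
    float step = (max - min) / LEVELS;
    float[] out = new float[q.length];
    for (int i = 0; i < q.length; i++) {
      out[i] = min + (q[i] & 0xFF) * step; // & 0xFF reads the byte as unsigned
    }
    return out;
  }

  public static void main(String[] args) {
    float[] original = {0.12f, -0.5f, 0.9f};
    byte[] q = quantize(original, -1f, 1f);
    // Same parameters: each value is within half a quantization step.
    System.out.println(Arrays.toString(dequantize(q, -1f, 1f)));
    // Different parameters: the same bytes decode to unrelated values.
    System.out.println(Arrays.toString(dequantize(q, 0f, 10f)));
  }
}
```

A format that hides this behind a `FloatVectorValues`-style view keeps the parameters and the bytes together, so callers only ever see the approximate floats.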


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

