dungba88 commented on issue #13564: URL: https://github.com/apache/lucene/issues/13564#issuecomment-2475738069
Hi all, I was looking at this idea for some experiments (I don't mean to intrude on the ongoing effort). If the full-sized vectors are exposed through `getFloatVectorValues`, then it seems we could just call `exactSearch` over the results of `approximateSearch`. However, this would inefficiently re-rank `k * oversample * num_segments` candidates instead of just `k * oversample` (though perhaps with some improvement in recall).

I don't know how to get the full-sized vectors without a `LeafReaderContext`, though. Do you have an idea how we could achieve that? Or maybe we could just reduce `oversample` to account for the number of segments, e.g. `oversample = oversample / leafReaderContexts.size()` (although that may give different results)?
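To make the candidate counts above concrete, here is a minimal arithmetic sketch. It is not Lucene code: `RerankBudget` and its methods are hypothetical names, `oversample` is assumed to be an integer multiplier, and the clamp to at least `k` candidates per segment is my own assumption rather than something proposed in this thread.

```java
// Hypothetical helper, not part of Lucene: counts how many full-precision
// vectors each re-ranking strategy would have to score.
final class RerankBudget {

    // Re-rank every oversampled candidate returned by every segment.
    static int perSegmentRerank(int k, int oversample, int numSegments) {
        return k * oversample * numSegments;
    }

    // Merge the approximate hits across segments first, keep the best
    // k * oversample of them, and re-rank only those.
    static int mergedRerank(int k, int oversample) {
        return k * oversample;
    }

    // Split the k * oversample budget evenly across segments (rounding up).
    // Assumption: never ask a segment for fewer than k candidates, because
    // all true top-k neighbours could live in a single segment.
    static int scaledCandidatesPerSegment(int k, int oversample, int numSegments) {
        int total = k * oversample;
        int perSegment = (total + numSegments - 1) / numSegments; // ceiling division
        return Math.max(perSegment, k);
    }

    public static void main(String[] args) {
        int k = 10, oversample = 3, numSegments = 5;
        System.out.println("per-segment re-rank: " + perSegmentRerank(k, oversample, numSegments));
        System.out.println("merged re-rank:      " + mergedRerank(k, oversample));
        System.out.println("scaled per segment:  " + scaledCandidatesPerSegment(k, oversample, numSegments));
    }
}
```

With the clamp in place, dividing `oversample` by the segment count stops helping once `oversample` is smaller than the number of segments, since every segment still has to return at least `k` candidates.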