[ https://issues.apache.org/jira/browse/LUCENE-10382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17479649#comment-17479649 ]
Michael Sokolov edited comment on LUCENE-10382 at 1/20/22, 8:20 PM:
--------------------------------------------------------------------

> I'm a little fuzzy on the cost computation being discussed. Is this about the
> decision to do the ANN or fully materialized KNN?

Yes. I wouldn't worry about that at first, though. Maybe we can do it in three steps, something like this:

1. Implement a Query-based filter, always using the HNSW search that we have today. It would have to be marked with some serious caveats about the potential performance risk, but we should make progress somehow without insisting on the full implementation at once. Perhaps we can just document the risk and mark it as experimental in the javadoc?
2. Implement a full KNN fallback with a fixed cutoff (based on Query cost?).
3. Implement an adaptive cost computation.

Also, maybe we're overthinking step 3, and it's either not really needed or simpler than we think?

> Allow KnnVectorQuery to operate over a subset of liveDocs
> ---------------------------------------------------------
>
>                 Key: LUCENE-10382
>                 URL: https://issues.apache.org/jira/browse/LUCENE-10382
>             Project: Lucene - Core
>          Issue Type: Improvement
>   Affects Versions: 9.0
>            Reporter: Joel Bernstein
>            Priority: Major
>
> Currently the KnnVectorQuery selects the top K vectors from all live docs.
> This ticket will change the interface to make it possible for the top K
> vectors to be selected from a subset of the live docs.
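
To make step 2 of the proposed plan (a full KNN fallback with a fixed cutoff) a bit more concrete, here is a minimal, standalone Java sketch. It is not Lucene code and uses no Lucene APIs: the class name `FilteredKnnSketch`, the `Strategy` enum, the `choose` and `exactTopK` methods, and the cutoff value are all hypothetical, and a `java.util.BitSet` stands in for whatever set of accepted docs the filter Query would produce. Squared Euclidean distance is likewise an assumption made for illustration.

```java
import java.util.ArrayList;
import java.util.BitSet;
import java.util.Comparator;
import java.util.List;
import java.util.PriorityQueue;

/** Hypothetical illustration only; none of these names are Lucene APIs. */
final class FilteredKnnSketch {

  /** Which search to run for a filtered top-k vector query. */
  public enum Strategy { APPROXIMATE_GRAPH, EXACT_SCAN }

  /**
   * Fixed-cutoff decision: if the filter is expected to match at most
   * {@code cutoff} docs, scoring every one of them exactly is assumed to be
   * cheaper than (and at least as accurate as) a graph search.
   */
  public static Strategy choose(long filterCost, long cutoff) {
    return filterCost <= cutoff ? Strategy.EXACT_SCAN : Strategy.APPROXIMATE_GRAPH;
  }

  /**
   * Exact top-k over the accepted docs only, nearest first.
   * Assumes every set bit in {@code accepted} is a valid index into {@code vectors}.
   */
  public static List<Integer> exactTopK(float[][] vectors, BitSet accepted, float[] query, int k) {
    // Max-heap on distance: the farthest of the current best k sits at the head.
    PriorityQueue<double[]> heap =
        new PriorityQueue<>(Comparator.comparingDouble((double[] e) -> e[0]).reversed());
    for (int doc = accepted.nextSetBit(0); doc >= 0; doc = accepted.nextSetBit(doc + 1)) {
      heap.add(new double[] {squaredDistance(vectors[doc], query), doc});
      if (heap.size() > k) {
        heap.poll(); // evict the farthest candidate
      }
    }
    // The heap drains farthest-first, so prepend to get nearest-first order.
    List<Integer> result = new ArrayList<>();
    while (!heap.isEmpty()) {
      result.add(0, (int) heap.poll()[1]);
    }
    return result;
  }

  static double squaredDistance(float[] a, float[] b) {
    double sum = 0;
    for (int i = 0; i < a.length; i++) {
      double d = a[i] - b[i];
      sum += d * d;
    }
    return sum;
  }

  public static void main(String[] args) {
    float[][] vectors = {{0f, 0f}, {1f, 0f}, {0f, 2f}, {5f, 5f}};
    BitSet accepted = new BitSet();
    accepted.set(1, 4); // the filter accepts docs 1, 2 and 3
    long cutoff = 10;   // arbitrary illustrative cutoff
    if (choose(accepted.cardinality(), cutoff) == Strategy.EXACT_SCAN) {
      System.out.println(exactTopK(vectors, accepted, new float[] {0f, 0f}, 2));
    }
  }
}
```

In this sketch the cutoff is a plain constant; step 3 of the plan would replace it with something derived from the data, which is deliberately left out here.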