[ 
https://issues.apache.org/jira/browse/LUCENE-10382?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17479649#comment-17479649
 ] 

Michael Sokolov edited comment on LUCENE-10382 at 1/20/22, 8:20 PM:
--------------------------------------------------------------------

> I'm a little fuzzy on the cost computation being discussed. Is this about the 
> decision to do the ANN or fully materialized KNN?

Yes. I wouldn't worry about that at first though. Maybe we can do three steps 
something like this:
 # implement Query-based filter, always using HNSW search that we have today. 
It would have to be marked with some serious caveats about potential 
performance risk, but we should make progress somehow without insisting on the 
full implementation at once. Perhaps we can just document the risk, mark as 
experimental in javadoc?
 # implement full KNN fallback with a fixed cutoff (based on Query cost?)
 # implement an adaptive cost computation

also, maybe we're overthinking 3 and it's not really needed/simpler than we 
think?


was (Author: sokolov):
> I'm a little fuzzy on the cost computation being discussed. Is this about the 
> decision to do the ANN or fully materialized KNN?

Yes. I wouldn't worry about that at first though. Maybe we can do three steps 
something like this:
 # implement Query-based filter, always using HNSW search that we have today. 
It would have to be marked with some serious caveats about potential 
performance risk, but we should make progress somehow without insisting on the 
full implementation at once. Perhaps we can just document the risk, mark as 
experimental in javadoc?
 # implement full KNN fallback with a fixed cutoff (based on Query cost?)
 # implement an adaptive cost computation

> Allow KnnVectorQuery to operate over a subset of liveDocs
> ---------------------------------------------------------
>
>                 Key: LUCENE-10382
>                 URL: https://issues.apache.org/jira/browse/LUCENE-10382
>             Project: Lucene - Core
>          Issue Type: Improvement
>    Affects Versions: 9.0
>            Reporter: Joel Bernstein
>            Priority: Major
>
> Currently the KnnVectorQuery selects the top K vectors from all live docs.  
> This ticket will change the interface to make it possible for the top K 
> vectors to be selected from a subset of the live docs.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to