mingshl commented on PR #14729:
URL: https://github.com/apache/lucene/pull/14729#issuecomment-3007050715

   This is an exciting feature!! I think it's great idea to create a 
`LateInteractionField` that can multi-vector store values in the documents, but 
I have a question in search, when I try to look at the Function score query, it 
seems that for search request, it first runs a knn query, then we fetch the top 
N documents for maxSim reranker. 
   
   In this case, we need to use a single vector for knn query, and then use 
multi-vectors for reranking. 
   
   If we want to use late interaction model for search, after we get the 
multi-vectors from the model and try to construct the function score query, 
   
   first, we need a way to pool the multi-vector into single vector, and put 
into knn query
   second, we would put the multi-vector into the 
lateInteractionFloatRerankQuery 
   
   I can see this way, we can save a lot of computing for too many MaxSim 
calculation, when the array of vector is big. 
   
   I am thinking if that worth a new query type that can handle both pooling 
and rerank, basically the above two steps together. 
   
   
https://github.com/apache/lucene/blob/8dc05f47e0c0bc1c66d530fa6525b96e536a39d0/lucene/queries/src/test/org/apache/lucene/queries/function/TestFunctionScoreQuery.java#L447-L452
 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to