Re: [PR] Use SPI instead of Enum for VectorSimilarityFunctions [lucene]

via GitHub Fri, 24 May 2024 01:01:12 -0700


Pulkitg64 commented on PR #13401:
URL: https://github.com/apache/lucene/pull/13401#issuecomment-2128847145


   > I did kind of change before, and the added complexity and backwards 
compatibility concerns just didn't seem warranted. This is why the decision to 
do the scorer pluggability was added to the codecs.
   > 
   > How do you communicate to codecs the similarity kind?
   > 
   > For example, Vector quantization needs to know if the similarity is 
cosine. Do we do a "cosine".equals(similarity.name())? That is very fragile. 
What if some one does an "optimized cosine"? Name conflict or they just cant 
use vector quantization.
   > 
   > What if someone adds an SPI similarity that should be normalized before 
being quantized?
   > 
   > I don't see a way of doing this without leaking this internal dependency 
(normalization being required) into the similarity SPI, which is weird to me.
   > 
   > I just don't think this is feasible.
   
   Thank you @benwtrent for the feedback and sharing your PR. I didn't know, 
this attempt was already made before. The main motivation behind this change 
was to get rid of ENUM implementation which is tightly coupled to field-info. 
This has caused inconvenience in deprecating the COSINE function from the list. 
Not sure what's the best approach then. Will take a look at your PR and 
comments to understand more about this.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Re: [PR] Use SPI instead of Enum for VectorSimilarityFunctions [lucene]

Reply via email to