uschindler commented on code in PR #13308: URL: https://github.com/apache/lucene/pull/13308#discussion_r1572243002
##########
lucene/core/src/java/org/apache/lucene/index/VectorSimilarityFunction.java:
##########
@@ -61,24 +60,6 @@ public float compare(byte[] v1, byte[] v2) {
     }
   },

-  /**
-   * Cosine similarity. NOTE: the preferred way to perform cosine similarity is to normalize all
-   * vectors to unit length, and instead use {@link VectorSimilarityFunction#DOT_PRODUCT}. You
-   * should only use this function if you need to preserve the original vectors and cannot normalize
-   * them in advance. The similarity score is normalised to assure it is positive.
-   */
-  COSINE {
-    @Override
-    public float compare(float[] v1, float[] v2) {
-      return Math.max((1 + cosine(v1, v2)) / 2, 0);
-    }
-
-    @Override
-    public float compare(byte[] v1, byte[] v2) {
-      return (1 + cosine(v1, v2)) / 2;
-    }
-  },

Review Comment:
   I am a bit confused why this still happens. @ChrisHegarty already submitted a PR to have a separate mapping from file format's internal ordinal numbers to enum constants. Maybe this was not yet applied, let me check it out.
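   For illustration only, a separate mapping from on-disk ordinal numbers to enum constants could look roughly like the sketch below. This is not the code from the PR mentioned above: the class name `SimilarityOrdinals`, the stand-in enum `Similarity`, the method names, and the choice to leave slot 2 empty for a removed constant are all assumptions made for the example. The point is only that the persisted number comes from an explicit table rather than from `Enum.ordinal()`, so removing or reordering a constant does not silently renumber the others.

```java
// Hypothetical sketch: decouple persisted ordinals from enum declaration order.
final class SimilarityOrdinals {

  // Stand-in for an enum such as VectorSimilarityFunction (names assumed for the example).
  enum Similarity { EUCLIDEAN, DOT_PRODUCT, MAXIMUM_INNER_PRODUCT }

  // Index in this array is the number written to the file format.
  // A null entry marks a slot whose constant no longer exists (assumed here to be COSINE).
  private static final Similarity[] BY_DISK_ORDINAL = {
    Similarity.EUCLIDEAN,             // 0
    Similarity.DOT_PRODUCT,           // 1
    null,                             // 2: retired slot, never reused
    Similarity.MAXIMUM_INNER_PRODUCT  // 3
  };

  private SimilarityOrdinals() {}

  // Reads: translate a persisted ordinal back to an enum constant.
  static Similarity fromDiskOrdinal(int ord) {
    if (ord < 0 || ord >= BY_DISK_ORDINAL.length || BY_DISK_ORDINAL[ord] == null) {
      throw new IllegalArgumentException("unknown or unsupported similarity ordinal: " + ord);
    }
    return BY_DISK_ORDINAL[ord];
  }

  // Writes: translate an enum constant to its stable persisted ordinal.
  static int toDiskOrdinal(Similarity s) {
    for (int i = 0; i < BY_DISK_ORDINAL.length; i++) {
      if (BY_DISK_ORDINAL[i] == s) {
        return i;
      }
    }
    throw new IllegalArgumentException("no on-disk ordinal for " + s);
  }
}
```

   With a table like this, `MAXIMUM_INNER_PRODUCT` keeps ordinal 3 on disk even though its `ordinal()` in the stand-in enum is 2, and reading ordinal 2 fails loudly instead of resolving to the wrong function.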