harenlin commented on code in PR #13470: URL: https://github.com/apache/lucene/pull/13470#discussion_r1632391085
########## lucene/core/src/java/org/apache/lucene/search/TopDocs.java: ########## @@ -350,4 +354,38 @@ private static TopDocs mergeAux( return new TopFieldDocs(totalHits, hits, sort.getSort()); } } + + /** Reciprocal Rank Fusion method. */ + public static TopDocs rrf(int TopN, int k, TopDocs[] hits) { + Map<Integer, Float> rrfScore = new HashMap<>(); + long minHits = Long.MAX_VALUE; + for (TopDocs topDoc : hits) { + minHits = Math.min(minHits, topDoc.totalHits.value); + Map<Integer, Float> scoreMap = new HashMap<>(); + for (ScoreDoc scoreDoc : topDoc.scoreDocs) { + scoreMap.put(scoreDoc.doc, scoreDoc.score); + } + + List<Map.Entry<Integer, Float>> scoreList = new ArrayList<>(scoreMap.entrySet()); + scoreList.sort(Map.Entry.comparingByValue()); Review Comment: Oops! My bad. I think we got something wrong right here. The for loop traversal ```for (ScoreDoc scoreDoc : topDoc.scoreDocs)``` is wrong, we should actually traverse the sorted results, i.e., scoreList, to add the ranking result to ```rrfScore```. P.S. For this part, however, I think we should determine the implementation of combining ```ScoreDoc.doc``` and ```ScoreDoc.shardIndex``` together first. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org