benwtrent commented on issue #13350: URL: https://github.com/apache/lucene/issues/13350#issuecomment-2110737920
@naveentatikonda I used the dataset you linked. I simply downloaded the file. Ground truth is just the brute force nearest neighbors. I used the "test" set as the queries (10k of them) and "train" (1M) for the docs. Computing the true NN. Yes, I force merged. I imagine if I didn't, recall would actually be higher. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org