benwtrent commented on PR #14160: URL: https://github.com/apache/lucene/pull/14160#issuecomment-2634245107
@benchaplin I found another bug. The recall numbers were indeed way too good to be true: I was returning duplicate documents 🤦. Recall looked great because the results contained the same valid document many times. I have refactored and fixed multiple things, and I am rerunning locally. I will replicate your findings for correlation. Is there anything else needed to replicate your findings other than your code in the lucene util PR?
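To illustrate how duplicate hits can inflate recall, here is a minimal Python sketch. It is not the code from this PR or from the lucene util PR; the function names and the toy document IDs are hypothetical, and it assumes recall is measured as the fraction of the true top-k neighbors present in the first k results.

```python
def naive_recall_at_k(returned_ids, true_ids, k):
    """Counts every returned hit that is a true neighbor, duplicates included."""
    truth = set(true_ids[:k])
    return sum(1 for doc in returned_ids[:k] if doc in truth) / k


def recall_at_k(returned_ids, true_ids, k):
    """Counts each true neighbor at most once by deduplicating the results."""
    truth = set(true_ids[:k])
    return len(set(returned_ids[:k]) & truth) / k


# Hypothetical data: the true top-4 neighbors, and a result list
# in which document 1 is returned three times.
true_ids = [1, 2, 3, 4]
returned_ids = [1, 1, 1, 2]

naive = naive_recall_at_k(returned_ids, true_ids, 4)  # 1.0, inflated by duplicates
dedup = recall_at_k(returned_ids, true_ids, 4)        # 0.5, only docs 1 and 2 found
```

A cheap sanity check for this kind of bug is to assert that the returned IDs for each query are unique before computing recall.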