[GitHub] [lucene] benwtrent commented on issue #10665: Benchmark KNN search with ann-benchmarks [LUCENE-9625]

GitBox Mon, 31 Oct 2022 07:07:54 -0700


benwtrent commented on issue #10665:
URL: https://github.com/apache/lucene/issues/10665#issuecomment-1297146958


   I opened a PR for ann-benchmarks: 
https://github.com/erikbern/ann-benchmarks/pull/315
   
   I tested PyLucene locally, comparing it to @msokolov's "batch" methodology 
(writing to disk and spinning up a Java process). PyLucene provided faster 
numbers in both Batch and iterative on my m1 mac. This indicates that the 
overhead experienced is acceptable for now. 
   
   There are improvements we can make to PyLucene (native numpy support is a 
big one). But, it seems prudent to get measurements recorded in ann-benchmarks 
and iteratively improve our python over time :).
   
   Also, its for Lucene 9.1. I am going to see about getting a newer PyLucene 
version built hitting the latest release. We can update our benchmark 
implementation to hit the new version when its available.
   
   @msokolov @jtibshirani IDK if y'all want to review the ann-benchmarks 
implementation or not. Let me know what you think.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

[GitHub] [lucene] benwtrent commented on issue #10665: Benchmark KNN search with ann-benchmarks [LUCENE-9625]

Reply via email to