Re: [PR] New JMH benchmark method - vdot8s that implement int8 dotProduct in C… [lucene]

via GitHub Tue, 08 Jul 2025 12:23:30 -0700


mikemccand commented on PR #13572:
URL: https://github.com/apache/lucene/pull/13572#issuecomment-3050061148


   Thanks @shubhamvishu -- these results look incredible, if they hold up.  Odd 
that the Panama Vector API (which Lucene uses for `dotProduct`) is so much 
slower on Graviton2.  It's also odd that indexing throughput got a bit slower 
with the change?  Much of the indexing cost for HNSW is actually searching 
(each insert searches the graph built so far for the top K vectors, and adds 
those as edges for this vector's node).
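
   To make the "insert is mostly search" point concrete, here is a minimal 
Python sketch.  It is a toy single-layer graph, not Lucene's HNSW 
implementation: `TinyGraph`, `dot_calls` and the rest are hypothetical names, 
and real HNSW is hierarchical and prunes/diversifies edges.  It only shows 
that every insert runs a top-K search first, so every insert pays for dot 
products.

```python
import heapq


def dot(a, b):
    """Scalar dot product; stands in for the SIMD-accelerated kernel."""
    return sum(x * y for x, y in zip(a, b))


class TinyGraph:
    """Toy single-layer neighbor graph (assumption: not real HNSW)."""

    def __init__(self, k):
        self.k = k              # how many neighbors each insert links to
        self.vectors = []       # node id -> vector
        self.neighbors = []     # node id -> list of neighbor node ids
        self.dot_calls = 0      # counts similarity computations

    def _score(self, query, node):
        self.dot_calls += 1
        return dot(query, self.vectors[node])

    def search(self, query, k, entry=0):
        """Best-first search; returns up to k node ids, best score first."""
        if not self.vectors:
            return []
        visited = {entry}
        score = self._score(query, entry)
        candidates = [(-score, entry)]   # max-heap via negated score
        results = [(score, entry)]       # min-heap holding the best k so far
        while candidates:
            neg_score, node = heapq.heappop(candidates)
            # Stop once the best remaining candidate cannot improve results.
            if len(results) >= k and -neg_score < results[0][0]:
                break
            for nb in self.neighbors[node]:
                if nb in visited:
                    continue
                visited.add(nb)
                score = self._score(query, nb)
                if len(results) < k or score > results[0][0]:
                    heapq.heappush(candidates, (-score, nb))
                    heapq.heappush(results, (score, nb))
                    if len(results) > k:
                        heapq.heappop(results)
        return [node for _, node in sorted(results, reverse=True)]

    def insert(self, vec):
        """Search the graph built so far, then link the new node to the hits."""
        top = self.search(vec, self.k)
        node = len(self.vectors)
        self.vectors.append(vec)
        self.neighbors.append(list(top))
        for nb in top:
            self.neighbors[nb].append(node)   # edges are bidirectional here
        return node
```

   After a handful of `insert` calls and no explicit `search` call, 
`dot_calls` is already non-zero, which is why a faster `dotProduct` would be 
expected to help indexing too.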
   
   Oh, actually, the baseline run had 3 segments, while the later candidate 
run had 2 segments, odd.
   
   I really want a simple Python tool that I can run in my prod env and that 
tells me "yes, Lucene HNSW is using optimal SIMD instructions for your JDK, 
Lucene version, OS, CPU architecture/revision, virtualized environment, etc." 
-- I opened a luceneutil issue to try to make progress on this: 
https://github.com/mikemccand/luceneutil/issues/421 ... maybe the `perf` tool 
can give us counters of how many of which CPU asm instructions are used or so?
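
   As a possible starting point only, here is a rough Python sketch of the 
first step such a tool might take: reporting which SIMD features the kernel 
advertises.  It assumes Linux and `/proc/cpuinfo`; the function names and the 
list of flags are my own picks, not anything from luceneutil.  It does not 
tell you which instructions the JIT actually emits for Lucene, which is the 
harder part of the question.

```python
import platform

# Linux /proc/cpuinfo feature names that seem relevant to vector dot products.
# This list is an assumption for illustration, not an authoritative set.
SIMD_FLAGS_OF_INTEREST = {
    "x86": ["avx2", "avx512f", "avx512_vnni"],
    "arm": ["asimd", "asimddp", "sve"],
}


def parse_cpu_features(cpuinfo_text):
    """Return the feature set from the first 'flags' (x86) or 'Features' (ARM) line."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip().lower() in ("flags", "features"):
            return set(value.split())
    return set()


def simd_report(features):
    """Map each flag of interest to whether the CPU advertises it."""
    return {
        flag: flag in features
        for flags in SIMD_FLAGS_OF_INTEREST.values()
        for flag in flags
    }


def main():
    print("OS:", platform.system(), "arch:", platform.machine())
    try:
        with open("/proc/cpuinfo") as f:
            text = f.read()
    except OSError:
        text = ""  # not Linux, or /proc is unavailable
    for flag, present in sorted(simd_report(parse_cpu_features(text)).items()):
        print(f"{flag}: {'yes' if present else 'no'}")


if __name__ == "__main__":
    main()
```

   A CPU advertising a feature is necessary but not sufficient: whether the 
JDK and Lucene actually use it would still need to be checked separately.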


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


