rmuir commented on PR #12311:
URL: https://github.com/apache/lucene/pull/12311#issuecomment-1556224675

   i made the benchmarks easier to run with something like this:
   ```
   git clone https://github.com/rmuir/vectorbench
   cd vectorbench
   mvn verify
   java -jar target/vectorbench.jar
   ```
   
   I can confirm everything works on aarch64 and i am experimenting with the 
unrolling. definitely if we remove the unrolling it gets way slower.
   
   so i tried to unroll again (4x instead of 2x), it is only a slight 
improvement in performance on my skylake:
   ```
   Benchmark                             (size)   Mode  Cnt   Score   Error   
Units
   DotProductBenchmark.dotProductNew       1024  thrpt    5   9.997 ± 0.999  
ops/us
   DotProductBenchmark.dotProductNewNew    1024  thrpt    5  11.285 ± 0.161  
ops/us
   DotProductBenchmark.dotProductOld       1024  thrpt    5   2.024 ± 0.028  
ops/us
   ```
   
   But on the aarch64 mac additionall unrolling basically doubles throughput 
again (7.785 -> 14.912). I am experimenting more with this.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to