gf2121 commented on PR #14361: URL: https://github.com/apache/lucene/pull/14361#issuecomment-2729280384
OK, I get the expected result when I force `-XX:UseAVX=3`: the multiple-of-16 variant is faster than the multiple-of-8 variant. It also appears that AVX3 is slower on this chip, which may be why Java disables it by default.

```
Benchmark                                                  Mode  Cnt   Score   Error   Units
Decode21Benchmark.decode21Scalar                          thrpt    5  28.375 ± 0.064  ops/ms
Decode21Benchmark.decode21Scalar:asm                      thrpt          NaN             ---
Decode21Benchmark.decode21Vector                          thrpt    5  41.844 ± 0.182  ops/ms
Decode21Benchmark.decode21Vector:asm                      thrpt          NaN             ---
Decode21Benchmark.decode21VectorFloorToMultipleOf16       thrpt    5  64.471 ± 0.218  ops/ms
Decode21Benchmark.decode21VectorFloorToMultipleOf16:asm   thrpt          NaN             ---
Decode21Benchmark.decode21VectorFloorToMultipleOf8        thrpt    5  39.665 ± 0.120  ops/ms
Decode21Benchmark.decode21VectorFloorToMultipleOf8:asm    thrpt          NaN             ---
```
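For readers unfamiliar with the `FloorToMultipleOf8` / `FloorToMultipleOf16` naming, here is a minimal sketch of what flooring a loop bound to a multiple of a power of two can look like. It is not the code from the PR or the benchmark: the class `FloorToMultipleSketch` and the methods `floorToMultipleOf` and `sumInBlocks` are hypothetical names made up for illustration, and the inner loop is a plain summation standing in for the real decode work. The general idea is that a bound that is a multiple of 16 matches 16 `int` lanes in a 512-bit register, while a multiple of 8 matches 8 `int` lanes in a 256-bit register.

```java
// Illustrative sketch only; names are hypothetical and not taken from Lucene.
final class FloorToMultipleSketch {

    // Rounds a non-negative n down to the nearest multiple of step.
    // Assumes step is a power of two (for example 8 or 16).
    static int floorToMultipleOf(int n, int step) {
        if (step <= 0 || (step & (step - 1)) != 0) {
            throw new IllegalArgumentException("step must be a power of two");
        }
        // -step has all bits set except the low log2(step) bits.
        return n & -step;
    }

    // Processes the array in fixed-size blocks up to the floored bound,
    // then handles the remaining tail elements one at a time.
    static long sumInBlocks(int[] values, int step) {
        int bound = floorToMultipleOf(values.length, step);
        long sum = 0;
        int i = 0;
        for (; i < bound; i += step) {
            // Fixed trip count per block; a placeholder for the real work.
            for (int j = 0; j < step; j++) {
                sum += values[i + j];
            }
        }
        for (; i < values.length; i++) {
            sum += values[i]; // scalar tail
        }
        return sum;
    }

    public static void main(String[] args) {
        System.out.println(floorToMultipleOf(21, 16));
        System.out.println(floorToMultipleOf(37, 8));
    }
}
```

With a length of 21, both a step of 8 and a step of 16 give a block bound of 16, leaving 5 elements for the scalar tail. Whether the JIT actually vectorizes such a loop, and at which width, depends on the JVM flags and the CPU, as the benchmark numbers above suggest.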