kaivalnp commented on PR #15285: URL: https://github.com/apache/lucene/pull/15285#issuecomment-3371733825
[`VectorUtilBenchmark`](https://github.com/apache/lucene/blob/da8c674bf85855d4b56dc70ad44d207b437f3aca/lucene/benchmark-jmh/src/java/org/apache/lucene/benchmark/jmh/VectorUtilBenchmark.java#L46) results:

`main`
```
Benchmark                                                       (size)   Mode  Cnt   Score   Error  Units
VectorUtilBenchmark.binaryCosineScalar                            1024  thrpt   15   0.841 ± 0.001  ops/us
VectorUtilBenchmark.binaryCosineVector                            1024  thrpt   15   4.778 ± 0.012  ops/us
VectorUtilBenchmark.binaryDotProductScalar                        1024  thrpt   15   2.289 ± 0.012  ops/us
VectorUtilBenchmark.binaryDotProductUint8Scalar                   1024  thrpt   15   2.307 ± 0.010  ops/us
VectorUtilBenchmark.binaryDotProductUint8Vector                   1024  thrpt   15   8.040 ± 0.001  ops/us
VectorUtilBenchmark.binaryDotProductVector                        1024  thrpt   15   8.040 ± 0.001  ops/us
VectorUtilBenchmark.binaryHalfByteDotProductBothPackedScalar      1024  thrpt   15   2.368 ± 0.001  ops/us
VectorUtilBenchmark.binaryHalfByteDotProductBothPackedVector      1024  thrpt   15  11.652 ± 0.104  ops/us
VectorUtilBenchmark.binaryHalfByteDotProductScalar                1024  thrpt   15   2.378 ± 0.002  ops/us
VectorUtilBenchmark.binaryHalfByteDotProductSinglePackedScalar    1024  thrpt   15   2.446 ± 0.009  ops/us
VectorUtilBenchmark.binaryHalfByteDotProductSinglePackedVector    1024  thrpt   15   2.627 ± 0.013  ops/us
VectorUtilBenchmark.binaryHalfByteDotProductVector                1024  thrpt   15  20.677 ± 0.160  ops/us
VectorUtilBenchmark.binaryHalfByteSquareBothPackedScalar          1024  thrpt   15   1.642 ± 0.001  ops/us
VectorUtilBenchmark.binaryHalfByteSquareBothPackedVector          1024  thrpt   15  12.614 ± 0.010  ops/us
VectorUtilBenchmark.binaryHalfByteSquareScalar                    1024  thrpt   15   2.465 ± 0.006  ops/us
VectorUtilBenchmark.binaryHalfByteSquareSinglePackedScalar        1024  thrpt   15   2.022 ± 0.001  ops/us
VectorUtilBenchmark.binaryHalfByteSquareSinglePackedVector        1024  thrpt   15   2.590 ± 0.012  ops/us
VectorUtilBenchmark.binaryHalfByteSquareVector                    1024  thrpt   15  18.526 ± 0.012  ops/us
VectorUtilBenchmark.binarySquareScalar                            1024  thrpt   15   2.431 ± 0.007  ops/us
VectorUtilBenchmark.binarySquareUint8Scalar                       1024  thrpt   15   2.422 ± 0.025  ops/us
VectorUtilBenchmark.binarySquareUint8Vector                       1024  thrpt   15   6.709 ± 0.002  ops/us
VectorUtilBenchmark.binarySquareVector                            1024  thrpt   15   6.710 ± 0.001  ops/us
VectorUtilBenchmark.floatCosineScalar                             1024  thrpt   15   1.419 ± 0.001  ops/us
VectorUtilBenchmark.floatCosineVector                             1024  thrpt   75   8.913 ± 0.013  ops/us
VectorUtilBenchmark.floatDotProductScalar                         1024  thrpt   15   3.734 ± 0.004  ops/us
VectorUtilBenchmark.floatDotProductVector                         1024  thrpt   75  12.561 ± 0.346  ops/us
VectorUtilBenchmark.floatSquareScalar                             1024  thrpt   15   3.181 ± 0.013  ops/us
VectorUtilBenchmark.floatSquareVector                             1024  thrpt   75  12.370 ± 0.398  ops/us
VectorUtilBenchmark.l2Normalize                                   1024  thrpt   15   3.016 ± 0.002  ops/us
VectorUtilBenchmark.l2NormalizeVector                             1024  thrpt   75  12.349 ± 0.719  ops/us
```

This PR
```
Benchmark                                                       (size)   Mode  Cnt   Score   Error  Units
VectorUtilBenchmark.binaryCosineScalar                            1024  thrpt   15   0.841 ± 0.001  ops/us
VectorUtilBenchmark.binaryCosineVector                            1024  thrpt   15   4.860 ± 0.007  ops/us
VectorUtilBenchmark.binaryDotProductScalar                        1024  thrpt   15   2.298 ± 0.014  ops/us
VectorUtilBenchmark.binaryDotProductUint8Scalar                   1024  thrpt   15   2.288 ± 0.024  ops/us
VectorUtilBenchmark.binaryDotProductUint8Vector                   1024  thrpt   15   8.040 ± 0.001  ops/us
VectorUtilBenchmark.binaryDotProductVector                        1024  thrpt   15   8.039 ± 0.001  ops/us
VectorUtilBenchmark.binaryHalfByteDotProductBothPackedScalar      1024  thrpt   15   2.376 ± 0.003  ops/us
VectorUtilBenchmark.binaryHalfByteDotProductBothPackedVector      1024  thrpt   15  11.498 ± 0.286  ops/us
VectorUtilBenchmark.binaryHalfByteDotProductScalar                1024  thrpt   15   2.376 ± 0.002  ops/us
VectorUtilBenchmark.binaryHalfByteDotProductSinglePackedScalar    1024  thrpt   15   2.449 ± 0.007  ops/us
VectorUtilBenchmark.binaryHalfByteDotProductSinglePackedVector    1024  thrpt   15   2.627 ± 0.009  ops/us
VectorUtilBenchmark.binaryHalfByteDotProductVector                1024  thrpt   15  20.785 ± 0.009  ops/us
VectorUtilBenchmark.binaryHalfByteSquareBothPackedScalar          1024  thrpt   15   1.696 ± 0.001  ops/us
VectorUtilBenchmark.binaryHalfByteSquareBothPackedVector          1024  thrpt   15  12.562 ± 0.023  ops/us
VectorUtilBenchmark.binaryHalfByteSquareScalar                    1024  thrpt   15   2.474 ± 0.010  ops/us
VectorUtilBenchmark.binaryHalfByteSquareSinglePackedScalar        1024  thrpt   15   2.021 ± 0.006  ops/us
VectorUtilBenchmark.binaryHalfByteSquareSinglePackedVector        1024  thrpt   15   2.609 ± 0.015  ops/us
VectorUtilBenchmark.binaryHalfByteSquareVector                    1024  thrpt   15  18.487 ± 0.075  ops/us
VectorUtilBenchmark.binarySquareScalar                            1024  thrpt   15   2.413 ± 0.021  ops/us
VectorUtilBenchmark.binarySquareUint8Scalar                       1024  thrpt   15   2.420 ± 0.017  ops/us
VectorUtilBenchmark.binarySquareUint8Vector                       1024  thrpt   15   6.709 ± 0.002  ops/us
VectorUtilBenchmark.binarySquareVector                            1024  thrpt   15   6.709 ± 0.002  ops/us
VectorUtilBenchmark.floatCosineScalar                             1024  thrpt   15   1.415 ± 0.002  ops/us
VectorUtilBenchmark.floatCosineVector                             1024  thrpt   75   8.646 ± 0.080  ops/us
VectorUtilBenchmark.floatDotProductScalar                         1024  thrpt   15   3.733 ± 0.003  ops/us
VectorUtilBenchmark.floatDotProductVector                         1024  thrpt   75  12.249 ± 0.046  ops/us
VectorUtilBenchmark.floatSquareScalar                             1024  thrpt   15   3.171 ± 0.008  ops/us
VectorUtilBenchmark.floatSquareVector                             1024  thrpt   75  12.483 ± 0.104  ops/us
VectorUtilBenchmark.l2Normalize                                   1024  thrpt   15   3.017 ± 0.002  ops/us
VectorUtilBenchmark.l2NormalizeVector                             1024  thrpt   75  12.207 ± 0.764  ops/us
```

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
