rmuir commented on issue #12621:
URL: https://github.com/apache/lucene/issues/12621#issuecomment-1747026111

   Also their suggested replacement of 3 instructions for the `VPDPBUSD` is: 
   
   > Likewise, for 8-bit values, three instructions are needed - VPMADDUBSW 
which is used to multiply two 8-bit pairs and add them together, followed by a 
VPMADDWD with the value 1 in order to simply up-convert the 16-bit values to 
32-bit values, followed by the VPADDD instruction which adds the result to an 
accumulator. 
   
   I can tell you this is also not what is happening. We have no ability to 
write AVX-512-specific code and currently have to support ARM, machines with 
only AVX-256, etc. 
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org
For additional commands, e-mail: issues-h...@lucene.apache.org

Reply via email to