uschindler commented on PR #12311: URL: https://github.com/apache/lucene/pull/12311#issuecomment-1557153092
Hi, > With 256 bit vectors it is fast using ByteVector.SPECIES_64, ShortVector.SPECIES_128, and IntVector.SPECIES_256 But for ARM which only has 128-bit vectors, the generic code using only "SPECIES_PREFERRED" isn't as fast as it should be: almost 2x but not 4x like on avx-256. Maybe because it needs twice as much iterations all creating tons of instances on heap (until escape analysis kicks in?). -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@lucene.apache.org For additional commands, e-mail: issues-h...@lucene.apache.org