[Numpy-discussion] Enhancement for AArch64 SVE instruction set

2022-05-19 Thread kawakami . k
Hello, 

I am working on speeding up NumPy with the AArch64 SVE instruction set.
I could not find a numpy implementation for SVE. Is there already a test 
implementation or discussion about SVE support?
___
NumPy-Discussion mailing list -- numpy-discussion@python.org
To unsubscribe send an email to numpy-discussion-le...@python.org
https://mail.python.org/mailman3/lists/numpy-discussion.python.org/
Member address: arch...@mail-archive.com


[Numpy-discussion] Re: Enhancement for AArch64 SVE instruction set

2022-05-19 Thread Matti Picus

On 19/5/22 16:50, kawakam...@fujitsu.com wrote:


Hello,

I am working on speeding up NumPy with the AArch64 SVE instruction set.
I could not find a numpy implementation for SVE. Is there already a test 
implementation or discussion about SVE support?


OpenBLAS apparently has SVE support from 0.3.20 [0], which was merged to 
NumPy yesterday.



So far we have only support for NEON and ASIMD in NumPy, see [1] for a 
description of the way we use intrinsics. Contributions to improve the 
implementations and the documentation would be welcome. We currently use 
travis CI to run our aarch64 tests, if there was a way to get access to 
more advanced machines that also would be good. Do you know of 
commercially available machines with SVE or SVE2 support?



Matti


[0] 
https://github.com/xianyi/OpenBLAS/blob/faf58d2b3ffb20fd334cab080700be564ef74ec2/Changelog.txt#L28


[1] https://numpy.org/devdocs/reference/simd/build-options.html



___
NumPy-Discussion mailing list -- numpy-discussion@python.org
To unsubscribe send an email to numpy-discussion-le...@python.org
https://mail.python.org/mailman3/lists/numpy-discussion.python.org/
Member address: arch...@mail-archive.com