https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88464
--- Comment #8 from Moritz Kreutzer ---
Thanks for the input and for confirming that "for conditional ones (both
MASK_LOAD and MASK_STORE) the support for the cases when using a mask register
rather than a vector register with mask either hasn't
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88464
--- Comment #11 from Moritz Kreutzer ---
Jakub, I can confirm it's working for masked gathers (we have a similar pattern
elsewhere in our code) with the latest trunk. Thanks for looking at the
scatters as well!
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88464
--- Comment #16 from Moritz Kreutzer ---
I can confirm the fix from my side.
Thanks again!
Priority: P3
Component: tree-optimization
Assignee: unassigned at gcc dot gnu.org
Reporter: moritz.kreutzer at siemens dot com
Target Milestone: ---
We have a simple loop to select values based on a condition from one array and
store the selected values contiguously in a
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91198
--- Comment #2 from Moritz Kreutzer ---
Sure, I should have said that I'm talking about auto vectorization. I'm aware
that we could use intrinsics, but of course that'll always be our last resort
for obvious reasons.
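For context, the selection loop described in the report can be pictured as the scalar sketch below. The function name `compress_positive`, the `float` element type, and the `> 0` condition are assumptions made for illustration; the report's actual source is not shown here. The output index `j` advances only when the condition holds, which is typically what makes this pattern hard to auto-vectorize.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical example: copy every element of `in` that satisfies a
   condition (assumed here: > 0.0f) to consecutive slots of `out`.
   Returns the number of elements written. */
size_t compress_positive(const float *in, size_t insize, float *out)
{
    assert(in != NULL && out != NULL);
    size_t j = 0;                 /* next free slot in `out` */
    for (size_t i = 0; i < insize; ++i) {
        if (in[i] > 0.0f) {
            out[j++] = in[i];     /* store contiguously, advance only on a hit */
        }
    }
    return j;
}
```

The caller is expected to provide an `out` buffer with room for at least `insize` elements.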
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91198
--- Comment #4 from Moritz Kreutzer ---
> How would a vectorized version with the intrinsic look like?
Something along the lines of (assuming insize is a multiple of 16):
__mmask16 mask;
++
Assignee: unassigned at gcc dot gnu.org
Reporter: moritz.kreutzer at siemens dot com
Target Milestone: ---
Created attachment 46899
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=46899&action=edit
Preprocessed source and backtrace
Hi,
we are getting an ICE with the lates
ormal
Priority: P3
Component: c
Assignee: unassigned at gcc dot gnu.org
Reporter: moritz.kreutzer at siemens dot com
Target Milestone: ---
We have a loop in which we are scattering data to an array of length "n" where we
can assure no write conflicts only within
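A scatter loop of the kind described might look like the sketch below. The names `scatter_add`, `idx`, and `val` are hypothetical, and the accumulate operation is an assumption; the report's actual loop is not shown here. Because `idx` may contain repeated entries, the compiler generally cannot prove on its own that the iterations are independent.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical example: add val[i] into out[idx[i]] for each i.
   `out` has length n; every index is assumed to be in range. */
void scatter_add(double *out, size_t n,
                 const size_t *idx, const double *val, size_t count)
{
    for (size_t i = 0; i < count; ++i) {
        assert(idx[i] < n);       /* guard against out-of-range writes */
        out[idx[i]] += val[i];    /* indirect (scattered) store */
    }
}
```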
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89618
--- Comment #3 from Moritz Kreutzer ---
Great, thanks for the quick action Richard!
Priority: P3
Component: c++
Assignee: unassigned at gcc dot gnu.org
Reporter: moritz.kreutzer at siemens dot com
Target Milestone: ---
Godbolt worksheet: https://godbolt.org/z/F6m5hl
GCC (trunk and all earlier versions) fails to vectorize (SSE/AVX2/AVX-512
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89653
--- Comment #7 from Moritz Kreutzer ---
Thanks for taking this up Richard! I just want to check back: Do you need any
assistance with testing or more information from my side?
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89653
--- Comment #11 from Moritz Kreutzer ---
I am currently out of the office, with limited to no email access. I will be
returning on November 28. For urgent questions regarding ARM64 support please
contact Julian Hornich, for GPGPU-related issues p