https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85048
--- Comment #16 from Hongtao Liu ---
(In reply to Matthias Kretz (Vir) from comment #15)
> So it seems that if at least one of the vector builtins involved in the
> expression is 512 bits GCC needs to locally increase prefer-vector-width to
> 512
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85048
--- Comment #15 from Matthias Kretz (Vir) ---
So it seems that if at least one of the vector builtins involved in the
expression is 512 bits GCC needs to locally increase prefer-vector-width to
512? Or, more generally:
prefer-vector-width = max(
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85048
Hongtao Liu changed:
What|Removed |Added
CC||liuhongt at gcc dot gnu.org
--- Comment #1
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85048
--- Comment #13 from Matthias Kretz (Vir) ---
Should I open a new PR for the remaining ((u)int64, 16) <-> (float, 16)
conversions?
https://godbolt.org/z/x3xPMYKj3
Note that __builtin_convertvector produces the code we want.
template
using V [
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85048
--- Comment #12 from Uroš Bizjak ---
(In reply to Hongtao.liu from comment #9)
> With the patch, we can generate optimized code except for those 16 {u,}qq
> cases, since the ABI doesn't support 1024-bit vectors.
Can't these be vectorized using pa
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85048
--- Comment #11 from Hongtao.liu ---
Fixed in GCC13.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85048
--- Comment #10 from CVS Commits ---
The master branch has been updated by hongtao Liu :
https://gcc.gnu.org/g:fe42e7fe119159f7443dbe68189e52891dc0148e
commit r13-6951-gfe42e7fe119159f7443dbe68189e52891dc0148e
Author: liuhongt
Date: Thu Mar
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85048
--- Comment #9 from Hongtao.liu ---
With the patch, we can generate optimized code except for those 16 {u,}qq
cases, since the ABI doesn't support 1024-bit vectors.
1 file changed, 16 insertions(+), 2 deletions(-)
gcc/config/i386/sse.md | 18
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85048
--- Comment #8 from Hongtao.liu ---
(In reply to Hongtao.liu from comment #7)
> Yes, looks like the pattern name is misdefined.
> It should be fixuns_trunc, but we have ufix_trunc.
No, we have the right name but generate extra instructions for un
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85048
Hongtao.liu changed:
What|Removed |Added
CC||crazylht at gmail dot com
--- Comment #7 f
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85048
--- Comment #6 from Matthias Kretz (Vir) ---
Most of the conversions are optimized perfectly now. Only the following
conversions are still missing for AVX-512:
https://godbolt.org/z/9afWbYod6
#include
template
using V [[gnu::vector_size(Size)
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85048
Devin Hussey changed:
What|Removed |Added
CC||husseydevin at gmail dot com
--- Comment
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85048
--- Comment #4 from Marc Glisse ---
See PR77399.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85048
--- Comment #3 from Matthias Kretz ---
Just opened PR85052 for tracking __builtin_convertvector support.
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85048
Richard Biener changed:
What|Removed |Added
Keywords||missed-optimization
Target|
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85048
--- Comment #1 from Matthias Kretz ---
Godbolt link:
https://godbolt.org/#z:OYLghAFBqd5QCxAYwPYBMCmBRdBLAF1QCcAaPECAKxAEZSAbAQwDtRkBSAJgCFufSAZ1QBXYskwgA5NwDMeFsgYisAag6yAwskEF8LAhuwcADAEFTZgpgC2AB2bX1WpU0GDVAFVKqFBVQByPn6qAMp4AF6YzgAigaoAVKqCkZi