https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113677
Bug ID: 113677
Summary: Missing `VEC_PERM_EXPR <{a, CST}, CST, {0, 1, 2,
...}>` optimization
Product: gcc
Version: 14.0
Status: UNCONFIRMED
Keywords: missed-optimization
Severity: enhancement
Priority: P3
Component: tree-optimization
Assignee: unassigned at gcc dot gnu.org
Reporter: pinskia at gcc dot gnu.org
Target Milestone: ---
Take:
```
#define vect32 __attribute__((vector_size(4) ))
#define vect64 __attribute__((vector_size(8) ))
vect64 unsigned char f(vect32 unsigned char a)
{
vect32 unsigned char zero={0,0,0,0};
return __builtin_shufflevector (a, zero, 0, 1, 2, 3, 4, 5, 6, 7);
}
```
On x86_64 this produces:
```
f:
movd xmm0, edi
pxor xmm1, xmm1
punpckldq xmm0, xmm1
ret
```
We should just produce:
```
movd xmm0, edi
ret
```
In .optimized we get:
```
_1 = {a_2(D), { 0, 0, 0, 0 }};
_3 = VEC_PERM_EXPR <_1, { 0, 0, 0, 0, 0, 0, 0, 0 }, { 0, 1, 2, 3, 8, 9, 10,
11 }>;
return _3;
```
But _3 and _1 are the same ...