https://gcc.gnu.org/bugzilla/show_bug.cgi?id=62080
--- Comment #6 from Richard Biener ---
(In reply to Marc Glisse from comment #5)
> With the intrinsics patch, I notice that we don't simplify in gimple either:
>
> _40 = VIEW_CONVERT_EXPR<__m128i>(_39);
> MEM[(__m128i * {ref-all})vec_4(D)] =
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=62080
--- Comment #5 from Marc Glisse ---
With the intrinsics patch, I notice that we don't simplify in gimple either:
_40 = VIEW_CONVERT_EXPR<__m128i>(_39);
MEM[(__m128i * {ref-all})vec_4(D)] = _40;
_60 = MEM[(const double *)vec_4(D)];
_61 =
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=62080
--- Comment #4 from Benjamin Schindler ---
(In reply to Marc Glisse from comment #2)
> (note that a minimal, self-contained testcase would be much better and
> shouldn't be hard to produce)
I don't mind doing so, but I don't quite know what is
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=62080
--- Comment #3 from Benjamin Schindler ---
I just looked at what gcc-4.9.1 does and it does vary:
movdqu(%rsi), %xmm1
movdqu(%rdi), %xmm0 <--
pminsd%xmm1, %xmm0 <--
pxor%xmm1, %xmm1
pmaxsd%xmm1, %xmm0
movaps%xmm0, (%rsi
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=62080
--- Comment #2 from Marc Glisse ---
(note that a minimal, self-contained testcase would be much better and
shouldn't be hard to produce)
We write to memory with:
(insn 10 8 11 2 (set (mem:V2DI (reg/v/f:DI 97 [ vec ]) [0 MEM[(__m128i *
{ref-all}
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=62080
--- Comment #1 from Benjamin Schindler ---
Created attachment 33282
--> https://gcc.gnu.org/bugzilla/attachment.cgi?id=33282&action=edit
Generated assembly in full