http://gcc.gnu.org/bugzilla/show_bug.cgi?id=58095
--- Comment #4 from Siavash Eliasi ---
In the end, here is what I really like GCC to generate for me. Same output as
function (bar) for function (foo) when using GCC with -O3 -march=core2
switches:
#include
#define BATCHSIZE 8
void foo(__m128
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=58095
--- Comment #3 from Siavash Eliasi ---
I did an experiment with using raw float data types instead of __m128 data
type. This time GCC, Clang and ICC were able to generate desired code, even
without using __restric__ keyword, but a little more dirt
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=58095
--- Comment #2 from Siavash Eliasi ---
(In reply to Andrew Pinski from comment #1)
> >I've tried using __restrict__ keyword for input data (foo2),
>
> I think you want __restrict__ inside of the [].
Do you mind pasting the modified source code a
Priority: P3
Component: c++
Assignee: unassigned at gcc dot gnu.org
Reporter: siavashserver at gmail dot com
Created attachment 30621
--> http://gcc.gnu.org/bugzilla/attachment.cgi?id=30621&action=edit
Source code and its generated asm code.
Hello. I have no