On 08/01/07, Revital1 Eres <[EMAIL PROTECTED]> wrote:
-ftree-vectorize flag is missing.
(see http://gcc.gnu.org/projects/tree-ssa/vectorization.html for more info
about the flags you should use)
Ah, I didn't know about that. I don't have that flag on my main dev machine
(it's still using the 3.4 branch).
Apologies if this is the wrong list.
I'm afraid I'm not much of an assembly programmer, but I was just
wondering if this generated code was 'correct', because from descriptions
of SSE that I've read, it looks like it's inefficient.
The C code:
float *vector_add4f(float va[4], const float vb[4])