SSE (Pentium 3) - Is this correct?

2007-01-07 Thread mal content
Apologies if this is the wrong list. I'm afraid I'm not much of an assembly programmer, but I was just wondering if this generated code was 'correct', because from descriptions of SSE that I've read, it looks like it's inefficient. The C code: float *vector_add4f(float va[4], const float vb[4])

Re: SSE (Pentium 3) - Is this correct?

2007-01-07 Thread mal content
On 08/01/07, Revital1 Eres <[EMAIL PROTECTED]> wrote: -ftree-vectorize flag is missing. (see http://gcc.gnu.org/projects/tree-ssa/vectorization.html for more info about the flags you should use) Ah, didn't know about that. I don't have that flag on my main dev machine (still using 3.4 branch)