Hi, I have a piece of code that runs ~70% slower on a Xeon CPU with SSE enable than with plain 387.
[EMAIL PROTECTED] ~]$ uname -a FreeBSD enc1 6.2-RELEASE FreeBSD 6.2-RELEASE #0: Fri Jan 12 11:05:30 UTC 2007 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/SMP [EMAIL PROTECTED] ~]$ gcc42 -v Using built-in specs. Target: i386-portbld-freebsd6.2 Configured with: ./..//gcc-4.2-20071024/configure --disable-nls --with-system-zlib --with-libiconv-prefix=/usr/local --with-gmp=/usr/local --program-suffix=42 --libdir=/usr/local/lib/gcc-4.2.3 --with-gxx-include-dir=/usr/local/lib/gcc-4.2.3/include/c++/ --disable-rpath --prefix=/usr/local --mandir=/usr/local/man --infodir=/usr/local/info/gcc42 i386-portbld-freebsd6.2 Thread model: posix gcc version 4.2.3 20071024 (prerelease) [EMAIL PROTECTED] ~]$ gcc42 ssucks.c -O2 -march=prescott -o ssucks-387 [EMAIL PROTECTED] ~]$ gcc42 ssucks.c -O2 -march=prescott -o ssucks-sse -mfpmath=sse [EMAIL PROTECTED] ~]$ ssucks-387 ; ssucks-sse FLOPS C Program (Double Precision), V2.0 18 Dec 1992 Module Error RunTime MFLOPS (usec) 1 4.0146e-13 0.0147 953.0052 2 -1.4166e-13 0.0061 1149.6845 FLOPS C Program (Double Precision), V2.0 18 Dec 1992 Module Error RunTime MFLOPS (usec) 1 4.0146e-13 0.0146 960.7945 2 -1.4166e-13 0.0281 249.3171 [EMAIL PROTECTED] ~]$ 1149.6845 vs 249.3171: a ~78% slowdown by just enabling sse I have source and output files here: http://teknoraver.campuslife.it/software/gcc-sse/