[Bug middle-end/31750] Suboptimal builtin_memset on x86 with SSE

2010-04-30 Thread jb at gcc dot gnu dot org
--- Comment #4 from jb at gcc dot gnu dot org 2010-04-30 18:02 --- Some more experimentation, on different hardware, reveals that the relative performance of "rep stos" vs. loop depends heavily on the size of the object to set, the optimization options (loop unrolling etc.), and presumab

[Bug middle-end/31750] Suboptimal builtin_memset on x86 with SSE

2007-04-29 Thread jb at gcc dot gnu dot org
--- Comment #3 from jb at gcc dot gnu dot org 2007-04-29 20:19 --- Hmm, try e.g. the following (with gfortran the fixed sized arrays can use builtin_memset whereas the allocatable arrays are done with a loop. Save as .F90 (capital F) to force preprocessing to set the SZ macro. ! Test pe

[Bug middle-end/31750] Suboptimal builtin_memset on x86 with SSE

2007-04-29 Thread rguenth at gcc dot gnu dot org
--- Comment #2 from rguenth at gcc dot gnu dot org 2007-04-29 19:58 --- Do you have a testcase that shows this behavior gcc vs. other? -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31750

[Bug middle-end/31750] Suboptimal builtin_memset on x86 with SSE

2007-04-29 Thread jb at gcc dot gnu dot org
--- Comment #1 from jb at gcc dot gnu dot org 2007-04-29 12:14 --- Jakub made a related patch a while ago: http://gcc.gnu.org/ml/gcc-patches/2006-08/msg01134.html -- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=31750