------- Comment #1 from rguenth at gcc dot gnu dot org 2008-02-07 10:29 ------- The testcase looks completely memory bound. Does the compiler tell you it does vectorization at all? Have you tried without -fprefetch-loop-arrays (with todays HW prefetchers and the simple access patterns it's probably not a win here).
-- http://gcc.gnu.org/bugzilla/show_bug.cgi?id=35117