100x performance regression between gcc 3.4.5 and gcc 4.X

2006-03-11 Thread Nickolay Kolchin
During "bashmark" memory benchmark performance analysis, I found a 100x performance regression between gcc 3.4.5 and gcc 4.X.

-- test_cmd.cpp (simplified bashmark memory RW test) ---
#include #include template static void int_membench(uint8_t* mb1, uint8_t* mb2) { for(uint32_t i = 0; i <

Re: 100x performance regression between gcc 3.4.5 and gcc 4.X

2006-03-12 Thread Nickolay Kolchin
On 3/12/06, Richard Guenther <[EMAIL PROTECTED]> wrote:
> On 3/12/06, Nickolay Kolchin <[EMAIL PROTECTED]> wrote:
> > During "bashmark" memory benchmark performance analysis, I found a 100x
> > performance regression between gcc 3.4.5 and gcc 4.X.
>

Re: 100x performance regression between gcc 3.4.5 and gcc 4.X

2006-03-12 Thread Nickolay Kolchin
On 3/12/06, Steven Bosscher <[EMAIL PROTECTED]> wrote:
>
> It is valid. We should understand why this behavior has changed so
> drastically.
>

I've attached assembler output from different compiler versions:

3.4.5-athlon-xp: gcc-3.4.5 -O3 -march=athlon-xp
3.4.5-pentium4: gcc-3.4.5 -O3 -march=pe