During "bashmark" memory benchmark perfomance analyze, I found 100x perfomance
regression between gcc 3.4.5 and gcc 4.X.
-- test_cmd.cpp (simplified bashmark memory RW test) ---
#include
#include
template
static void int_membench(uint8_t* mb1, uint8_t* mb2)
{
for(uint32_t i = 0; i <
On 3/12/06, Richard Guenther <[EMAIL PROTECTED]> wrote:
> On 3/12/06, Nickolay Kolchin <[EMAIL PROTECTED]> wrote:
> > During "bashmark" memory benchmark perfomance analyze, I found 100x
> > perfomance
> > regression between gcc 3.4.5 and gcc 4.X.
>
On 3/12/06, Steven Bosscher <[EMAIL PROTECTED]> wrote:
>
> It is valid. We should understand why this behavior has changed so
> drastically.
>
I've attached assembler output from different compiler versions:
3.4.5-athlon-xp: gcc-3.4.5 -O3 -march=athlon-xp
3.4.5-pentium4: gcc-3.4.5 -O3 -march=pe