Paolo Bonzini wrote:
Attached you'll find the (preprocessed) source of the routine that
printed the Infinities (of course, I cannot be completely certain that
it actually resulted in the wrong code, but at least it might be studied
to see if it helps to find the culprit).
No, this function is sane (the peephole *is* called a lot by this
function, but all is in due order). I looked at the dumps and assembly
for -O2, -O3 and -O3 -fno-schedule-insns (*), and all is as expected.
Yeah, it was probably too much to hope for.
Interestingly enough, you *should* expect a speedup when this is resolved...
The next guess then is that the RHXU and RHYV arrays are wrong. From
these, ZHXY is computed, and ZHXY is multiplied into each of the
outputs. Can you send the routine that computes those, or is it too big?
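One way to test that guess would be to scan both arrays for non-finite
values right after they are computed. Below is a minimal sketch in C,
assuming the arrays can be viewed as contiguous double-precision data;
the names is_nonfinite_bits and first_nonfinite are hypothetical and not
taken from the program under discussion. It inspects the IEEE 754 bit
pattern instead of calling isinf()/isnan(), because if the code is built
with -ffast-math the compiler may assume such values never occur and
fold those calls away.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Returns 1 if x is Inf or NaN, judged from the IEEE 754 bit pattern
   (exponent field all ones), so the test does not rely on isinf() or
   isnan(), which may be optimized away under -ffast-math. */
static int is_nonfinite_bits(double x)
{
    uint64_t bits;
    memcpy(&bits, &x, sizeof bits);   /* well-defined type punning */
    return ((bits >> 52) & 0x7ffu) == 0x7ffu;
}

/* Returns the index of the first non-finite element of a[0..n-1],
   or -1 if every element is finite. */
long first_nonfinite(const double *a, size_t n)
{
    for (size_t i = 0; i < n; i++)
        if (is_nonfinite_bits(a[i]))
            return (long)i;
    return -1;
}
```

Calling first_nonfinite on each input array, and then on ZHXY, would
show whether the bad values are already present in the inputs or only
appear once ZHXY is computed.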
I can't get to my home system anymore (probably a blackout), so I'll
send it to you this evening.
Paolo
(*) it would have helped to know the compilation flags and target, of
course.
Yep, sorry: -g -O3 -ffast-math -mcpu=native -march=native
on an x86-64-unknown-linux-gnu system (native 64-bit).
--
Toon Moene, KNMI (Weer/Onderzoek), The Netherlands
Phone: +31 30 2206443; e-mail: mo...@knmi.nl