------- Additional Comments From uros at kss-loka dot si  2005-09-30 09:39 -------
(In reply to comment #8)
> time for no recip pass (comment #4): 1.43 * 100 / 10.05 = 14.22

The time above is with -ffast-math, with the current recip pass disabled.

> time for new recip pass (comment #6): 1.85 * 100 / 12.04 = 15.36

The time above is without -ffast-math, with the current recip pass enabled.

I have done some more measurements, with gcc version 4.1.0 20050930 and your
patch committed.

The first results are with the recip pass disabled ("&& 0" was added to
gate_cse_reciprocals()). Three runs were performed:

Without recip pass:
povray -display=NONE abyss.pov:

user    0m15.770s
user    0m15.824s
user    0m15.781s
=================
        0m15.7917

The next results are with the recip pass enabled:
povray_recip -display=NONE abyss.pov:

user    0m15.891s
user    0m15.826s
user    0m15.867s
=================
        0m15.8613 (+ 0.44%)

The povray was compiled with:

CFLAGS = -pipe -O3 -msse -mfpmath=387 -ffast-math -D__NO_MATH_INLINES -msse2 -march=pentium4 -mtune=pentium4 -malign-double -minline-all-stringops
CXXFLAGS = -pipe -Wno-multichar -O3 -msse -mfpmath=387 -ffast-math -D__NO_MATH_INLINES -msse2 -march=pentium4 -mtune=pentium4 -malign-double -minline-all-stringops

In my case, the recip pass marginally increases run times (by about 0.44%)
for -mfpmath=387.

-- 
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=24123
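
For anyone who wants to re-check the averages and the +0.44% figure above, a minimal sketch follows. It assumes the bottom line of each run block is the arithmetic mean of the three "user" times; the variable names and the helper `relative_change_percent` are made up for illustration and are not part of the original measurements.

```python
from statistics import mean

# User times in seconds, copied from the three runs reported in the comment.
without_recip = [15.770, 15.824, 15.781]   # recip pass disabled
with_recip = [15.891, 15.826, 15.867]      # recip pass enabled


def relative_change_percent(baseline, candidate):
    """Change of mean(candidate) relative to mean(baseline), in percent.

    Hypothetical helper, introduced only for this illustration.
    """
    base = mean(baseline)
    return (mean(candidate) - base) / base * 100.0


mean_without = mean(without_recip)
mean_with = mean(with_recip)
slowdown = relative_change_percent(without_recip, with_recip)

print(f"without recip pass: {mean_without:.4f}s")
print(f"with recip pass:    {mean_with:.4f}s ({slowdown:+.2f}%)")
```

With the six times listed in the comment, this should reproduce the two averages and the relative difference as quoted.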