http://gcc.gnu.org/bugzilla/show_bug.cgi?id=38306
Manuel López-Ibáñez <manu at gcc dot gnu.org> changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |manu at gcc dot gnu.org --- Comment #25 from Manuel López-Ibáñez <manu at gcc dot gnu.org> 2011-09-10 09:43:58 UTC --- (In reply to comment #24) > > The issue is that at -O3 the subroutine PD2VAL is not vectorized, while it is > at -O2. If you are interested in investigating why this is so by yourself, I would suggest that you use the various -fdump- options to check what GCC is doing differently between the two variants. 1) Dump everything you can dump. 2) Then find the earliest optimization pass where they differ (you may even use diff to make this faster). 3) Check subsequent dumps to see if that difference is actually what makes -O3 to not vectorize. (At this point you can play with -f* -fno-* to reduce the differences further and isolate the trigger).