------- Comment #7 from rguenth at gcc dot gnu dot org 2009-04-18 10:01 -------

For better speed with -march=pentium2 you should add -mtune=generic, which still uses only pentium2 features but tunes the code so that it does not pessimize newer processors.
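As a rough illustration of that suggestion, the two options can simply be combined on one command line. The file name test.c and the -O2 level are assumptions for the sketch, not taken from the report, and the command is only printed so the snippet runs without a compiler or a source file.

```shell
# Hypothetical source file; the report does not include a testcase.
SRC=test.c

# -march=pentium2 restricts the instruction set to what a Pentium II supports;
# -mtune=generic asks for scheduling aimed at a broad range of newer CPUs.
CFLAGS="-O2 -march=pentium2 -mtune=generic"

# Print the command instead of executing it.
echo "gcc $CFLAGS -c $SRC"
```

On a compiler that defaults to 64-bit code, -m32 would probably be needed as well, since pentium2 is a 32-bit-only target.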
That said, without a testcase and ideally some analysis (such as a profile comparison) there is nothing we can do. If you want to experiment with some flags, I would suggest trying -fno-tree-pre and/or -fno-ivopts and/or -funroll-loops. Using profile feedback will also help reduce code size and increase performance.

--
http://gcc.gnu.org/bugzilla/show_bug.cgi?id=35671
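The flag experiments and the profile-feedback build mentioned in the comment could be scripted roughly as follows. This is only a sketch: test.c, the output name, the -O2 base level and the helper names variants and pgo_steps are all hypothetical, and the commands are printed rather than run so that each one can be inspected and timed by hand.

```shell
SRC=test.c                                   # hypothetical file name
BASE="-O2 -march=pentium2 -mtune=generic"    # assumed baseline flags

# Print one build command per flag variant suggested in the comment,
# starting with the unmodified baseline.
variants() {
    for extra in "" "-fno-tree-pre" "-fno-ivopts" "-funroll-loops"; do
        echo "gcc $BASE${extra:+ $extra} $SRC -o test"
    done
}
variants

# Profile feedback is a two-step build with a training run in between:
# build instrumented, run on representative input, rebuild using the profile.
pgo_steps() {
    echo "gcc $BASE -fprofile-generate $SRC -o test"
    echo "./test"
    echo "gcc $BASE -fprofile-use $SRC -o test"
}
pgo_steps
```

Comparing the run time and binary size of each variant against the baseline would show which pass, if any, is responsible for the slowdown.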