> I "implemented" branch delay slots (define_delay) for my
> architecture and I use the command line option -fdelayed-branch. But
> branch delay slot filling is done just for a few candidates. Even for
> the same rule within the same compilation unit (C file) it is done in
> a few cases but not in all. How can this happen?

You should not need to pass -fdelayed-branch explicitly. If you do, this
means you're compiling at -O0, in which case the problem you run into is
not very surprising. Just compile with bare -O1 at a minimum.

-- 
Eric Botcazou