On 12/09/14 14:48, shmeel gutl wrote:
On 09-Dec-14 07:56 PM, Jeff Law wrote:
On 12/09/14 10:10, Vladimir Makarov wrote:
generate the correct code in many cases even for x86.  Jeff Law tried
IRA coloring reusage too for reload but whole RA became slower (although
he achieved performance improvements on x86).
Right.  After IRA was complete, I'd walk over the unallocated allocnos
and split their ranges at EBB boundaries.  That created new allocnos
with a smaller conflict set and reduced the conflict set for the
original unallocated allocnos.

After I'd done that splitting for all the EBBs, I called back into
ira_reassign_pseudos to try to assign the original unallocated
allocnos as well as the new allocnos.

To get good results, much of IRA's cost analysis had to be redone from
scratch.  And from a compile-time standpoint, that's a killer.

The other approach I was looking at was a backwards walk through each
block.  When I found an insn with an unallocated pseudo that would
trigger one of various range spliting techniques to try and free up a
hard register.  Then again I'd call into ira_reassign_pseudos to try
the allocations again.  This got even better results, but was
obviously even more compile-time expensive.

I don't think much, if any, of that work is relevant given the current
structure and effectiveness of LRA.

jeff


Are any of these versions available? I wouldn't mind a 50% penalty in
compile time if it gave me a 5% improvement in the generated code.
They'd be horribly bitrotted by now and you'd have to rewire them into current versions of GCC. I'd be virtually certain you're not going to get a 5% improvement. If you got even a half percent over what LRA does I'd be *really* surprised.

jeff

Reply via email to