On 10/08/2012 06:31 AM, Steven Bosscher wrote:
On Sun, Oct 7, 2012 at 5:59 PM, Vladimir Makarov wrote:
* lra-lives.c (lra_start_point_ranges, lra_finish_point_ranges):
Remove.
(process_bb_lives): Change start regno in
EXECUTE_IF_SET_IN_BITMAP. Iterate on DF_L
On Sun, Oct 7, 2012 at 5:59 PM, Vladimir Makarov wrote:
> * lra-lives.c (lra_start_point_ranges, lra_finish_point_ranges):
> Remove.
> (process_bb_lives): Change start regno in
> EXECUTE_IF_SET_IN_BITMAP. Iterate on DF_LR_IN (bb) instead of
> pseudos_live_th
On Mon, Oct 8, 2012 at 10:18 AM, Jakub Jelinek wrote:
>> > I'm playing with a patch to expand the insns_with_changed_offsets
>> > bitmap to an sbitmap, and will send a patch if this works better.
>>
>> Or make insns_with_changed_offsets a VEC of insns (or a pointer-set).
>
> Or use temporarily som
On Mon, Oct 08, 2012 at 09:20:47AM +0200, Richard Guenther wrote:
> On Sun, Oct 7, 2012 at 11:27 PM, Steven Bosscher
> wrote:
> > The next bottle-neck in my timings is in
> > lra-eliminate.c:lra_eliminate(), in this loop:
> >
> >FOR_EACH_BB (bb)
> > FOR_BB_INSNS_SAFE (bb, insn, temp)
> >
On Sun, Oct 7, 2012 at 11:27 PM, Steven Bosscher wrote:
> On Sun, Oct 7, 2012 at 5:59 PM, Vladimir Makarov wrote:
>> The following patch speeds LRA up more on PR54146. Below times for
>> compilation of the test on gcc17.fsffrance.org (an AMD machine):
>>
>> Before:
>> real=1214.71 user=1192.05 sy
On Sun, Oct 7, 2012 at 5:59 PM, Vladimir Makarov wrote:
> The following patch speeds LRA up more on PR54146. Below times for
> compilation of the test on gcc17.fsffrance.org (an AMD machine):
>
> Before:
> real=1214.71 user=1192.05 system=22.48
> After:
> real=1144.37 user=1124.31 system=20.11
Hi
Hi Vlad,
Thanks for working on this!
> - EXECUTE_IF_SET_IN_BITMAP (reg_live_out, 0, j, bi)
> -if (j >= FIRST_PSEUDO_REGISTER)
> - mark_pseudo_live (j);
> + EXECUTE_IF_SET_IN_BITMAP (reg_live_out, FIRST_PSEUDO_REGISTER, j, bi)
> +mark_pseudo_live (j);
FWIW, the above is optimized b
The following patch speeds LRA up more on PR54146. Below times for
compilation of the test on gcc17.fsffrance.org (an AMD machine):
Before:
real=1214.71 user=1192.05 system=22.48
After:
real=1144.37 user=1124.31 system=20.11
The patch should not change the generated code. About 2/3 of speed u