[CC: Richard S.]

Hi Wilco,

We use Nvidia TK1s (Cortex-A15) for benchmarking on 32-bit ARM.

LTO tends to increase functions due to additional inlining, which increases 
scheduling regions, which increases opportunities for the 1st scheduler for 
inter-block instruction moves, which increases register pressure.

SCHED_PRESSURE_MODEL handles cases with high register pressure well, and 
switching it off caused a few additional spills in the hot blocks, which caused 
the slow-down.

It may be worthwhile to bring SCHED_PRESSURE_MODEL back when LTO is enabled.

--
Maxim Kuvyrkov
https://www.linaro.org

> On 12 Jul 2021, at 13:25, Wilco Dijkstra <wilco.dijks...@arm.com> wrote:
> 
> Hi Maxim,
> 
> That sounds rather strange, huge differences due to scheduling are very rare. 
> Which micro architecture was this run on? I can try running it on trunk and 
> see what difference it makes with those options.
> 
> Cheers,
> Wilco
> IMPORTANT NOTICE: The contents of this email and any attachments are 
> confidential and may also be privileged. If you are not the intended 
> recipient, please notify the sender immediately and do not disclose the 
> contents to any other person, use it for any purpose, or store or copy the 
> information in any medium. Thank you.

_______________________________________________
linaro-toolchain mailing list
linaro-toolchain@lists.linaro.org
https://lists.linaro.org/mailman/listinfo/linaro-toolchain

Reply via email to