Hi Steven,
Nice to hear from you.
On 23 July 2012 19:25, Steven Bosscher wrote:
> Hello Ramana,
>
> For your PGO list:
>
> * please note that I've been working on PGO for switch code, and also
> for chains of if-statements with a common condition variable (with Tom
> de Vries)
Yes, that's what
Hello Ramana,
For your PGO list:
* please note that I've been working on PGO for switch code, and also
for chains of if-statements with a common condition variable (with Tom
de Vries)
* turning conditional execution off will not make a difference, your
profile information will be exactly the sam
On 15 November 2011 09:19, Richard Sandiford
wrote:
> Revital Eres writes:
>>> chain, so what makes the SMS version of it worse than the non-SMS version?
>>
>> I attached the SMS dump file. The problematic loop is the one with
>> "SMS succeeded 36 2" (there are three loops in total in this file).
Hi,
> Anyway, I think this explains why the non-SMS loop executes more
> quickly than GCC expects, and why the SMS loop is slower than it
> needs to be. It might be worth comparing the two loops with
> -mtune=cortex-a8.
Thanks for the detailed explanation!
I see this regression on cortex-a8 as
Revital Eres writes:
>> chain, so what makes the SMS version of it worse than the non-SMS version?
>
> I attached the SMS dump file. The problematic loop is the one with
> "SMS succeeded 36 2" (there are three loops in total in this file).
> Due to these accumulators min ii is 36 which seems to ca
Revital Eres writes:
> Another issue is related to the regression I saw with SMS in libav's
> dsputil-ssd_int8_vs_int16_c.
> Consulting with Ayal regarding this it seemed that the
> regression was due to dependence between accumulations that can be
> avoided, more specifically we had the following
Hi,
> Are there any other topics that folks want to bring up ?
There are some issues exposed while testing the register pressure
estimation for SMS that I would to get some feedback on:
As discussed off-line; one thing is related to the note_uses function
which currently does not take element z