ICX, 9% regression on znver3

crazylht at gmail dot com via Gcc-bugs Wed, 28 Apr 2021 20:02:31 -0700

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100173


--- Comment #2 from Hongtao.liu <crazylht at gmail dot com> ---

> but yes, cselim will also sink the first store, moving it across the
> scalar compute in the block.  I might note that ideally we'd sink
> all the compute as well and end up with just a conditional load of
> either pIn1->m_esState or pIn2_89->m_esState.  That might then allow
> scheduling to recover the original performance.
> 

I want to clasify this regression is not related to 2 sinked stores, it just
trigger some micro-architecture bound.

Also w/o -fvect-cost-model=very-cheap, it can be 2-3x faster, the tripper count
is constant, so i wonder why very-cheap cost model doesn't vectorize this loop?

[Bug tree-optimization/100173] telecom/viterb00data_1 has 16.92% regression compared O2 -ftree-vectorize -fvect-cost-model=very-cheap to O2 on CLX/ICX, 9% regression on znver3

Reply via email to