Andrew,
On 02.11.2019 2:22, Andrew Pinski wrote:
On Fri, Nov 1, 2019 at 7:03 AM Anton Youdkevitch
<anton.youdkevi...@bell-sw.com> wrote:
Hello,
Here is the one-liner that fixes the incorrect
vec_perm cost for thunderx2t99 chip.
With the patch applied 526.blender of CPU2017
gets ~5% improvement with no measurable changes
for other benchmarks.
Bootstrapped OK on aarch64-linux-gnu.
OK for trunk?
Maybe the big question is vec_perm used for both 1 input and 2 input
cases? If so maybe splitting the two cases would be important too.
It is as there is no per-number-of-operands distinction while
computing the vector permutation cost.
However, since 1-operand permutes are rare this would be a good
approximation (statistically).
Otherwise this is ok from my point of view but I can't approve it.
Thanks,
Andrew Pinski
2019-11-01 Anton Youdkevitch <anton.youdkevi...@bell-sw.com>
* gcc/config/aarch64/aarch64.c (thunderx2t99_vector_cost):
change vec_perm field
--
Thanks,
Anton