On Tue, Jan 23, 2024 at 12:56:52PM +0100, Richard Biener wrote:
> The following avoids using exact_log2 on the number of SIMD clone calls
> to be emitted when vectorizing calls since that can easily be not
> a power of two in which case it will return -1. For different simd
> clones the number of calls will differ by a multiply with a power of two
> only so using floor_log2 is good enough here.
>
> Bootstrap and regtest running on x86_64-unknown-linux-gnu.
>
> PR tree-optimization/113552
> * tree-vect-stmts.cc (vectorizable_simd_clone_call): Use
> floor_log2 instead of exact_log2 on the number of calls.

Is there any target which supports non-power-of-two simdlen?
If not, perhaps we should add !pow2p_hwi (num_calls) to the condition
guarding the continue; a few lines earlier?

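To illustrate the difference, here is a small standalone sketch.  The
*_sketch helpers are simplified stand-ins that I am assuming match the
hwint.h functions for these inputs (they are not the GCC implementations),
and badness_term is a hypothetical name that only mirrors the line touched
by the patch.

```cpp
#include <cassert>

// Assumption: simplified stand-in for floor_log2; returns -1 for 0.
constexpr int floor_log2_sketch (unsigned long long x)
{
  int l = -1;
  while (x)
    {
      x >>= 1;
      ++l;
    }
  return l;
}

// Assumption: simplified stand-in for pow2p_hwi; true only for powers of two.
constexpr bool pow2p_sketch (unsigned long long x)
{
  return x != 0 && (x & (x - 1)) == 0;
}

// Assumption: simplified stand-in for exact_log2; -1 unless x is a power of two.
constexpr int exact_log2_sketch (unsigned long long x)
{
  return pow2p_sketch (x) ? floor_log2_sketch (x) : -1;
}

// Hypothetical helper mirroring "this_badness += log2 (num_calls) * 4096".
constexpr int badness_term (bool use_exact, unsigned num_calls)
{
  if (num_calls == 1)
    return 0;
  return (use_exact ? exact_log2_sketch (num_calls)
		    : floor_log2_sketch (num_calls)) * 4096;
}

// With exact_log2 a non-power-of-two count lowers the badness instead of
// raising it; with floor_log2 the term stays non-negative.
static_assert (badness_term (true, 3) == -4096, "exact_log2 yields -1");
static_assert (badness_term (false, 3) == 4096, "floor_log2 yields 1");
static_assert (badness_term (true, 4) == badness_term (false, 4),
	       "both agree on powers of two");
```

Rejecting !pow2p_hwi (num_calls) up front would make the two variants
indistinguishable, since they agree on every power of two.
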
> diff --git a/gcc/tree-vect-stmts.cc b/gcc/tree-vect-stmts.cc
> index 09749ae3817..1dbe1115da4 100644
> --- a/gcc/tree-vect-stmts.cc
> +++ b/gcc/tree-vect-stmts.cc
> @@ -4071,7 +4071,7 @@ vectorizable_simd_clone_call (vec_info *vinfo, stmt_vec_info stmt_info,
> || (nargs != simd_nargs))
> continue;
> if (num_calls != 1)
> - this_badness += exact_log2 (num_calls) * 4096;
> + this_badness += floor_log2 (num_calls) * 4096;
> if (n->simdclone->inbranch)
> this_badness += 8192;
> int target_badness = targetm.simd_clone.usable (n);
> --
> 2.35.3
Jakub