On Tue, Jan 23, 2024 at 12:56:52PM +0100, Richard Biener wrote:
> The following avoids using exact_log2 on the number of SIMD clone calls
> to be emitted when vectorizing calls since that can easily be not
> a power of two in which case it will return -1.  For different simd
> clones the number of calls will differ by a multiply with a power of two
> only so using floor_log2 is good enough here.
> 
> Bootstrap and regtest running on x86_64-unknown-linux-gnu.
> 
>       PR tree-optimization/113552
>       * tree-vect-stmts.cc (vectorizable_simd_clone_call): Use
>       floor_log2 instead of exact_log2 on the number of calls.

Is there any target which supports non-power-of-two simdlen?
If not, perhaps we should add !pow2p_hwi (num_calls) to the continue;
condition a few lines earlier?

> diff --git a/gcc/tree-vect-stmts.cc b/gcc/tree-vect-stmts.cc
> index 09749ae3817..1dbe1115da4 100644
> --- a/gcc/tree-vect-stmts.cc
> +++ b/gcc/tree-vect-stmts.cc
> @@ -4071,7 +4071,7 @@ vectorizable_simd_clone_call (vec_info *vinfo, 
> stmt_vec_info stmt_info,
>           || (nargs != simd_nargs))
>         continue;
>       if (num_calls != 1)
> -       this_badness += exact_log2 (num_calls) * 4096;
> +       this_badness += floor_log2 (num_calls) * 4096;
>       if (n->simdclone->inbranch)
>         this_badness += 8192;
>       int target_badness = targetm.simd_clone.usable (n);
> -- 
> 2.35.3

        Jakub

Reply via email to