https://gcc.gnu.org/g:51d831bd7cd122511d03efcc3da2de343a16553a
commit 51d831bd7cd122511d03efcc3da2de343a16553a
Author: Richard Biener <rguent...@suse.de>
Date:   Wed Aug 23 10:48:32 2023 +0200

    Fail vectorization when not SLP with --param vect-force-slp=1

    The following adds --param vect-force-slp allowing to indicate failure
    when not all stmts participating in loop vectorization are using
    SLP vectorization. This is intended for transitioning and debugging.

    Enabling this without further changes results in the following
    within vect.exp on x86_64

                    === g++ Summary ===

    -# of expected passes           619
    +# of expected passes           546
    +# of unexpected failures       73

                    === gcc Summary ===

    -# of expected passes           8835
    -# of expected failures         256
    +# of expected passes           7271
    +# of unexpected failures       1564
    +# of unexpected successes      12
    +# of expected failures         244

                    === gfortran Summary ===

    -# of expected passes           171
    +# of expected passes           144
    +# of unexpected failures       27

            * params.opt (-param=vect-force-slp=): New, default to 0.
            * doc/invoke.texi (--param vect-force-slp): Document.
            * tree-vect-stmts.cc (vect_analyze_stmt): With
            --param vect-force-slp=1 fail vectorization when not using SLP.

Diff:
---
 gcc/doc/invoke.texi    | 4 ++++
 gcc/params.opt         | 4 ++++
 gcc/tree-vect-stmts.cc | 6 ++++++
 3 files changed, 14 insertions(+)

diff --git a/gcc/doc/invoke.texi b/gcc/doc/invoke.texi
index ddcd5213f06a..3bd02fb13e5e 100644
--- a/gcc/doc/invoke.texi
+++ b/gcc/doc/invoke.texi
@@ -16747,6 +16747,10 @@ this parameter. The default value of this parameter is 50.
 @item vect-induction-float
 Enable loop vectorization of floating point inductions.
 
+@item vect-force-slp
+Fail vectorization when falling back to non-SLP. This is intended for
+debugging only.
+
 @item vrp-sparse-threshold
 Maximum number of basic blocks before VRP uses a sparse bitmap cache.
 
diff --git a/gcc/params.opt b/gcc/params.opt
index d34ef545bf03..74ea9c6f8d93 100644
--- a/gcc/params.opt
+++ b/gcc/params.opt
@@ -1198,6 +1198,10 @@ The maximum factor which the loop vectorizer applies to the cost of statements i
 Common Joined UInteger Var(param_vect_induction_float) Init(1) IntegerRange(0, 1) Param Optimization
 Enable loop vectorization of floating point inductions.
 
+-param=vect-force-slp=
+Common Joined UInteger Var(param_vect_force_slp) Init(0) IntegerRange(0, 1) Param Optimization
+Fail vectorization when falling back to non-SLP.
+
 -param=vrp-sparse-threshold=
 Common Joined UInteger Var(param_vrp_sparse_threshold) Init(3000) Optimization Param
 Maximum number of basic blocks before VRP uses a sparse bitmap cache.
diff --git a/gcc/tree-vect-stmts.cc b/gcc/tree-vect-stmts.cc
index b8a71605f1bc..f99dce38bf7b 100644
--- a/gcc/tree-vect-stmts.cc
+++ b/gcc/tree-vect-stmts.cc
@@ -13257,6 +13257,12 @@ vect_analyze_stmt (vec_info *vinfo,
       return opt_result::success ();
     }
 
+  if (param_vect_force_slp && !node)
+    return opt_result::failure_at (stmt_info->stmt,
+                                   "not vectorized:"
+                                   " not part of SLP but SLP forced: %G",
+                                   stmt_info->stmt);
+
   ok = true;
   if (!bb_vinfo
       && (STMT_VINFO_RELEVANT_P (stmt_info)