On Wed, Oct 5, 2022 at 5:39 PM Eric Botcazou via Gcc-patches <gcc-patches@gcc.gnu.org> wrote: > > Hi, > > as shown by the attached testcase, there is a loophole in the unroll-and-jam > pass that can quickly result in wrong code generation. The code reads: > > if (!compute_data_dependences_for_loop (outer, true, &loop_nest, > &datarefs, &dependences)) > { > if (dump_file && (dump_flags & TDF_DETAILS)) > fprintf (dump_file, "Cannot analyze data dependencies\n"); > free_data_refs (datarefs); > free_dependence_relations (dependences); > continue; > } > > but compute_data_dependences_for_loop may return true even if the analysis is > reported as failing by compute_affine_dependence for some dependence pair: > > (compute_affine_dependence > ref_a: data[_14], stmt_a: data[_14] = i_59; > ref_b: data[_14], stmt_b: data[_14] = i_59; > Data ref a: > #(Data Ref: > # bb: 12 > # stmt: data[_14] = i_59; > # ref: data[_14]; > # base_object: data; > # Access function 0: scev_not_known; > #) > Data ref b: > #(Data Ref: > # bb: 12 > # stmt: data[_14] = i_59; > # ref: data[_14]; > # base_object: data; > # Access function 0: scev_not_known; > #) > affine dependence test not usable: access function not affine or constant. > ) -> dependence analysis failed > > Note that this is a self-dependence pair and the code for them reads: > > /* Nothing interesting for the self dependencies. */ > if (dra == drb) > continue; > > This means that the pass may reorder "complex" accesses to the same memory > location in successive iterations, which is OK for reads but not for writes. > > Proposed fix attached, tested on x86-64/Linux, OK for all active branches?
+ if (DR_IS_WRITE (dra) + && !DR_ACCESS_FNS (dra).is_empty () + && DDR_ARE_DEPENDENT (ddr) == chrec_dont_know) + { I'm wondering if testing DR_IS_WRITE (dra) is enough here and whether the logic also applies to RAW and WAR. So should it be either (DR_IS_WRITE (dra) || DR_IS_WRITE (drb)) or DR_IS_WRITE (dra) && DR_IS_WRITE (drb) instead? Otherwise thanks for catching. Richard. > > 2022-10-05 Eric Botcazou <ebotca...@adacore.com> > > * gimple-loop-jam.cc (tree_loop_unroll_and_jam): Bail out for a self > dependency that is a write-after-write if the access function is not > affine or constant. > > > 2022-10-05 Eric Botcazou <ebotca...@adacore.com> > > * gcc.c-torture/execute/20221005-1.c: New test. > > -- > Eric Botcazou