https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113727

Richard Biener <rguenth at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |jamborm at gcc dot gnu.org

--- Comment #21 from Richard Biener <rguenth at gcc dot gnu.org> ---
Ah, so the mistake happens in 135.sra which does

   <bb 2> [local count: 178992760]:
-  as.f3 = 5;
+  as$f3_6 = 5;

   <bb 3> [local count: 894749064]:
   # y_24 = PHI <y_14(5), 0(2)>
   # as_27 = PHI <as_12(5), 169(2)>
+  # as$f3_8 = PHI <as$f3_4(5), as$f3_6(2)>
   _1 = as_27 & 31;
   if (_1 != 0)
     goto <bb 5>; [50.00%]
@@ -26,12 +39,12 @@
     goto <bb 4>; [50.00%]

   <bb 4> [local count: 447374532]:
-  cstore_19 = MEM <struct f> [(void *)&as].f3;
+  cstore_19 = as$f3_8;

   <bb 5> [local count: 894749064]:
   # as_12 = PHI <as_27(4), 66(3)>
   # cstore_20 = PHI <cstore_19(4), 154(3)>
-  MEM <struct f> [(void *)&as].f3 = cstore_20;
+  as$f3_4 = cstore_20;
   y_14 = y_24 + 1;
   if (y_14 <= 4)
     goto <bb 3>; [80.00%]
@@ -41,8 +54,12 @@
   <bb 6> [local count: 178992760]:
   # as_28 = PHI <as_12(5)>
   BIT_FIELD_REF <as, 8, 0> = as_28;
+  as$f3_22 = as.f3;
+  as.f3 = as$f3_22;
   aq1 = as;


note how we elide as.f3 but in BB6 fail to process the BIT_FIELD_REF
but then re-materialize as.f3 as if 'as' were fully stored to by
the BIT_FIELD_REF.

The BIT_FIELD_REF should have triggered re-materialization before it.

Upon handling BIT_FIELD_REF <as, 8, 0> = as_28; we create the re-load
of as.f3, but as said we fail to re-materialize 'as' before it from the
replacement.

For the following aggregate copy we run into

      if (access_has_children_p (lacc)
          && access_has_children_p (racc)
          /* When an access represents an unscalarizable region, it usually
             represents accesses with variable offset and thus must not be used
             to generate new memory accesses.  */
          && !lacc->grp_unscalarizable_region
          && !racc->grp_unscalarizable_region)
        {
          struct subreplacement_assignment_data sad;

          sad.left_offset = lacc->offset;
          sad.assignment_lhs = lhs;
          sad.assignment_rhs = rhs; 
          sad.top_racc = racc;
          sad.old_gsi = *gsi;
          sad.new_gsi = gsi;
          sad.loc = gimple_location (stmt);
          sad.refreshed = SRA_UDH_NONE;

          if (lacc->grp_read && !lacc->grp_covered)
            handle_unscalarized_data_in_subtree (&sad);

which I think is a similar situation in that the BIT_FIELD_REF on the LHS
overlaps with replacements and is a RMW operation.  I think SRA simply
assumes that any non-aggregate copy will hever partially invalidate
replacements?

I'm not sure how BIT_FIELD_REF was handled (and worked) before my change,
we record the whole variable as access for the BIT_FIELD_REF write
(but with ->grp_partial_lhs set).  But we do not look at grp_partial_lhs
when analyzing for overlaps.

The following fixes this, but a "better" change would be to record the
proper extent, including the BIT_FIELD_REF, even for LHS?  Before my RHS
handling change we likely always produced a replacement for the BIT_FIELD_REF
base and kept the BIT_FIELD_REFs around, correct?

diff --git a/gcc/tree-sra.cc b/gcc/tree-sra.cc
index f8e71ec48b9..848bb8b89e0 100644
--- a/gcc/tree-sra.cc
+++ b/gcc/tree-sra.cc
@@ -2269,6 +2269,11 @@ sort_and_splice_var_accesses (tree var)
           && TREE_CODE (access->expr) == COMPONENT_REF
           && DECL_BIT_FIELD (TREE_OPERAND (access->expr, 1)));

+      /* When there is a partial LHS involved we have no way to see what it
+        accesses, so if it's not the only access we have to fail.  */
+      if (access->grp_partial_lhs && access_count != 1)
+       return NULL;
+
       if (first || access->offset >= high)
        {
          first = false;

Reply via email to