The following tackles another source of slow bitmap operations,
namely populating blocks_to_update.  We already use tree view for
it around PHI insertion, but the initial population is slow as
well.  Unfortunately there is a conditional list view requirement
in between, and the bitmap API doesn't allow opportunistic
switching: it rejects tree -> tree and list -> list transitions.
So the following patch wraps the early population in a tree view
section, at the cost of possibly one redundant
tree -> list -> tree view transition.

This cuts tree SSA incremental from 228.25s (21%) to 65.05s (7%).

Bootstrapped and tested on x86_64-unknown-linux-gnu, pushed.

        PR tree-optimization/114855
        * tree-into-ssa.cc (update_ssa): Use tree view for the
        initial population of blocks_to_update.
---
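Note (not part of the commit message): the view discipline described
above can be sketched with a toy model.  This is a minimal
illustration under stated assumptions, not GCC's bitmap
implementation: ToyBitmap, populate_in_tree_view and the use of
std::set are hypothetical names and choices made for this sketch.
The only behaviour it takes from the description is that a switch to
the view that is already active is rejected, so every switch has to
be paired explicitly.

```cpp
#include <set>
#include <stdexcept>
#include <vector>

// Hypothetical toy model of a bitmap that has a list view and a tree view.
// It only tracks which view is active; storage is a plain std::set.
class ToyBitmap {
public:
  enum class View { List, Tree };

  void tree_view() { switch_to(View::Tree); }
  void list_view() { switch_to(View::List); }

  void set_bit(unsigned bit) { bits_.insert(bit); }
  bool test_bit(unsigned bit) const { return bits_.count(bit) != 0; }

  View view() const { return view_; }
  unsigned transitions() const { return transitions_; }

private:
  // Reject tree -> tree and list -> list, as the text above describes.
  void switch_to(View target) {
    if (view_ == target)
      throw std::logic_error("bitmap is already in the requested view");
    view_ = target;
    ++transitions_;
  }

  View view_ = View::List;  // assumption: a fresh bitmap starts in list view
  unsigned transitions_ = 0;
  std::set<unsigned> bits_;
};

// Shape of the patched flow: populate in tree view, then hand the bitmap
// back in list view for the code that requires it.
inline ToyBitmap populate_in_tree_view(const std::vector<unsigned> &blocks) {
  ToyBitmap bm;
  bm.tree_view();           // wrap the early population in tree view
  for (unsigned b : blocks)
    bm.set_bit(b);
  bm.list_view();           // back to list view before list-only users
  return bm;
}
```

A later consumer that wants tree view again has to call tree_view()
once more, which is the possibly redundant tree -> list -> tree
round trip mentioned in the description.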
 gcc/tree-into-ssa.cc | 5 +++++
 1 file changed, 5 insertions(+)

diff --git a/gcc/tree-into-ssa.cc b/gcc/tree-into-ssa.cc
index 1cce9d62809..fc61d47ca77 100644
--- a/gcc/tree-into-ssa.cc
+++ b/gcc/tree-into-ssa.cc
@@ -3445,6 +3445,7 @@ update_ssa (unsigned update_flags)
   blocks_with_phis_to_rewrite = BITMAP_ALLOC (NULL);
   bitmap_tree_view (blocks_with_phis_to_rewrite);
   blocks_to_update = BITMAP_ALLOC (NULL);
+  bitmap_tree_view (blocks_to_update);
 
   insert_phi_p = (update_flags != TODO_update_ssa_no_phi);
 
@@ -3492,6 +3493,8 @@ update_ssa (unsigned update_flags)
         placement heuristics.  */
       prepare_block_for_update (start_bb, insert_phi_p);
 
+      bitmap_list_view (blocks_to_update);
+
       tree name;
 
       if (flag_checking)
@@ -3517,6 +3520,8 @@ update_ssa (unsigned update_flags)
     }
   else
     {
+      bitmap_list_view (blocks_to_update);
+
       /* Otherwise, the entry block to the region is the nearest
         common dominator for the blocks in BLOCKS.  */
       start_bb = nearest_common_dominator_for_set (CDI_DOMINATORS,
-- 
2.43.0
