https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104686
--- Comment #18 from Richard Biener <rguenth at gcc dot gnu.org> ---
(In reply to Richard Biener from comment #16)
> it doesn't make a difference for this testcase but profiling shows that
> allocnos_conflict_p is quite expensive so it's best to do it after the other
> continue checks like the following. I also notice that the comment of
> allocnos_conflict_p says
>
> /* Return TRUE if allocnos A1 and A2 conflicts. Here we are
> interesting only in conflicts of allocnos with intersected allocno
> classes. */
>
> so doing it after the ira_reg_classes_intersect_p check makes even more
> sense(?)
>
> diff --git a/gcc/ira-color.cc b/gcc/ira-color.cc
> index 8b6db1bb417..a5fd79484eb 100644
> --- a/gcc/ira-color.cc
> +++ b/gcc/ira-color.cc
> @@ -1572,15 +1572,14 @@ update_conflict_hard_regno_costs (int *costs, enum
> reg_class aclass,
> else
> gcc_unreachable ();
>
> + another_aclass = ALLOCNO_CLASS (another_allocno);
> if (another_allocno == from
> + || ALLOCNO_ASSIGNED_P (another_allocno)
> + || ALLOCNO_COLOR_DATA (another_allocno)->may_be_spilled_p
> + || ! ira_reg_classes_intersect_p[aclass][another_aclass]
> || allocnos_conflict_p (another_allocno, start))
> continue;
>
> - another_aclass = ALLOCNO_CLASS (another_allocno);
> - if (! ira_reg_classes_intersect_p[aclass][another_aclass]
> - || ALLOCNO_ASSIGNED_P (another_allocno)
> - || ALLOCNO_COLOR_DATA (another_allocno)->may_be_spilled_p)
> - continue;
> class_size = ira_class_hard_regs_num[another_aclass];
> ira_allocate_and_copy_costs
> (&ALLOCNO_UPDATED_CONFLICT_HARD_REG_COSTS (another_allocno),
>
>
> Now, what's more odd is that we sometimes have a nice bitmap representation
> for the conflicts but we always iterate. So it _seems_ we should be able
> to do sth like
>
> diff --git a/gcc/ira-color.cc b/gcc/ira-color.cc
> index 8b6db1bb417..682d1ef7562 100644
> --- a/gcc/ira-color.cc
> +++ b/gcc/ira-color.cc
> @@ -1352,9 +1352,23 @@ allocnos_conflict_p (ira_allocno_t a1, ira_allocno_t
> a2)
> {
> obj = ALLOCNO_OBJECT (a1, word);
> /* Take preferences of conflicting allocnos into account. */
> - FOR_EACH_OBJECT_CONFLICT (obj, conflict_obj, oci)
> - if (OBJECT_ALLOCNO (conflict_obj) == a2)
> - return true;
> + if (!OBJECT_CONFLICT_VEC_P (obj))
> + {
> + for (int w2 = 0; w2 < ALLOCNO_NUM_OBJECTS (a2); w2++)
> + {
> + ira_object_t obj2 = ALLOCNO_OBJECT (a2, w2);
> + if (OBJECT_CONFLICT_ID (obj2) >= OBJECT_MIN (obj)
> + && OBJECT_CONFLICT_ID (obj2) <= OBJECT_MAX (obj)
> + && TEST_MINMAX_SET_BIT (OBJECT_CONFLICT_BITVEC (obj),
> + OBJECT_CONFLICT_ID (obj2),
> + OBJECT_MIN (obj), OBJECT_MAX
> (obj)))
> + return true;
> + }
> + }
> + else
> + FOR_EACH_OBJECT_CONFLICT (obj, conflict_obj, oci)
> + if (OBJECT_ALLOCNO (conflict_obj) == a2)
> + return true;
> }
> return false;
> }
>
> which reduces compile-time from 10s to 1s for me ... the above should
> be split out so we can "optimally" use the bit test for
> object vs. allocno when possible.
>
> Vlad - any thoughts about the above two things? Shall I try to polish and
> optimize the bit test or would you be willing to pick those two speedups up?
Bootstrapped / tested ok on x86_64-unknown-linux-gnu.