Tamar Christina <[email protected]> writes:
>> For variable-sized vectors, I suppose the question is whether the
>> first unequal element is found in the minimum vector length, or whether
>> it only occurs for larger lengths. In the former case we can fold at
>> compile time, but in the latter case we can't.
>>
>> So we probably do want the loop for variable-length vectors, up to
>> constant_lower_bound (CONST_VECTOR_NUNITS (...)).
>>
>
> Doesn't operand_equal already do this? it looks like the VLA handling
> In same_vector_encodings_p rejects vectors that are not the same size,
> which is good enough for this no? since I'm after strict equality.
But what I meant is that for VLA vectors, compile-time equality is
a tristate value: yes, no, or maybe.
E.g.:
{ 0, 0, 0, 0, 0, 0, 0, 0, ... }
is not equal to
{ 0, 0, 1, 1, 1, 1, 1, 1, ... }
if the runtime VL gives more than 2 elements, but they are equal if
the runtime VL gives 2 elements. In this case, we can't fold EQ to
false at compile time if the minimum length is 2 elements, but we can
if the minimum length is 4 elements.
Similarly:
{ 0, 0, 1, 1, 1, 1, 1, 1, ... }
is only conditionally not equal to:
{ 0, 0, 1, 1, 2, 2, 3, 3, ... }
It isn't the case that every encoded value has to be present in every
runtime vector. E.g. the series { 0, 1, 2, ... } exists for VNx2DI
(for INDEX Z0.D, #0, #1), even though there is never a "2" element for
the minimum vector length.
Thanks,
Richard
>
>
> Bootstrapped Regtested on aarch64-none-linux-gnu, arm-none-linux-gnueabihf,
> x86_64-pc-linux-gnu -m32, -m64 and no issues.
>
> Ok for master?
>
> Thanks,
> Tamar
>
> gcc/ChangeLog:
>
> * simplify-rtx.cc (simplify_context::simplify_unary_operation): Try
> simplifying operand.
> (simplify_const_relational_operation): Simplify vector EQ and NE.
> (test_vector_int_const_compare): New.
> (test_vector_ops): Use it.
>
>
> -- inline copy of patch --
>
> diff --git a/gcc/simplify-rtx.cc b/gcc/simplify-rtx.cc
> index
> a20a61c5dddbc80b23a9489d925a2c31b2163458..8ba5864efb33ffa5d1ced99f6a7d0c73e12560d5
> 100644
> --- a/gcc/simplify-rtx.cc
> +++ b/gcc/simplify-rtx.cc
> @@ -6354,6 +6354,19 @@ simplify_const_relational_operation (enum rtx_code
> code,
> return 0;
> }
>
> + /* Check if the operands are a vector EQ or NE comparison. */
> + if (VECTOR_MODE_P (mode)
> + && INTEGRAL_MODE_P (mode)
> + && GET_CODE (op0) == CONST_VECTOR
> + && GET_CODE (op1) == CONST_VECTOR
> + && (code == EQ || code == NE))
> + {
> + if (rtx_equal_p (op0, op1))
> + return code == EQ ? const_true_rtx : const0_rtx;
> + else
> + return code == NE ? const_true_rtx : const0_rtx;
> + }
> +
> /* We can't simplify MODE_CC values since we don't know what the
> actual comparison is. */
> if (GET_MODE_CLASS (GET_MODE (op0)) == MODE_CC)
> @@ -8797,6 +8810,34 @@ test_vector_subregs (machine_mode inner_mode)
> test_vector_subregs_stepped (inner_mode);
> }
>
> +/* Verify vector constant comparisons for EQ and NE. */
> +
> +static void
> +test_vector_int_const_compare (machine_mode mode)
> +{
> + rtx zeros = CONST0_RTX (mode);
> + rtx minusone = CONSTM1_RTX (mode);
> + rtx series_0_1 = gen_const_vec_series (mode, const0_rtx, const1_rtx);
> + ASSERT_RTX_EQ (const0_rtx,
> + simplify_const_relational_operation (EQ, mode, zeros,
> + CONST1_RTX (mode)));
> + ASSERT_RTX_EQ (const_true_rtx,
> + simplify_const_relational_operation (EQ, mode, zeros,
> + CONST0_RTX (mode)));
> + ASSERT_RTX_EQ (const_true_rtx,
> + simplify_const_relational_operation (EQ, mode, minusone,
> + CONSTM1_RTX (mode)));
> + ASSERT_RTX_EQ (const_true_rtx,
> + simplify_const_relational_operation (NE, mode, zeros,
> + CONST1_RTX (mode)));
> + ASSERT_RTX_EQ (const_true_rtx,
> + simplify_const_relational_operation (NE, mode, zeros,
> + series_0_1));
> + ASSERT_RTX_EQ (const0_rtx,
> + simplify_const_relational_operation (EQ, mode, zeros,
> + series_0_1));
> +}
> +
> /* Verify some simplifications involving vectors. */
>
> static void
> @@ -8814,6 +8855,7 @@ test_vector_ops ()
> {
> test_vector_ops_series (mode, scalar_reg);
> test_vector_subregs (mode);
> + test_vector_int_const_compare (mode);
> }
> test_vec_merge (mode);
> }