On Wed, Jul 24, 2024 at 11:22 PM Kyrylo Tkachov <ktkac...@nvidia.com> wrote:
>
>
>
> > On 25 Jul 2024, at 04:14, Andrew Pinski <quic_apin...@quicinc.com> wrote:
> >
> > External email: Use caution opening links or attachments
> >
> >
> > This renames the patterns orn<mode>3 to iorn<mode>3 so it
> > matches the new optab that was added with r15-1890-gf379596e0ba99d.
> > Likewise for bic<mode>3 to andn<mode>3.
> >
> > Note the operand 1 and operand 2 are swapped from the original
> > patterns to match the optab now.
> >
> > Built and tested for aarch64-linux-gnu with no regression.
>
> Ok but...
>
> >
> > gcc/ChangeLog:
> >
> >        * config/aarch64/aarch64-simd.md
> >        (bic<mode>3<vczle><vczbe>): Rename to ...
> >        (andn<mode>3<vczle><vczbe>): This. Also swap operands.
> >        (orn<mode>3<vczle><vczbe>): Rename to ...
> >        (iorn<mode>3<vczle><vczbe>): This. Also swap operands.
> >        (vec_cmp<mode><v_int_equiv>): Update orn call to iorn
> >        and swap the last two arguments.
> >
> > gcc/testsuite/ChangeLog:
> >
> >        * g++.target/aarch64/vect_cmp-1.C: New test.
>
> … any reason why these are in g++.target instead of gcc.target?

Yes because currently the `(a > b) ? 0 : c` syntax is currently only
supported with the C++ front-end. See
https://gcc.gnu.org/pipermail/gcc-patches/2024-July/657682.html for
the RFC patch to add it for the C front-end.

Thanks,
Andrew Pinski

> Thanks,
> Kyrill
>
>
> >
> > Signed-off-by: Andrew Pinski <quic_apin...@quicinc.com>
> > ---
> > gcc/config/aarch64/aarch64-simd.md            | 20 +++++-----
> > gcc/testsuite/g++.target/aarch64/vect_cmp-1.C | 37 +++++++++++++++++++
> > 2 files changed, 47 insertions(+), 10 deletions(-)
> > create mode 100644 gcc/testsuite/g++.target/aarch64/vect_cmp-1.C
> >
> > diff --git a/gcc/config/aarch64/aarch64-simd.md 
> > b/gcc/config/aarch64/aarch64-simd.md
> > index bbeee221f37..459e11b09a1 100644
> > --- a/gcc/config/aarch64/aarch64-simd.md
> > +++ b/gcc/config/aarch64/aarch64-simd.md
> > @@ -322,21 +322,21 @@ (define_insn "aarch64_simd_mov_from_<mode>high"
> >   [(set_attr "length" "4")]
> > )
> >
> > -(define_insn "orn<mode>3<vczle><vczbe>"
> > +(define_insn "iorn<mode>3<vczle><vczbe>"
> >  [(set (match_operand:VDQ_I 0 "register_operand" "=w")
> > -       (ior:VDQ_I (not:VDQ_I (match_operand:VDQ_I 1 "register_operand" 
> > "w"))
> > -               (match_operand:VDQ_I 2 "register_operand" "w")))]
> > +       (ior:VDQ_I (not:VDQ_I (match_operand:VDQ_I 2 "register_operand" 
> > "w"))
> > +               (match_operand:VDQ_I 1 "register_operand" "w")))]
> >  "TARGET_SIMD"
> > - "orn\t%0.<Vbtype>, %2.<Vbtype>, %1.<Vbtype>"
> > + "orn\t%0.<Vbtype>, %1.<Vbtype>, %2.<Vbtype>"
> >   [(set_attr "type" "neon_logic<q>")]
> > )
> >
> > -(define_insn "bic<mode>3<vczle><vczbe>"
> > +(define_insn "andn<mode>3<vczle><vczbe>"
> >  [(set (match_operand:VDQ_I 0 "register_operand" "=w")
> > -       (and:VDQ_I (not:VDQ_I (match_operand:VDQ_I 1 "register_operand" 
> > "w"))
> > -               (match_operand:VDQ_I 2 "register_operand" "w")))]
> > +       (and:VDQ_I (not:VDQ_I (match_operand:VDQ_I 2 "register_operand" 
> > "w"))
> > +               (match_operand:VDQ_I 1 "register_operand" "w")))]
> >  "TARGET_SIMD"
> > - "bic\t%0.<Vbtype>, %2.<Vbtype>, %1.<Vbtype>"
> > + "bic\t%0.<Vbtype>, %1.<Vbtype>, %2.<Vbtype>"
> >   [(set_attr "type" "neon_logic<q>")]
> > )
> >
> > @@ -4064,7 +4064,7 @@ (define_expand "vec_cmp<mode><v_int_equiv>"
> >                                               tmp0, <V_INT_EQUIV>mode),
> >                               lowpart_subreg (<MODE>mode,
> >                                               tmp1, <V_INT_EQUIV>mode)));
> > -       emit_insn (gen_orn<v_int_equiv>3 (operands[0], tmp2, operands[0]));
> > +       emit_insn (gen_iorn<v_int_equiv>3 (operands[0], operands[0], tmp2));
> >       }
> >       break;
> >
> > @@ -4111,7 +4111,7 @@ (define_expand "vec_cmp<mode><v_int_equiv>"
> >       else if (code == UNEQ)
> >        {
> >          emit_insn (gen_aarch64_cmeq<mode> (tmp, operands[2], operands[3]));
> > -         emit_insn (gen_orn<v_int_equiv>3 (operands[0], operands[0], tmp));
> > +         emit_insn (gen_iorn<v_int_equiv>3 (operands[0], tmp, 
> > operands[0]));
> >        }
> >       break;
> >
> > diff --git a/gcc/testsuite/g++.target/aarch64/vect_cmp-1.C 
> > b/gcc/testsuite/g++.target/aarch64/vect_cmp-1.C
> > new file mode 100644
> > index 00000000000..b82d87827d3
> > --- /dev/null
> > +++ b/gcc/testsuite/g++.target/aarch64/vect_cmp-1.C
> > @@ -0,0 +1,37 @@
> > +/* { dg-do compile } */
> > +/* { dg-additional-options "-O2 -fdump-tree-optimized" } */
> > +/* { dg-final { check-function-bodies "**" "" "" { target { le } } } } */
> > +
> > +#pragma GCC target "+nosve"
> > +
> > +#define vect8 __attribute__((vector_size(8) ))
> > +
> > +/**
> > +**bar1:
> > +**     fcmgt   v([0-9]+).2s, v[0-9]+.2s, v[0-9]+.2s
> > +**     bic     v0.8b, v2.8b, v\1.8b
> > +**     ret
> > +*/
> > +extern "C"
> > +vect8 int bar1(vect8 float a, vect8 float b, vect8 int c)
> > +{
> > +  return (a > b) ? 0 : c;
> > +}
> > +
> > +/**
> > +**bar2:
> > +**     fcmgt   v([0-9]+).2s, v[0-9]+.2s, v[0-9]+.2s
> > +**     orn     v0.8b, v2.8b, v\1.8b
> > +**     ret
> > +*/
> > +extern "C"
> > +vect8 int bar2(vect8 float a, vect8 float b, vect8 int c)
> > +{
> > +  return (a > b) ? c : -1;
> > +}
> > +
> > +// We should produce a BIT_ANDC and BIT_IORC here.
> > +
> > +// { dg-final { scan-tree-dump ".BIT_ANDN " "optimized" } }
> > +// { dg-final { scan-tree-dump ".BIT_IORN " "optimized" } }
> > +
> > --
> > 2.43.0
> >
>

Reply via email to