RE: [Patch] [AArch64] PR target 66049: fix add/extend gcc test suite failures

Kumar, Venkataramanan Fri, 22 May 2015 01:51:09 -0700

Hi Kyrill, 

Sorry for little delay in responding.


> -----Original Message-----
> From: Kyrill Tkachov [mailto:[email protected]]
> Sent: Tuesday, May 19, 2015 9:13 PM
> To: Kumar, Venkataramanan; James Greenhalgh; [email protected]
> Cc: Ramana Radhakrishnan; [email protected]; Marcus Shawcroft
> Subject: Re: [Patch] [AArch64] PR target 66049: fix add/extend gcc test suite
> failures
> 
> Hi Venkat,
> 
> On 19/05/15 16:37, Kumar, Venkataramanan wrote:
> > Hi Maintainers,
> >
> > Please find the attached patch, that fixes add/extend gcc test suite 
> > failures
> in Aarch64 target.
> > Ref: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66049
> >
> > These tests started to fail after we prevented combiner from converting
> shift RTX to mult RTX, when the RTX is not inside a memory operation
> (r222874) .
> > Now I have added new add/extend patterns which are based on shift
> operations,  to fix these cases.
> >
> > Testing status with the patch.
> >
> > (1) GCC bootstrap on AArch64 successful.
> > (2)  SPEC2006 INT runs did not show any degradation.
> 
> Does that mean there was no performance regression? Or no codegen
> difference?

Yes there was no performance regression.


> What I'd expect from this patch is that the codegen would be the same as
> before the combine patch
> (r222874). A performance difference can sometimes be hard to measure
> even at worse code quality.
> Can you please confirm that on SPEC2006 INT the adds and shifts are now
> back to being combined
> into a single instruction?

I  used -dp --save-temps to dump pattern names in assembly files. 
I used revision before combiner patch (r222873) and the patch on top of trunk 
(little older r223194) for comparison.

Quickly compared the counts generated for the below patterns from the SPEC INT 
binaries. 

*adds_<optab><mode>_multp2 vs *adds_<optab><ALLX:mode>_shft_<GPI:mode> 
*subs_<optab><mode>_multp2 vs *subs_<optab><ALLX:mode>_shft_<GPI:mode>
*add_uxt<mode>_multp2      vs  *add_uxt<mode>_shift2
*add_uxtsi_multp2_uxtw     vs  *add_uxtsi_shift2_uxtw
*sub_uxt<mode>_multp2      vs  *sub_uxt<mode>_shift2
*sub_uxtsi_multp2_uxtw     vs  *sub_uxtsi_shift2_uxtw
*adds_mul_imm_<mode>    vs     *adds_shift_imm_<mode> 
*subs_mul_imm_<mode>    vs     *subs_shift_imm_<mode>

Patterns are found in few benchmarks. The  generated counts are matching in the 
binaries compiled with the two revisions. 

 Also Looked at assembly generated and  the adds and shifts are combined 
properly using the new patterns. Please let me know if this is OK.

> 
> Thanks,
> Kyrill
> 
> > (3) gcc regression testing passed.
> >
> > (-----Snip-----)
> > # Comparing 3 common sum files
> > ## /bin/sh ./gcc-fsf-trunk/contrib/compare_tests  /tmp/gxx-sum1.24998
> /tmp/gxx-sum2.24998
> > Tests that now work, but didn't before:
> >
> > gcc.target/aarch64/adds1.c scan-assembler adds\tw[0-9]+, w[0-9]+, w[0-
> 9]+, lsl 3
> > gcc.target/aarch64/adds1.c scan-assembler adds\tx[0-9]+, x[0-9]+, x[0-9]+,
> lsl 3
> > gcc.target/aarch64/adds3.c scan-assembler-times adds\tx[0-9]+, x[0-9]+,
> x[0-9]+, sxtw 2
> > gcc.target/aarch64/extend.c scan-assembler add\tw[0-9]+,.*uxth #?1
> > gcc.target/aarch64/extend.c scan-assembler add\tx[0-9]+,.*uxtw #?3
> > gcc.target/aarch64/extend.c scan-assembler sub\tw[0-9]+,.*uxth #?1
> > gcc.target/aarch64/extend.c scan-assembler sub\tx[0-9]+,.*uxth #?1
> > gcc.target/aarch64/extend.c scan-assembler sub\tx[0-9]+,.*uxtw #?3
> > gcc.target/aarch64/subs1.c scan-assembler subs\tw[0-9]+, w[0-9]+, w[0-
> 9]+, lsl 3
> > gcc.target/aarch64/subs1.c scan-assembler subs\tx[0-9]+, x[0-9]+, x[0-9]+,
> lsl 3
> > gcc.target/aarch64/subs3.c scan-assembler-times subs\tx[0-9]+, x[0-9]+,
> x[0-9]+, sxtw 2
> >
> > # No differences found in 3 common sum files
> > (-----Snip-----)
> >
> > The patterns are fixing the regressing tests, so I have not added any new
> tests.
> > Regarding  removal of the old patterns based on "mults",  I am planning to
> do it as a separate work.
> >
> > Is this OK for trunk ?
> >
> > gcc/ChangeLog
> >
> > 2015-05-19  Venkataramanan Kumar  <[email protected]>
> >
> >          * config/aarch64/aarch64.md
> >          (*adds_shift_imm_<mode>):  New pattern.
> >          (*subs_shift_imm_<mode>):  Likewise.
> >          (*adds_<optab><ALLX:mode>_shift_<GPI:mode>):  Likewise.
> >          (*subs_<optab><ALLX:mode>_shift_<GPI:mode>): Likewise.
> >          (*add_uxt<mode>_shift2): Likewise.
> >          (*add_uxtsi_shift2_uxtw): Likewise.
> >         (*sub_uxt<mode>_shift2): Likewise.
> >         (*sub_uxtsi_shift2_uxtw): Likewise.
> >
> >
> > Regards,
> > Venkat.
> >
> >
> >
> >

Regards,
Venkat,

RE: [Patch] [AArch64] PR target 66049: fix add/extend gcc test suite failures

Reply via email to