Hi Kyrill, Sorry for little delay in responding.
> -----Original Message----- > From: Kyrill Tkachov [mailto:kyrylo.tkac...@foss.arm.com] > Sent: Tuesday, May 19, 2015 9:13 PM > To: Kumar, Venkataramanan; James Greenhalgh; gcc-patches@gcc.gnu.org > Cc: Ramana Radhakrishnan; seg...@kernel.crashing.org; Marcus Shawcroft > Subject: Re: [Patch] [AArch64] PR target 66049: fix add/extend gcc test suite > failures > > Hi Venkat, > > On 19/05/15 16:37, Kumar, Venkataramanan wrote: > > Hi Maintainers, > > > > Please find the attached patch, that fixes add/extend gcc test suite > > failures > in Aarch64 target. > > Ref: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66049 > > > > These tests started to fail after we prevented combiner from converting > shift RTX to mult RTX, when the RTX is not inside a memory operation > (r222874) . > > Now I have added new add/extend patterns which are based on shift > operations, to fix these cases. > > > > Testing status with the patch. > > > > (1) GCC bootstrap on AArch64 successful. > > (2) SPEC2006 INT runs did not show any degradation. > > Does that mean there was no performance regression? Or no codegen > difference? Yes there was no performance regression. > What I'd expect from this patch is that the codegen would be the same as > before the combine patch > (r222874). A performance difference can sometimes be hard to measure > even at worse code quality. > Can you please confirm that on SPEC2006 INT the adds and shifts are now > back to being combined > into a single instruction? I used -dp --save-temps to dump pattern names in assembly files. I used revision before combiner patch (r222873) and the patch on top of trunk (little older r223194) for comparison. Quickly compared the counts generated for the below patterns from the SPEC INT binaries. *adds_<optab><mode>_multp2 vs *adds_<optab><ALLX:mode>_shft_<GPI:mode> *subs_<optab><mode>_multp2 vs *subs_<optab><ALLX:mode>_shft_<GPI:mode> *add_uxt<mode>_multp2 vs *add_uxt<mode>_shift2 *add_uxtsi_multp2_uxtw vs *add_uxtsi_shift2_uxtw *sub_uxt<mode>_multp2 vs *sub_uxt<mode>_shift2 *sub_uxtsi_multp2_uxtw vs *sub_uxtsi_shift2_uxtw *adds_mul_imm_<mode> vs *adds_shift_imm_<mode> *subs_mul_imm_<mode> vs *subs_shift_imm_<mode> Patterns are found in few benchmarks. The generated counts are matching in the binaries compiled with the two revisions. Also Looked at assembly generated and the adds and shifts are combined properly using the new patterns. Please let me know if this is OK. > > Thanks, > Kyrill > > > (3) gcc regression testing passed. > > > > (-----Snip-----) > > # Comparing 3 common sum files > > ## /bin/sh ./gcc-fsf-trunk/contrib/compare_tests /tmp/gxx-sum1.24998 > /tmp/gxx-sum2.24998 > > Tests that now work, but didn't before: > > > > gcc.target/aarch64/adds1.c scan-assembler adds\tw[0-9]+, w[0-9]+, w[0- > 9]+, lsl 3 > > gcc.target/aarch64/adds1.c scan-assembler adds\tx[0-9]+, x[0-9]+, x[0-9]+, > lsl 3 > > gcc.target/aarch64/adds3.c scan-assembler-times adds\tx[0-9]+, x[0-9]+, > x[0-9]+, sxtw 2 > > gcc.target/aarch64/extend.c scan-assembler add\tw[0-9]+,.*uxth #?1 > > gcc.target/aarch64/extend.c scan-assembler add\tx[0-9]+,.*uxtw #?3 > > gcc.target/aarch64/extend.c scan-assembler sub\tw[0-9]+,.*uxth #?1 > > gcc.target/aarch64/extend.c scan-assembler sub\tx[0-9]+,.*uxth #?1 > > gcc.target/aarch64/extend.c scan-assembler sub\tx[0-9]+,.*uxtw #?3 > > gcc.target/aarch64/subs1.c scan-assembler subs\tw[0-9]+, w[0-9]+, w[0- > 9]+, lsl 3 > > gcc.target/aarch64/subs1.c scan-assembler subs\tx[0-9]+, x[0-9]+, x[0-9]+, > lsl 3 > > gcc.target/aarch64/subs3.c scan-assembler-times subs\tx[0-9]+, x[0-9]+, > x[0-9]+, sxtw 2 > > > > # No differences found in 3 common sum files > > (-----Snip-----) > > > > The patterns are fixing the regressing tests, so I have not added any new > tests. > > Regarding removal of the old patterns based on "mults", I am planning to > do it as a separate work. > > > > Is this OK for trunk ? > > > > gcc/ChangeLog > > > > 2015-05-19 Venkataramanan Kumar <venkataramanan.ku...@amd.com> > > > > * config/aarch64/aarch64.md > > (*adds_shift_imm_<mode>): New pattern. > > (*subs_shift_imm_<mode>): Likewise. > > (*adds_<optab><ALLX:mode>_shift_<GPI:mode>): Likewise. > > (*subs_<optab><ALLX:mode>_shift_<GPI:mode>): Likewise. > > (*add_uxt<mode>_shift2): Likewise. > > (*add_uxtsi_shift2_uxtw): Likewise. > > (*sub_uxt<mode>_shift2): Likewise. > > (*sub_uxtsi_shift2_uxtw): Likewise. > > > > > > Regards, > > Venkat. > > > > > > > > Regards, Venkat,