Hi Venkat,

On 22/05/15 09:50, Kumar, Venkataramanan wrote:
Hi Kyrill,

Sorry for little delay in responding.

-----Original Message-----
From: Kyrill Tkachov [mailto:kyrylo.tkac...@foss.arm.com]
Sent: Tuesday, May 19, 2015 9:13 PM
To: Kumar, Venkataramanan; James Greenhalgh; gcc-patches@gcc.gnu.org
Cc: Ramana Radhakrishnan; seg...@kernel.crashing.org; Marcus Shawcroft
Subject: Re: [Patch] [AArch64] PR target 66049: fix add/extend gcc test suite
failures

Hi Venkat,

On 19/05/15 16:37, Kumar, Venkataramanan wrote:
Hi Maintainers,

Please find the attached patch, that fixes add/extend gcc test suite failures
in Aarch64 target.
Ref: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=66049

These tests started to fail after we prevented combiner from converting
shift RTX to mult RTX, when the RTX is not inside a memory operation
(r222874) .
Now I have added new add/extend patterns which are based on shift
operations,  to fix these cases.
Testing status with the patch.

(1) GCC bootstrap on AArch64 successful.
(2)  SPEC2006 INT runs did not show any degradation.
Does that mean there was no performance regression? Or no codegen
difference?
Yes there was no performance regression.


What I'd expect from this patch is that the codegen would be the same as
before the combine patch
(r222874). A performance difference can sometimes be hard to measure
even at worse code quality.
Can you please confirm that on SPEC2006 INT the adds and shifts are now
back to being combined
into a single instruction?
I  used -dp --save-temps to dump pattern names in assembly files.
I used revision before combiner patch (r222873) and the patch on top of trunk 
(little older r223194) for comparison.

Quickly compared the counts generated for the below patterns from the SPEC INT 
binaries.

*adds_<optab><mode>_multp2 vs *adds_<optab><ALLX:mode>_shft_<GPI:mode>
*subs_<optab><mode>_multp2 vs *subs_<optab><ALLX:mode>_shft_<GPI:mode>
*add_uxt<mode>_multp2        vs  *add_uxt<mode>_shift2
*add_uxtsi_multp2_uxtw     vs  *add_uxtsi_shift2_uxtw
*sub_uxt<mode>_multp2        vs  *sub_uxt<mode>_shift2
*sub_uxtsi_multp2_uxtw     vs  *sub_uxtsi_shift2_uxtw
*adds_mul_imm_<mode>      vs     *adds_shift_imm_<mode>
*subs_mul_imm_<mode>      vs     *subs_shift_imm_<mode>

Patterns are found in few benchmarks. The  generated counts are matching in the 
binaries compiled with the two revisions.

  Also Looked at assembly generated and  the adds and shifts are combined 
properly using the new patterns. Please let me know if this is OK.

Thanks for investigating this. I've run your patch on aarch64.exp and I see the 
add+shift/extend failures
that we were seeing go away (I'm sure you saw that as well ;).
Up to the maintainers to review the patch.

Thanks,
Kyrill


Thanks,
Kyrill

(3) gcc regression testing passed.

(-----Snip-----)
# Comparing 3 common sum files
## /bin/sh ./gcc-fsf-trunk/contrib/compare_tests  /tmp/gxx-sum1.24998
/tmp/gxx-sum2.24998
Tests that now work, but didn't before:

gcc.target/aarch64/adds1.c scan-assembler adds\tw[0-9]+, w[0-9]+, w[0-
9]+, lsl 3
gcc.target/aarch64/adds1.c scan-assembler adds\tx[0-9]+, x[0-9]+, x[0-9]+,
lsl 3
gcc.target/aarch64/adds3.c scan-assembler-times adds\tx[0-9]+, x[0-9]+,
x[0-9]+, sxtw 2
gcc.target/aarch64/extend.c scan-assembler add\tw[0-9]+,.*uxth #?1
gcc.target/aarch64/extend.c scan-assembler add\tx[0-9]+,.*uxtw #?3
gcc.target/aarch64/extend.c scan-assembler sub\tw[0-9]+,.*uxth #?1
gcc.target/aarch64/extend.c scan-assembler sub\tx[0-9]+,.*uxth #?1
gcc.target/aarch64/extend.c scan-assembler sub\tx[0-9]+,.*uxtw #?3
gcc.target/aarch64/subs1.c scan-assembler subs\tw[0-9]+, w[0-9]+, w[0-
9]+, lsl 3
gcc.target/aarch64/subs1.c scan-assembler subs\tx[0-9]+, x[0-9]+, x[0-9]+,
lsl 3
gcc.target/aarch64/subs3.c scan-assembler-times subs\tx[0-9]+, x[0-9]+,
x[0-9]+, sxtw 2
# No differences found in 3 common sum files
(-----Snip-----)

The patterns are fixing the regressing tests, so I have not added any new
tests.
Regarding  removal of the old patterns based on "mults",  I am planning to
do it as a separate work.
Is this OK for trunk ?

gcc/ChangeLog

2015-05-19  Venkataramanan Kumar  <venkataramanan.ku...@amd.com>

          * config/aarch64/aarch64.md
          (*adds_shift_imm_<mode>):  New pattern.
          (*subs_shift_imm_<mode>):  Likewise.
          (*adds_<optab><ALLX:mode>_shift_<GPI:mode>):  Likewise.
          (*subs_<optab><ALLX:mode>_shift_<GPI:mode>): Likewise.
          (*add_uxt<mode>_shift2): Likewise.
          (*add_uxtsi_shift2_uxtw): Likewise.
         (*sub_uxt<mode>_shift2): Likewise.
         (*sub_uxtsi_shift2_uxtw): Likewise.


Regards,
Venkat.




Regards,
Venkat,

Reply via email to