Re: [PATCH] dse: Remove partial load after full store for high part access[PR71309]

2020-07-21 Thread Segher Boessenkool
On Tue, Jul 21, 2020 at 06:37:29PM -0400, David Edelsohn wrote: > On Tue, Jul 21, 2020 at 5:54 PM Segher Boessenkool > wrote: > > always (the target cannot run the resulting code, but we have many other > > options like that, starting with -mcpu=). David, what is your > > preference? > > > > The

Re: [PATCH] dse: Remove partial load after full store for high part access[PR71309]

2020-07-21 Thread David Edelsohn via Gcc-patches
On Tue, Jul 21, 2020 at 5:54 PM Segher Boessenkool wrote: > > Hi! > > On Tue, Jul 21, 2020 at 05:54:27AM -0500, Xiong Hu Luo wrote: > > --- a/gcc/testsuite/gcc.target/powerpc/fold-vec-extract-short.p7.c > > +++ b/gcc/testsuite/gcc.target/powerpc/fold-vec-extract-short.p7.c > > @@ -3,7 +3,7 @@ > >

Re: [PATCH] dse: Remove partial load after full store for high part access[PR71309]

2020-07-21 Thread Segher Boessenkool
Hi! On Tue, Jul 21, 2020 at 05:54:27AM -0500, Xiong Hu Luo wrote: > --- a/gcc/testsuite/gcc.target/powerpc/fold-vec-extract-short.p7.c > +++ b/gcc/testsuite/gcc.target/powerpc/fold-vec-extract-short.p7.c > @@ -3,7 +3,7 @@ > > /* { dg-do compile { target { powerpc*-*-linux* } } } */ > /* { dg-r

Re: [PATCH] dse: Remove partial load after full store for high part access[PR71309]

2020-07-21 Thread Richard Sandiford
Xiong Hu Luo writes: > This patch could optimize (works for char/short/int/void*): > > 6: r119:TI=[r118:DI+0x10] > 7: [r118:DI]=r119:TI > 8: r121:DI=[r118:DI+0x8] > > => > > 6: r119:TI=[r118:DI+0x10] > 16: r122:DI=r119:TI#8 > > Final ASM will be as below without partial load after full store(stxv+

[PATCH] dse: Remove partial load after full store for high part access[PR71309]

2020-07-21 Thread Xiong Hu Luo via Gcc-patches
This patch could optimize (works for char/short/int/void*): 6: r119:TI=[r118:DI+0x10] 7: [r118:DI]=r119:TI 8: r121:DI=[r118:DI+0x8] => 6: r119:TI=[r118:DI+0x10] 16: r122:DI=r119:TI#8 Final ASM will be as below without partial load after full store(stxv+ld): ld 10,16(3) mr 9,3 ld 3,24(3)