Re: Enabling -ftree-slp-vectorize on -O2/Os

2018-05-26 Thread Richard Biener
On May 27, 2018 1:25:25 AM GMT+02:00, Allan Sandfeld Jensen wrote: >On Sonntag, 27. Mai 2018 00:05:32 CEST Segher Boessenkool wrote: >> On Sat, May 26, 2018 at 11:32:29AM +0200, Allan Sandfeld Jensen >wrote: >> > I brought this subject up earlier, and was told to suggest it again >for >> > gcc 9,

Re: Enabling -ftree-slp-vectorize on -O2/Os

2018-05-26 Thread Segher Boessenkool
On Sun, May 27, 2018 at 01:25:25AM +0200, Allan Sandfeld Jensen wrote: > On Sonntag, 27. Mai 2018 00:05:32 CEST Segher Boessenkool wrote: > > On Sat, May 26, 2018 at 11:32:29AM +0200, Allan Sandfeld Jensen wrote: > > > I brought this subject up earlier, and was told to suggest it again for > > > gc

Re: Enabling -ftree-slp-vectorize on -O2/Os

2018-05-26 Thread Allan Sandfeld Jensen
On Sonntag, 27. Mai 2018 00:05:32 CEST Segher Boessenkool wrote: > On Sat, May 26, 2018 at 11:32:29AM +0200, Allan Sandfeld Jensen wrote: > > I brought this subject up earlier, and was told to suggest it again for > > gcc 9, so I have attached the preliminary changes. > > > > My studies have show

Re: [Aarch64] Vector Function Application Binary Interface Specification for OpenMP

2018-05-26 Thread Segher Boessenkool
On Sat, May 26, 2018 at 11:09:24AM +0100, Richard Sandiford wrote: > On the wider point about changing the way call clobber information > is represented: I agree it would be good to generalise what we have > now. But if possible I think we should avoid target hooks that take > a specific call, and

Re: Enabling -ftree-slp-vectorize on -O2/Os

2018-05-26 Thread Segher Boessenkool
On Sat, May 26, 2018 at 11:32:29AM +0200, Allan Sandfeld Jensen wrote: > I brought this subject up earlier, and was told to suggest it again for gcc > 9, > so I have attached the preliminary changes. > > My studies have show that with generic x86-64 optimization it reduces binary > size with ar

Re: PR80155: Code hoisting and register pressure

2018-05-26 Thread Bin.Cheng
On Fri, May 25, 2018 at 5:54 PM, Richard Biener wrote: > On May 25, 2018 6:57:13 PM GMT+02:00, Jeff Law wrote: >>On 05/25/2018 03:49 AM, Bin.Cheng wrote: >>> On Fri, May 25, 2018 at 10:23 AM, Prathamesh Kulkarni >>> wrote: On 23 May 2018 at 18:37, Jeff Law wrote: > On 05/23/2018 03:20

RISC-V ELF multilibs

2018-05-26 Thread Sebastian Huber
Hello, I built a riscv64-rtems5 GCC (it uses gcc/config/riscv/t-elf-multilib). The following multilibs are built: riscv64-rtems5-gcc -print-multi-lib .; rv32i/ilp32;@march=rv32i@mabi=ilp32 rv32im/ilp32;@march=rv32im@mabi=ilp32 rv32iac/ilp32;@march=rv32iac@mabi=ilp32 rv32imac/ilp32;@march=rv32ima

Re: Enabling -ftree-slp-vectorize on -O2/Os

2018-05-26 Thread Florian Weimer
* Allan Sandfeld Jensen: > Anythhing else I should test or report? Interaction with -mstackrealign on i386, where it is required for system libraries to support applications which use the legacy ABI without stack alignment if you compile with -msse2 or -march=x86-64 -mtune=generic (and -mfpmath=s

Re: Enabling -ftree-slp-vectorize on -O2/Os

2018-05-26 Thread Richard Biener
On May 26, 2018 11:32:29 AM GMT+02:00, Allan Sandfeld Jensen wrote: >I brought this subject up earlier, and was told to suggest it again for >gcc 9, >so I have attached the preliminary changes. > >My studies have show that with generic x86-64 optimization it reduces >binary >size with around 0.

Re: [Aarch64] Vector Function Application Binary Interface Specification for OpenMP

2018-05-26 Thread Richard Sandiford
Steve Ellcey writes: > On Wed, 2018-05-16 at 22:11 +0100, Richard Sandiford wrote: >>  >> TARGET_HARD_REGNO_CALL_PART_CLOBBERED is the only current way >> of saying that an rtl instruction preserves the low part of a >> register but clobbers the high part.  We would need something like >> Alan H's

Re: Why is REG_ALLOC_ORDER not defined on Aarch64

2018-05-26 Thread Richard Sandiford
Andrew Pinski writes: > On Fri, May 25, 2018 at 3:35 PM, Steve Ellcey wrote: >> I was curious if there was any reason that REG_ALLOC_ORDER is not >> defined for Aarch64. Has anyone tried this to see if it could help >> performance? It is defined for many other platforms. > > https://gcc.gnu.org

Enabling -ftree-slp-vectorize on -O2/Os

2018-05-26 Thread Allan Sandfeld Jensen
I brought this subject up earlier, and was told to suggest it again for gcc 9, so I have attached the preliminary changes. My studies have show that with generic x86-64 optimization it reduces binary size with around 0.5%, and when optimizing for x64 targets with SSE4 or better, it reduces bina