Re: Vectorization regression on s390x GCC6 vs GCC5

2017-01-26 Thread Richard Biener
On Thu, Jan 26, 2017 at 11:36 AM, Bin.Cheng wrote:
> On Thu, Jan 26, 2017 at 10:18 AM, Robin Dapp wrote:
>> Hi,
>>
>> while analyzing a test case with a lot of nested loops (>7) and double
>> floating point operations I noticed a performance regression of GCC 6/7
>> vs GCC 5 on s390x. It seems du

Re: Vectorization regression on s390x GCC6 vs GCC5

2017-01-26 Thread Bin.Cheng
On Thu, Jan 26, 2017 at 10:18 AM, Robin Dapp wrote:
> Hi,
>
> while analyzing a test case with a lot of nested loops (>7) and double
> floating point operations I noticed a performance regression of GCC 6/7
> vs GCC 5 on s390x. It seems due to GCC 6 vectorizing something GCC 5
> couldn't.
> Basic

Vectorization regression on s390x GCC6 vs GCC5

2017-01-26 Thread Robin Dapp
Hi, while analyzing a test case with a lot of nested loops (>7) and double floating point operations I noticed a performance regression of GCC 6/7 vs GCC 5 on s390x. It seems due to GCC 6 vectorizing something GCC 5 couldn't. Basically, each loop iterates over three dimensions, we fully unroll so