On Thu, Jan 26, 2017 at 11:36 AM, Bin.Cheng wrote:
> On Thu, Jan 26, 2017 at 10:18 AM, Robin Dapp wrote:
>> Hi,
>>
>> while analyzing a test case with a lot of nested loops (>7) and double
>> floating point operations I noticed a performance regression of GCC 6/7
>> vs GCC 5 on s390x. It seems du
On Thu, Jan 26, 2017 at 10:18 AM, Robin Dapp wrote:
> Hi,
>
> while analyzing a test case with a lot of nested loops (>7) and double
> floating point operations I noticed a performance regression of GCC 6/7
> vs GCC 5 on s390x. It seems due to GCC 6 vectorizing something GCC 5
> couldn't.
> Basic
Hi,
while analyzing a test case with a lot of nested loops (>7) and double
floating point operations I noticed a performance regression of GCC 6/7
vs GCC 5 on s390x. It seems due to GCC 6 vectorizing something GCC 5
couldn't.
Basically, each loop iterates over three dimensions, we fully unroll
so