Re: [PATCH] [x86_64] Add flag to control tight loops alignment opt

2024-11-05 Thread Hongtao Liu
Tuesday, November 5, 2024 11:20 AM > > > > To: gcc-patches@gcc.gnu.org; hubi...@ucw.cz; Liu, Hongtao > > > > ; ubiz...@gmail.com > > > > Cc: ti...@zhaoxin.com; silviaz...@zhaoxin.com; loui...@zhaoxin.com; > > > > cobec...@zhaoxin.com > > >

Re: [PATCH] [x86_64] Add flag to control tight loops alignment opt

2024-11-05 Thread Hongtao Liu
t; From: MayShao-oc > > > > Sent: Tuesday, November 5, 2024 11:20 AM > > > > To: gcc-patches@gcc.gnu.org; hubi...@ucw.cz; Liu, Hongtao > > > > ; ubiz...@gmail.com > > > > Cc: ti...@zhaoxin.com; silviaz...@zhaoxin.com; loui...@zhaoxin.com; > > >

Re: [PATCH] [x86_64] Add flag to control tight loops alignment opt

2024-11-05 Thread Richard Biener
t; To: gcc-patches@gcc.gnu.org; hubi...@ucw.cz; Liu, Hongtao > > > ; ubiz...@gmail.com > > > Cc: ti...@zhaoxin.com; silviaz...@zhaoxin.com; loui...@zhaoxin.com; > > > cobec...@zhaoxin.com > > > Subject: [PATCH] [x86_64] Add flag to control tight loop

Re: [PATCH] [x86_64] Add flag to control tight loops alignment opt

2024-11-04 Thread Hongtao Liu
...@zhaoxin.com; silviaz...@zhaoxin.com; loui...@zhaoxin.com; > > cobec...@zhaoxin.com > > Subject: [PATCH] [x86_64] Add flag to control tight loops alignment opt > > > > Hi all: > > This patch add -malign-tight-loops flag to control > > pass_align_tight_l

RE: [PATCH] [x86_64] Add flag to control tight loops alignment opt

2024-11-04 Thread Liu, Hongtao
Subject: [PATCH] [x86_64] Add flag to control tight loops alignment opt > > Hi all: > This patch add -malign-tight-loops flag to control pass_align_tight_loops. > The motivation is that pass_align_tight_loops may cause performance > regression in nested loops. > >

[PATCH] [x86_64] Add flag to control tight loops alignment opt

2024-11-04 Thread MayShao-oc
Hi all: This patch add -malign-tight-loops flag to control pass_align_tight_loops. The motivation is that pass_align_tight_loops may cause performance regression in nested loops. The example code as follows: #define ITER 2 #define ITER_O 10 int i, j,k; int array[I