Zhenqiang's been working on the later split 2 patch which causes more
constants to be built using a movw/movt instead of a constant pool
load. There was an unexpected ~10 % regression in one benchmark which
seems to be due to function alignment. I think we've tracked down the
reason but not the a
A small case is attached to reproduce it.
Here are logs for different loop header alignment (default is 64):
linaro@Linaro-test:~$ gcc test1.c -o t.exe && time ./t.exe
real0m3.206s
user0m3.203s
sys 0m0.000s
linaro@Linaro-test:~$ gcc test1.c -DALIGNED_2 -o t.exe && time ./t.exe
real