I have a small testcase to show that enable FTZ/DAZ makes a huge (>160 times faster) difference on SSE floating point code. Icc enables it by defailt for -ON (N>=1). Should gcc do the same?
H.J.
I have a small testcase to show that enable FTZ/DAZ makes a huge (>160 times faster) difference on SSE floating point code. Icc enables it by defailt for -ON (N>=1). Should gcc do the same?
H.J.