On 15/07/2020 03:39, 夏 晋 via Gcc wrote:
Hi everyone,
I'm trying to autovectorize a loop, and thanks to the omnipotent
macros, everything has gone well. But recently I needed to optimize the loop
further, and I ran into some problems.
As our vector instruction can process 16 numbers at the same time, if the for
loop counter is equal or l