https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99395
--- Comment #8 from JuzheZhong <juzhe.zhong at rivai dot ai> --- Hi, Richard. Now, I find the time to GCC vectorization optimization. I find this case: _2 = a[_1]; ... a[i_16] = _4; ,,, _7 = a[_1]; ---> This load should be eliminated and re-use _2. Am I right ? Could you guide me which pass should do this CSE optimization ? Thanks.