https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113133
--- Comment #11 from Haochen Jiang <haochen.jiang at intel dot com> --- I just checked the code and pattern. I suppose the simple remove is reasonable here. We should only allow x/ymm16+ for scalar instructions, but not this pattern.