https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96377
--- Comment #3 from Jakub Jelinek <jakub at gcc dot gnu.org> ---
For those who need a quick workaround for the kernel, I think writing
(uint8x16_t) (k ^ vld1q_u8(const0)), (uint8x16_t) (k ^ vld1q_u8(const1)),
instead of
k ^ vld1q_u8(const0), k ^ vld1q_u8(const1),
will do the job.
There is also the question of whether portable code can use binary operators on the ARM-specific vector types at all, rather than intrinsics or the normal generic vectors.
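As a rough illustration of the workaround above, here is a minimal sketch, not the kernel's actual code. The names k and const0 follow the comment; the helper names (xor_with_const, xor_matches_scalar, load_u8x16, store_u8x16) and the u8x16 typedef are hypothetical. The non-ARM branch is an assumption added only so the sketch builds elsewhere: it substitutes a GCC/Clang generic vector for uint8x16_t, which is the alternative the last sentence of the comment mentions.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#if defined(__ARM_NEON)
#include <arm_neon.h>
typedef uint8x16_t u8x16;   /* the ARM-specific vector type from the comment */
static u8x16 load_u8x16(const uint8_t *p) { return vld1q_u8(p); }
static void store_u8x16(uint8_t *p, u8x16 v) { vst1q_u8(p, v); }
#else
/* Assumption: stand-in for non-ARM builds, a GCC/Clang generic vector. */
typedef uint8_t u8x16 __attribute__((vector_size(16)));
static u8x16 load_u8x16(const uint8_t *p)
{
    u8x16 v;
    memcpy(&v, p, sizeof v);
    return v;
}
static void store_u8x16(uint8_t *p, u8x16 v) { memcpy(p, &v, sizeof v); }
#endif

/* Hypothetical helper: XOR a 16-byte key with a 16-byte constant. */
void xor_with_const(const uint8_t key[16], const uint8_t const0[16],
                    uint8_t out[16])
{
    assert(key && const0 && out);
    u8x16 k = load_u8x16(key);
    /* The suggested workaround: cast the result of the binary ^
       explicitly back to the vector type. */
    u8x16 r = (u8x16) (k ^ load_u8x16(const0));
    store_u8x16(out, r);
}

/* Compare the vector result against a plain byte-wise XOR.
   Returns 1 when all 16 bytes agree, 0 otherwise. */
int xor_matches_scalar(void)
{
    uint8_t key[16], c[16], out[16];
    for (int i = 0; i < 16; i++) {
        key[i] = (uint8_t) (i * 17);
        c[i] = (uint8_t) (0xA5 ^ i);
    }
    xor_with_const(key, c, out);
    for (int i = 0; i < 16; i++)
        if (out[i] != (uint8_t) (key[i] ^ c[i]))
            return 0;
    return 1;
}
```

The cast is a no-op as far as the computed value goes; it only changes how the expression is spelled. Whether it sidesteps the problem on a given compiler version still needs to be checked on the affected ARM toolchain.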