https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99932
--- Comment #15 from Tom de Vries <vries at gcc dot gnu.org> --- (In reply to Tom de Vries from comment #14) > An observation when playing around with vector-length-128-4.c: Another observation: ... $L11: ld.u64 %r108,[%r109]; st.u64 [%r112],%r108; setp.lt.u32 %r111,%r110,3; add.u32 %r110,%r110,1; add.u64 %r109,%r109,8; add.u64 %r112,%r112,8; @ %r111 bra.uni $L11; ... The bra.uni in the broadcast loop is incorrect, it's used in a vector-neutered block.