https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92772
--- Comment #3 from Andrew Stubbs <ams at gcc dot gnu.org> --- The GCN architecture can handle the masking, but I don't know how we'd represent or apply that in the middle end? I can probably implement extract_last, and that might be more efficient, but I don't see how that will help the logic? The convoluted extraction is working fine; it's the vector comparison that is misleading.