pengfei added a comment. > These intrinsics are interesting. The descriptions on Intrinsic guide are for > immediate variant, but all compilers' implementations are register variant. > What's more, the codegen from Clang and GCC don't seem correct according to > the description of `vpsrlw zmm0, zmm0, xmm1`. They should do the same > broadcast as ICC. https://godbolt.org/z/dcrqdEs8q
After a second read, I found Clang and GCC's generation are also correct, I confused `vpsrlw zmm0, zmm0, xmm1` with `vpsrlw zmm0, zmm0, zmm1`. Please ignore the noise. Repository: rG LLVM Github Monorepo CHANGES SINCE LAST ACTION https://reviews.llvm.org/D125164/new/ https://reviews.llvm.org/D125164 _______________________________________________ cfe-commits mailing list cfe-commits@lists.llvm.org https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits