https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107757
--- Comment #2 from Jiu Fu Guo <guojiufu at gcc dot gnu.org> --- For the scenario of this PR, the 2insn sequence would be faster than non-prefixed loading. One finding may be useful: when building 64bit constants, I notice that loading may be faster sometimes if the constant needs more instruction to build from scratch. For example, the one prefix loading(e.g. pld) could be faster than 2insn sequence.