https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102239
--- Comment #9 from luoxhu at gcc dot gnu.org ---
(In reply to Segher Boessenkool from comment #8)
> (In reply to luoxhu from comment #6)
> > > > foo:
> > > > .LFB0:
> > > >         .cfi_startproc
> > > >         rldicr. 3,3,29,1
> > > >         beq 0,.L2
> > >
> > > This is fine, but only because it tests the EQ bit (not the LT or GT
> > > bits).
> > > So the generated RTL for this insn (the 2insn one) is not correct.
> >
> > The generated RTL in pr102239.c.300r.split2 is:
> >
> > (insn 32 8 33 2 (parallel [
> >             (set (reg:CC 100 0 [123])
> >                 (compare:CC (and:DI (ashift:DI (reg:DI 3 3 [124])
> >                             (const_int 29 [0x1d]))
> >                         (const_int -4611686018427387904 [0xc000000000000000]))
> >                     (const_int 0 [0])))
> >             (clobber (reg:DI 3 3 [125]))
> >         ]) "pr102239.c":4:6 238 {*rotldi3_mask_dot}
> >      (nil))
> > (insn 33 32 10 2 (set (reg:DI 3 3 [125])
> >         (lshiftrt:DI (reg:DI 3 3 [125])
> >             (const_int 29 [0x1d]))) "pr102239.c":4:6 278 {lshrdi3}
> >      (nil))
> > (jump_insn 10 33 11 2 (set (pc)
> >         (if_then_else (eq (reg:CC 100 0 [123])
> >                 (const_int 0 [0]))
> >             (label_ref 15)
> >             (pc))) "pr102239.c":4:6 868 {*cbranch}
> >      (int_list:REG_BR_PROB 536870916 (nil))
> >  -> 15)
>
> So combine will have to look at insn 10 as well when it does the combination
> (it often already does, via "other_insn") -- but also it does have to know
> an "eq" is okay here, and that requires a new pattern.
>
> > rotldi3_mask_dot is what you mentioned in c#1, it is a shifted result and
> > not matter for comparing to 0:
>
> It does matter, if what you are want to see is if it is smaller than zero or
> greater than zero.  CCmode includes those things.  There is a CCEQmode for
> if only the EQ bit is set correctly.

Got it, thanks.  As in the example in c#7: if the CCmode comparison tests LT,
rotating the data to the highest bits can give a negative result and set the
LT bit in CR0, which is unexpected.

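To make the CCmode point concrete, here is a small C model (not GCC code) of
how a record-form instruction derives CR0 from its 64-bit result.  The shift
counts and masks are read off the two split2 dumps in this comment; the struct
and function names are made up for illustration, and the in-place mask
0x600000000 is derived from those constants rather than taken from the
testcase.

```c
#include <stdint.h>

/* Hypothetical model of the three CR0 bits a "dot" instruction sets by
   comparing its 64-bit result with zero as a signed value.  */
struct cr0 { int lt, gt, eq; };

static struct cr0 cr0_of (uint64_t result)
{
  struct cr0 c;
  c.lt = (int) (result >> 63);       /* sign bit set: result < 0 */
  c.eq = (result == 0);
  c.gt = !c.lt && !c.eq;
  return c;
}

/* Reference: the AND done in place, with no shift.  */
static struct cr0 test_in_place (uint64_t x)
{
  return cr0_of (x & 0x600000000ULL);
}

/* Current 2insn form: move the field up to the MSB, then mask.  */
static struct cr0 test_at_msb (uint64_t x)
{
  return cr0_of ((x << 29) & 0xc000000000000000ULL);
}

/* Variant tried here: move the field down to the LSB, then mask.  */
static struct cr0 test_at_lsb (uint64_t x)
{
  return cr0_of ((x >> 33) & 3);
}
```

With bit 34 of the input set, test_at_msb reports LT while test_in_place and
test_at_lsb report GT; all three agree on EQ for every input, which is why the
current code is only safe when the user tests EQ.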
> > > *rotl<mode>3_mask_dot cannot do this either; the base and the dot2 of that
> > > cannot be done, they return a shifted result, but that doesn't matter for
> > > comparing it to 0.  So we should add a specialised version.
> >
> > What specialized version to add?
>
> Some pattern that just does this as an rldicr, as a single insn.  It will
> have to be excluded by the 2insn thing (it is only a single insn itself!),
> and it will have to have comparison mode CCEQ only.

I was motivated by the clang code and tried rotating the data to the LSB
instead.  Does that avoid the CCmode issue?  Would this be simpler than the
combine & new pattern solution?

diff --git a/gcc/config/rs6000/rs6000.c b/gcc/config/rs6000/rs6000.c
index c9ce0550df1..d2a5b916b1d 100644
--- a/gcc/config/rs6000/rs6000.c
+++ b/gcc/config/rs6000/rs6000.c
@@ -11747,11 +11747,11 @@ rs6000_emit_2insn_and (machine_mode mode, rtx *operands, bool expand, int dot)
        }
       else
        {
-         rtx tmp = gen_rtx_ASHIFT (mode, operands[1], GEN_INT (shift));
-         tmp = gen_rtx_AND (mode, tmp, GEN_INT (val << shift));
-         emit_move_insn (operands[0], tmp);
-         tmp = gen_rtx_LSHIFTRT (mode, operands[0], GEN_INT (shift));
+         rtx tmp = gen_rtx_LSHIFTRT (mode, operands[1], GEN_INT (ne));
+         tmp = gen_rtx_AND (mode, tmp, GEN_INT (val >> ne));
          rs6000_emit_dot_insn (operands[0], tmp, dot, dot ? operands[3] : 0);
+         tmp = gen_rtx_ASHIFT (mode, operands[0], GEN_INT (ne));
+         emit_move_insn (operands[0], tmp);
        }
       return;

RTL pr102239.c.300r.split2:

(insn 32 8 33 2 (parallel [
            (set (reg:CC 100 0 [123])
                (compare:CC (and:DI (lshiftrt:DI (reg:DI 3 3 [124])
                            (const_int 33 [0x21]))
                        (const_int 3 [0x3]))
                    (const_int 0 [0])))
            (clobber (reg:DI 3 3 [125]))
        ]) "pr102239.c":4:6 238 {*rotldi3_mask_dot}
     (nil))
(insn 33 32 10 2 (set (reg:DI 3 3 [125])
        (ashift:DI (reg:DI 3 3 [125])
            (const_int 33 [0x21]))) "pr102239.c":4:6 268 {ashldi3}
     (nil))
(jump_insn 10 33 11 2 (set (pc)
        (if_then_else (eq (reg:CC 100 0 [123])
                (const_int 0 [0]))
            (label_ref 15)
            (pc))) "pr102239.c":4:6 868 {*cbranch}
     (int_list:REG_BR_PROB 536870916 (nil))
 -> 15)

ASM pr102239.s:

foo:
.LFB0:
        .cfi_startproc
        rldicl. 3,3,31,62
        beq 0,.L2
#APP
 # 5 "pr102239.c" 1
        # if
 # 0 "" 2
#NO_APP
        blr
        .p2align 4,,15
.L2:
#APP