https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103218

            Bug ID: 103218
           Summary: (a < 0) << signbit is not always optimized to a &
                    signbitmask at the gimple level
           Product: gcc
           Version: 12.0
            Status: UNCONFIRMED
          Keywords: missed-optimization, TREE
          Severity: enhancement
          Priority: P3
         Component: tree-optimization
          Assignee: unassigned at gcc dot gnu.org
          Reporter: pinskia at gcc dot gnu.org
  Target Milestone: ---

Take:
int f(signed char a)
{
  signed char t = a < 0;
  return (unsigned char)(t << 7);
}

At the gimple level we get:
int f(signed char a)
{
  signed char t = a < 0;
  return (unsigned char)(t << 7);
}
But combine is able to it:
Trying 9 -> 10:
    9: {r89:QI=r91:SI#0 0>>0x7;clobber flags:CC;}
      REG_DEAD r91:SI
      REG_UNUSED flags:CC
   10: {r90:QI=r89:QI<<0x7;clobber flags:CC;}
      REG_DEAD r89:QI
      REG_UNUSED flags:CC
Successfully matched this instruction:
(parallel [
        (set (reg:QI 90)
            (and:QI (subreg:QI (reg:SI 91) 0)
                (const_int -128 [0xffffffffffffff80])))
        (clobber (reg:CC 17 flags))
    ])
allowing combination of insns 9 and 10
original costs 4 + 4 = 8
replacement cost 4
deferring deletion of insn with uid = 9.
modifying insn i3    10: {r90:QI=r91:SI#0&0xffffffffffffff80;clobber flags:CC;}
      REG_DEAD r91:SI
      REG_UNUSED flags:CC
deferring rescan insn with uid = 10.

If we had wrote the testcase like:
int f(signed char a)
{
  return (a < 0) << 7;
}

GCC does optimize it to:
(int) NON_LVALUE_EXPR <a> & 128

Reply via email to