On Fri, Jul 10, 2015 at 7:58 PM, H.J. Lu <hjl.to...@gmail.com> wrote: > On Fri, Jul 10, 2015 at 10:21 AM, Uros Bizjak <ubiz...@gmail.com> wrote: >> On Fri, Jul 10, 2015 at 7:10 PM, H.J. Lu <hjl.to...@gmail.com> wrote: >>> On Fri, Jul 10, 2015 at 9:30 AM, Uros Bizjak <ubiz...@gmail.com> wrote: >>>> On Thu, Jul 9, 2015 at 12:54 PM, H.J. Lu <hjl.to...@gmail.com> wrote: >>>>> Indirect sibcall with register arguments is OK when there is register >>>>> available for argument passing. >>>>> >>>>> OK for trunk if there is no regression? >>>>> >>>>> >>>>> H.J. >>>>> --- >>>>> gcc/ >>>>> >>>>> PR target/66819 >>>>> * config/i386/i386.c (ix86_function_ok_for_sibcall): Allow >>>>> indirect sibcall with register arguments if register available >>>>> for argument passing. >>>>> (init_cumulative_args): Set cfun->machine->arg_reg_available_p >>>>> to cum->nregs != 0. >> >> Please update the above entry for nregs > 0. >> >>>>> (function_arg_advance_32): Set cfun->machine->arg_reg_available_p >>>>> to 0 when setting cum->nregs = 0. >>>> >>>> Do we also need similar functionality for 64bit ABIs? What happens if >>>> we are out of argument regs there? >>> >>> 64-bit is OK since we have rax, r10 and r11 as scratch registers which >>> aren't used to pass arguments. >> >> Maybe this fact should be added as a comment in some appropriate place. >> >>>>> * config/i386/i386.h (machine_function): Add arg_reg_available_p. >>>>> >>>>> gcc/testsuite/ >>>>> >>>>> PR target/66819 >>>>> * gcc.target/i386/pr66819-1.c: New test. >>>>> * gcc.target/i386/pr66819-2.c: Likewise. >>>>> * gcc.target/i386/pr66819-3.c: Likewise. >>>>> * gcc.target/i386/pr66819-4.c: Likewise. >>>>> * gcc.target/i386/pr66819-5.c: Likewise. >>>>> --- >>>>> gcc/config/i386/i386.c | 15 +++++++++------ >>>>> gcc/config/i386/i386.h | 3 +++ >>>>> gcc/testsuite/gcc.target/i386/pr66819-1.c | 8 ++++++++ >>>>> gcc/testsuite/gcc.target/i386/pr66819-2.c | 8 ++++++++ >>>>> gcc/testsuite/gcc.target/i386/pr66819-3.c | 10 ++++++++++ >>>>> gcc/testsuite/gcc.target/i386/pr66819-4.c | 12 ++++++++++++ >>>>> gcc/testsuite/gcc.target/i386/pr66819-5.c | 10 ++++++++++ >>>>> 7 files changed, 60 insertions(+), 6 deletions(-) >>>>> create mode 100644 gcc/testsuite/gcc.target/i386/pr66819-1.c >>>>> create mode 100644 gcc/testsuite/gcc.target/i386/pr66819-2.c >>>>> create mode 100644 gcc/testsuite/gcc.target/i386/pr66819-3.c >>>>> create mode 100644 gcc/testsuite/gcc.target/i386/pr66819-4.c >>>>> create mode 100644 gcc/testsuite/gcc.target/i386/pr66819-5.c >>>>> >>>>> diff --git a/gcc/config/i386/i386.c b/gcc/config/i386/i386.c >>>>> index 54ee6f3..85e59a8 100644 >>>>> --- a/gcc/config/i386/i386.c >>>>> +++ b/gcc/config/i386/i386.c >>>>> @@ -5628,12 +5628,12 @@ ix86_function_ok_for_sibcall (tree decl, tree exp) >>>>> if (!decl >>>>> || (TARGET_DLLIMPORT_DECL_ATTRIBUTES && DECL_DLLIMPORT_P >>>>> (decl))) >>>>> { >>>>> - if (ix86_function_regparm (type, NULL) >= 3) >>>>> - { >>>>> - /* ??? Need to count the actual number of registers to be >>>>> used, >>>>> - not the possible number of registers. Fix later. */ >>>>> - return false; >>>>> - } >>>>> + /* FIXME: The symbol indirect call doesn't need a >>>>> + call-clobbered register. But we don't know if >>>>> + this is a symbol indirect call or not here. */ >>>>> + if (ix86_function_regparm (type, NULL) >= 3 >>>>> + && !cfun->machine->arg_reg_available_p) >>>> >>>> Isn't enough to look at arg_reg_available here? >>> >>> We need to check ix86_function_regparm since nregs is 0 if >>> -mregparm=N isn't used and pr65753.c will fail. >> >> OK. Please add this comment, is not that obvious. >> >>> >>>>> + return false; >>>>> } >>>>> } >>>>> >>>>> @@ -6567,6 +6567,7 @@ init_cumulative_args (CUMULATIVE_ARGS *cum, /* >>>>> Argument info to initialize */ >>>>> ? X86_64_REGPARM_MAX >>>>> : X86_64_MS_REGPARM_MAX); >>>>> } >>>>> + cfun->machine->arg_reg_available_p = cum->nregs != 0; >>>> >>>> false instead of 0. This is a boolean. >>> >>> Updated. >>> >>>>> if (TARGET_SSE) >>>>> { >>>>> cum->sse_nregs = SSE_REGPARM_MAX; >>>>> @@ -6636,6 +6637,7 @@ init_cumulative_args (CUMULATIVE_ARGS *cum, /* >>>>> Argument info to initialize */ >>>>> else >>>>> cum->nregs = ix86_function_regparm (fntype, fndecl); >>>>> } >>>>> + cfun->machine->arg_reg_available_p = cum->nregs != 0; >>>> >>>> IMO, cum->nregs > 0 would be more descriptive. >>> >>> Updated. >>> >>>>> /* Set up the number of SSE registers used for passing SFmode >>>>> and DFmode arguments. Warn for mismatching ABI. */ >>>>> @@ -7584,6 +7586,7 @@ pass_in_reg: >>>>> { >>>>> cum->nregs = 0; >>>>> cum->regno = 0; >>>>> + cfun->machine->arg_reg_available_p = 0; >>>>> } >>>>> break; >>>>> >>>>> diff --git a/gcc/config/i386/i386.h b/gcc/config/i386/i386.h >>>>> index 74334ff..0b6e304 100644 >>>>> --- a/gcc/config/i386/i386.h >>>>> +++ b/gcc/config/i386/i386.h >>>>> @@ -2479,6 +2479,9 @@ struct GTY(()) machine_function { >>>>> /* If true, it is safe to not save/restore DRAP register. */ >>>>> BOOL_BITFIELD no_drap_save_restore : 1; >>>>> >>>>> + /* If true, there is register available for argument passing. */ >>>>> + BOOL_BITFIELD arg_reg_available_p : 1; >>>> >>>> This is not a predicate, but a boolean flag. Please remove _p from the >>>> name. >>> >>> Updated. >>> >>> Here is the updated patch. OK for trunk? >> >> OK with a small comment additions. >> >> + /* If true, there is register available for argument passing. */ >> + BOOL_BITFIELD arg_reg_available : 1; >> + >> >> Please mention here that this is for 32bit targets only. >> > > Updated. Is this one OK?
LGTM. OK for mainline. Thanks, Uros.