Hi All,

Forgot to attch the patch.

2012/9/18 Yuri Rumyantsev <ysrum...@gmail.com>:
> Hi All,
>
> This patch aims to fix all stability issues related to using the first
> scheduler in gcc
> for x86 target (there several reported issues related to this problem).
>
> Main idea of this activity is mostly to provide user a possibility to
> safely turn on first scheduler for his codes. In some cases this could
> positively affect performance, especially for in-order Atom.
>
> Below is short description of proposed changes.
>
> Main idea of this patch is to restrict code motion of instructions
> having likely spilled HW registers as their operands - in general
> all these instructions are unloading of incoming function argument
> in the function entry and passing of outgoing function arguments.
> This is essential for correct scheduling since live range of such insns
> is not defined before register allocation phase.
> This done through 2 hooks:
>
> 1. ix86_adjust_priority which sets up the priority of moves from
> likely spilled HW registers to maximum that allows us to schedule
> such insns as soon as possible, i.e. all moves correspondent to
> incoming function arguments will be scheduled at the top of
> function entry and moves correspondent to function return
> value will be scheduled immediately after call.
> 2. ix86_dependencies_evaluation_hook which insert additional
> dependencies for outgoing function arguments passed in likely
> spilled HW registers to avoid their code motion. This is done
> through the following steps:
> - scan the current schedule region to find all call instructions in
>   reverse order;
> - find out the first argument that passed in likely spilled HW
>   register. Starting from it insert output dependency between it and
>   possible previous argument (it does not matter if it passed in likely
>   spilled register or not). This is done in add_parameter_dependencies;
>   after it all arguments starting from the first one that is passed in
>   likely spilled register are pairwise connected through output
>   dependency.
> - add dependencies to the first function argument on
>   the rest of instructions in the current block (until next call)  to avoid
>   intra block code motion.
> - to avoid possible interblock code motion we also check if the first
>   argument has dependee in another blocks and if so insert dependency to
>   the last non-jump set instruction.
>
> This patch was deeply tested on Atom (eembc_2_0, spec2000 in base/peak mode)
>  and Big Core (spec2006 in base/peak mode).
> Also gcc full bootstrapping with turned on 1st scheduler by default was done.
>
> Tested for i386 and x86-64, ok for trunk?
>
> ChangeLog:
>
> 2012-09-18  Yuri Rumyantsev  <ysrum...@gmail.com>
>
>         * config/i386/i386.c (ix86_dep_by_shift_count_body) : Add
>         check on reload_completed since it can be invoked before
>         register allocation phase in 1st scheduler.
>         (ia32_multipass_dfa_lookahead) : Do not use dfa_lookahead for 1st
>         Scheduler to save compile time.
>         (ix86_sched_reorder) : Do not perform ready list reordering for 1st
>         Scheduler to save compile time.
>         (insn_is_function_arg) : New function. Returns true if lhs of insn is
>         HW function argument register.
>         (add_parameter_dependencies) : New function. Add output dependencies
>         for chain of function adjacent arguments if only there is a move to
>         likely spilled HW registers. Return first argument if at least one
>         dependence was added or NULL otherwise.
>         (avoid_func_arg_motion) : New function. Add output or anti dependency
>         from insn to first_arg to restrict code motion.
>         (add_dependee_for_func_arg) : New function. Avoid cross block motion 
> of
>         function argument through adding dependency from the first non-jump
>         insn in bb.
>         (ix86_dependencies_evaluation_hook) : New function. Hook for 
> schedule1:
>         avoid motion of function arguments passed in passed in likely spilled
>         HW registers.
>         (ix86_adjust_priority) : New function. Hook for schedule1: set 
> priority
>         of moves from likely spilled HW registers to maximum to schedule them
>         as soon as possible.
>         (ix86_sched_init_global): Do not perform multipass scheduling for 1st
>         Scheduler to save compile time.

Attachment: i386-fschedule-insn_for_x86.diff
Description: Binary data

Reply via email to