Em Fri, Jun 16, 2017 at 01:16:55PM -0300, Arnaldo Carvalho de Melo escreveu:
> Em Wed, Jun 14, 2017 at 10:53:39AM +0800, Jin Yao escreveu:
> > Macro fusion merges two instructions to a single micro-op. Intel
> > core platform performs this hardware optimization under limited
> > circumstances. For example, CMP + JCC can be "fused" and executed
> > /retired together. While with sampling this can result in the
> > sample sometimes being on the JCC and sometimes on the CMP.
> > So for the fused instruction pair, they could be considered
> > together.
> > 
> > In general, the fused instruction pairs are:
> > 
> > cmp/test/add/sub/and/inc/dec + jcc.
> > 
> > This patch series marks the case clearly by joining the fused
> > instruction pair in the arrow of the jump.
> > 
> > For example:
> > 
> >        │   ┌──cmpl   $0x0,argp_program_version_hook
> >  81.93 │   │──je     20
> >        │   │  lock   cmpxchg %esi,0x38a9a4(%rip)
> >        │   │↓ jne    29
> >        │   │↓ jmp    43
> >  11.47 │20:└─→cmpxch %esi,0x38a999(%rip)
> 
> Try to have these example outputs in the changesets, not just in the
> patch series header.

Ok, I went trigger happy, sorry, it is in the second patch, I had looked
just at the first :-\

- Arnaldo
 
> - Arnaldo
>  
> > Jin Yao (2):
> >   perf report: Check for fused instruction pair
> >   perf report: Implement visual marker for macro fusion in annotate
> > 
> >  tools/perf/arch/x86/util/Build    |  1 +
> >  tools/perf/arch/x86/util/fused.c  | 20 ++++++++++++++++++++
> >  tools/perf/ui/browser.c           | 27 +++++++++++++++++++++++++++
> >  tools/perf/ui/browser.h           |  2 ++
> >  tools/perf/ui/browsers/annotate.c | 30 ++++++++++++++++++++++++++++++
> >  tools/perf/util/Build             |  1 +
> >  tools/perf/util/annotate.c        |  5 +++++
> >  tools/perf/util/annotate.h        |  1 +
> >  tools/perf/util/fused.c           | 11 +++++++++++
> >  tools/perf/util/fused.h           |  8 ++++++++
> >  10 files changed, 106 insertions(+)
> >  create mode 100644 tools/perf/arch/x86/util/fused.c
> >  create mode 100644 tools/perf/util/fused.c
> >  create mode 100644 tools/perf/util/fused.h
> > 
> > -- 
> > 2.7.4

Reply via email to