https://gcc.gnu.org/bugzilla/show_bug.cgi?id=112697

--- Comment #7 from Martin Jambor <jamborm at gcc dot gnu.org> ---
Created attachment 56720
  --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=56720&action=edit
Perf annotate of milc built with r14-4972-g8aa47713701b1f

commit r14-4972-g8aa47713701b1f:

$ perf stat taskset -c 0 specinvoke

 Performance counter stats for 'taskset -c 0 specinvoke':

         272931.43 msec task-clock:u                     #    1.000 CPUs
utilized             
                 0      context-switches:u               #    0.000 /sec        
                 0      cpu-migrations:u                 #    0.000 /sec        
            472353      page-faults:u                    #    1.731 K/sec       
      886165387570      cycles:u                         #    3.247 GHz        
                (83.33%)
       31546898034      stalled-cycles-frontend:u        #    3.56% frontend
cycles idle        (83.33%)
      729878095777      stalled-cycles-backend:u         #   82.36% backend
cycles idle         (83.33%)
     1061779557370      instructions:u                   #    1.20  insn per
cycle            
                                                  #    0.69  stalled cycles per
insn     (83.33%)
       58797121078      branches:u                       #  215.428 M/sec      
                (83.33%)
           6960852      branch-misses:u                  #    0.01% of all
branches             (83.33%)

     272.967381843 seconds time elapsed

     268.718335000 seconds user
       4.212584000 seconds sys

$ perf record taskset -c 0 specinvoke
[ perf record: Woken up 167 times to write data ]
[ perf record: Captured and wrote 41.549 MB perf.data (1088982 samples) ]

$ perf report -n --percent-limit=1 --stdio
# To display the perf.data header info, please use --header/--header-only
options.
#
#
# Total Lost Samples: 0
#
# Samples: 1M of event 'cycles:Pu'
# Event count (approx.): 883903400858
#
# Overhead       Samples  Command          Shared Object           Symbol       
# ........  ............  ...............  ...................... 
......................................
#
    24.34%        260907  milc_base.mine-  milc_base.mine-lto-gen  [.]
add_force_to_mom
    18.01%        198287  milc_base.mine-  milc_base.mine-lto-gen  [.]
mult_su3_na
    17.45%        187529  milc_base.mine-  milc_base.mine-lto-gen  [.]
u_shift_fermion
    14.22%        155596  milc_base.mine-  milc_base.mine-lto-gen  [.]
mult_su3_nn
     5.61%         60601  milc_base.mine-  milc_base.mine-lto-gen  [.]
scalar_mult_add_su3_matrix
     4.35%         51034  milc_base.mine-  milc_base.mine-lto-gen  [.]
path_product
     4.24%         46032  milc_base.mine-  milc_base.mine-lto-gen  [.]
mult_su3_an
     2.99%         32624  milc_base.mine-  milc_base.mine-lto-gen  [.]
imp_gauge_force.constprop.0
     1.50%         16242  milc_base.mine-  milc_base.mine-lto-gen  [.]
compute_gen_staple
     1.35%         14580  milc_base.mine-  milc_base.mine-lto-gen  [.]
mult_su3_mat_vec_sum_4dir
     1.21%         12922  milc_base.mine-  milc_base.mine-lto-gen  [.]
make_anti_hermitian
     1.06%         11469  milc_base.mine-  milc_base.mine-lto-gen  [.]
mult_adj_su3_mat_4vec
     1.03%         11111  milc_base.mine-  libc.so.6               [.]
__memset_avx2_unaligned_erms


$ perf annotate -n --percent-limit=1 > ~/tmp/milc-perf-annotate-8aa47713701 
(gzipeped and attached)

Reply via email to