Earlier I posted about one of our blades running 30-50% slower than the others despite having identical hardware and OS. I followed the suggestions and compared CPU temperature, memory, dmesg and sysctl. Everything looks the same.

I then used "perf stat" to compare the speed of pigz (parallel gzip). The results are quite interesting. Using one CPU, the slow blade is as fast as the rest! But as I use more CPUs, the clock rate reported by perf decreases linearly from 3.1 GHz to 0.4 GHz. See the snippets from the "perf stat" command below. All tests were run on /tmp to rule out any NFS issue, and the same behavior is observed with any multi-threaded program.

Healthy blade 1 cpu:

Performance counter stats for './pigz -p 1 some200MBfile':

       6441.560969 task-clock                #    1.001 CPUs utilized
    21,230,248,729 cycles                    #    3.296 GHz
       6.435670580 seconds time elapsed

Slow blade 1 cpu:

 Performance counter stats for './pigz -p 1 some200MBfile':

       6857.933315 task-clock                #    1.001 CPUs utilized
    21,412,281,401 cycles                    #    3.122 GHz
       6.851644289 seconds time elapsed

Healthy blade 20 cpus:

Performance counter stats for './pigz -p 1 some200MBfile':

       7570.967306 task-clock                #   16.367 CPUs utilized
    21,913,797,346 cycles                    #    2.894 GHz
       0.462575439 seconds time elapsed

Slow blade 20 cpus:

 Performance counter stats for './pigz -p 1 some200MBfile':

      63404.802003 task-clock                #   19.524 CPUs utilized
    24,834,879,081 cycles                    #    0.392 GHz
       3.247597619 seconds time elapsed
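
For anyone who wants to repeat the comparison on their own nodes, here is a minimal sketch of how the GHz and "CPUs utilized" columns follow from the raw counters: GHz is cycles divided by task-clock, and CPUs utilized is task-clock divided by elapsed time. The function names and the regular expressions are my own and assume the text layout pasted above (newer perf versions print "msec task-clock", which the patterns also tolerate); the sample text is the slow blade's 20-cpu run.

```python
import re

# Sample copied from the slow blade's 20-cpu run above.
SLOW_20 = """
      63404.802003 task-clock                #   19.524 CPUs utilized
    24,834,879,081 cycles                    #    0.392 GHz
       3.247597619 seconds time elapsed
"""

# Hypothetical patterns, written for the perf stat layout shown in this post.
PATTERNS = {
    "task_clock_ms": r"([\d.,]+)\s+(?:msec\s+)?task-clock",
    "cycles": r"([\d.,]+)\s+cycles",
    "elapsed_s": r"([\d.,]+)\s+seconds time elapsed",
}


def parse_perf_stat(text):
    """Extract task-clock (ms), cycles and elapsed seconds from perf stat text."""
    values = {}
    for key, pattern in PATTERNS.items():
        match = re.search(pattern, text)
        if match is None:
            raise ValueError("counter not found: " + key)
        values[key] = float(match.group(1).replace(",", ""))
    return values


def effective_ghz(stats):
    """Average clock rate while the threads were on a CPU: cycles per ns."""
    return stats["cycles"] / (stats["task_clock_ms"] * 1e6)


def cpus_utilized(stats):
    """CPU time consumed divided by wall-clock time."""
    return stats["task_clock_ms"] / 1000.0 / stats["elapsed_s"]


if __name__ == "__main__":
    stats = parse_perf_stat(SLOW_20)
    print("effective GHz: %.3f" % effective_ghz(stats))
    print("CPUs utilized: %.3f" % cpus_utilized(stats))
```

Note that the cycle count barely changes between the runs (21-25 billion), so the work done is about the same; what grows on the slow blade is the task-clock time needed to get through those cycles.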



_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf