On 07/28/2014 02:13 PM, Mark Hahn wrote:
Are any of you monitoring the power draw on your clusters? If so, can
any of you provide me with some statistics on your power draw under
heavy load?
good question; it's something that deserves more attention and coverage.
ATM, I can only provide one non-answer:
http://www.sharcnet.ca/~hahn/saw-power-by-node.png
this is active mixed-user load (45 unrelated users, approximately 85%
CPU utilization due to memory scheduling and job layout constraints).
this an older cluster, HP dual-socket E5440 (2.833G) whose IPMI
happens to
return nice power measures.
Thanks. That image is more helpful than you think - I didn't even think
of using IPMI to report power consumption. Using that, I could run HPL
on some nodes here and get measurements.
Ideally, I'm looking for the power load for a worst-case scenario,
such as running HPL, on a per-rack basis.
I don't understand the "per-rack" part - aren't you interested in
per-node?
Ideally, per-node is even better, but I figured most measurements would
be at the PDU or circuit level, with one or two PDUs/Circuits per rack.
I figured this is the granularity most people are measuring at, which is
why I asked that way.
I have some numbers from a friend who lurks on this list, but the
more data points I have, the better I can justify my power
requirements for a new cluster purchase I'm working on.
my experience is that vendors are useless in this regard: they always
want
to quote the PSU max rating, and then often don't even use the number
right.
(ie, put all the low-dissipation stuff like networking together, etc.)
has anyone tried to rate the accuracy of vendor power calculators?
at least a few years ago, they were absurdly inflated.
This is why I'm asking for actual, measured numbers. I read a whitepaper
by APC or Raritan that said that if you go with the nameplate on a PDU,
you can oversize your power requirements by a factor of 2x. For HPC, I
imagine it wouldn't be that extreme, since cluster nodes tend to be at
100% more of the time and therefore use more power. One vendor said they
assume 60% - 90% of nameplate ratings when estimating power needs, which
is still a pretty broad range.
regards, mark hahn.
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit
http://www.beowulf.org/mailman/listinfo/beowulf