Prentice:

What cpu or cpus, amount of memory and how many hard drives are on a single 
node.  I may have the power draw for that configuration.

Larry




-----Original Message-----
From: Beowulf [mailto:beowulf-boun...@beowulf.org] On Behalf Of 
beowulf-requ...@beowulf.org
Sent: Monday, July 28, 2014 12:00 PM
To: beowulf@beowulf.org
Subject: Beowulf Digest, Vol 125, Issue 13

Send Beowulf mailing list submissions to
        beowulf@beowulf.org

To subscribe or unsubscribe via the World Wide Web, visit
        http://www.beowulf.org/mailman/listinfo/beowulf
or, via email, send a message with subject or body 'help' to
        beowulf-requ...@beowulf.org

You can reach the person managing the list at
        beowulf-ow...@beowulf.org

When replying, please edit your Subject line so it is more specific than "Re: 
Contents of Beowulf digest..."


Today's Topics:

   1. Power draw of cluster nodes under heavy load (Prentice Bisbal)
   2. Re: Power draw of cluster nodes under heavy load (Jeff White)
   3. Re: Power draw of cluster nodes under heavy load
      (Michael Di Domenico)
   4. Re: Power draw of cluster nodes under heavy load (Mark Hahn)
   5. Re: Power draw of cluster nodes under heavy load (Prentice Bisbal)
   6. Re: Power draw of cluster nodes under heavy load (Prentice Bisbal)


----------------------------------------------------------------------

Message: 1
Date: Mon, 28 Jul 2014 10:51:12 -0400
From: Prentice Bisbal <prentice.bis...@rutgers.edu>
To: "beowulf@beowulf.org" <beowulf@beowulf.org>
Subject: [Beowulf] Power draw of cluster nodes under heavy load
Message-ID: <53d66360.1050...@rutgers.edu>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

Beowulfers,

Are any of you monitoring the power draw on your clusters? If so, can any of 
you provide me with some statistics on your power draw under heavy load? 
Ideally, I'm looking for the power load for a worst-case scenario, such as 
running HPL, on a per-rack basis. If you can provide me with the power draw and 
a description of the hardware, that would be great.

I have some numbers from a friend who lurks on this list, but the more data 
points I have, the better I can justify my power requirements for a new cluster 
purchase I'm working on.

--
Prentice



------------------------------

Message: 2
Date: Mon, 28 Jul 2014 13:29:35 -0400
From: Jeff White <jaw...@pitt.edu>
To: <beowulf@beowulf.org>
Subject: Re: [Beowulf] Power draw of cluster nodes under heavy load
Message-ID: <53d6887f.60...@pitt.edu>
Content-Type: text/plain; format=flowed; charset="ISO-8859-1"

Power draw will vary greatly depending on many factors.  Where I am at 
we currently have 16 racks of HPC equipment (compute nodes, storage, 
network gear, etc.) using about 140kVA but can use up to 160 kVA.  A 
single rack with 26 compute nodes each with 64 cores worth of AMD 6276 
(Supermicro boxes) is using about 18 kW across the PDUs, 3 phase at 240 
volts, with most of the nodes at 100% CPU usage.

Jeff White - GNU+Linux Systems Administrator
University of Pittsburgh - CSSD

On 07/28/2014 10:51 AM, Prentice Bisbal wrote:
> Beowulfers,
>
> Are any of you monitoring the power draw on your clusters? If so, can
> any of you provide me with some statistics on your power draw under
> heavy load? Ideally, I'm looking for the power load for a worst-case
> scenario, such as running HPL, on a per-rack basis. If you can provide
> me with the power draw and a description of the hardware, that would be
> great.
>
> I have some numbers from a friend who lurks on this list, but the more
> data points I have, the better I can justify my power requirements for a
> new cluster purchase I'm working on.
>


------------------------------

Message: 3
Date: Mon, 28 Jul 2014 13:38:21 -0400
From: Michael Di Domenico <mdidomeni...@gmail.com>
Cc: Beowulf Mailing List <beowulf@beowulf.org>
Subject: Re: [Beowulf] Power draw of cluster nodes under heavy load
Message-ID:
        <CABOsP2O7qNNHxC0nvUs=s22bkqfvbnfgrhahmht_ekp87oy...@mail.gmail.com>
Content-Type: text/plain; charset=ISO-8859-1

I can't go into specifics.  with HPL i usually can't top 80-85% of
full power draw per cabinet.  we have cabinets ranging from 17kw,
25kw, and 35kw.  some of our user codes can push the machines to 90%
of full load.  this is shown on both amd 4-socket nodes and intel
dual-socket with gpu's.  we're running 208v three phase to the racks
and 208v single phase to each server

On Mon, Jul 28, 2014 at 1:29 PM, Jeff White <jaw...@pitt.edu> wrote:
> Power draw will vary greatly depending on many factors.  Where I am at we
> currently have 16 racks of HPC equipment (compute nodes, storage, network
> gear, etc.) using about 140kVA but can use up to 160 kVA.  A single rack
> with 26 compute nodes each with 64 cores worth of AMD 6276 (Supermicro
> boxes) is using about 18 kW across the PDUs, 3 phase at 240 volts, with most
> of the nodes at 100% CPU usage.
>
> Jeff White - GNU+Linux Systems Administrator
> University of Pittsburgh - CSSD
>
>
> On 07/28/2014 10:51 AM, Prentice Bisbal wrote:
>>
>> Beowulfers,
>>
>> Are any of you monitoring the power draw on your clusters? If so, can
>> any of you provide me with some statistics on your power draw under
>> heavy load? Ideally, I'm looking for the power load for a worst-case
>> scenario, such as running HPL, on a per-rack basis. If you can provide
>> me with the power draw and a description of the hardware, that would be
>> great.
>>
>> I have some numbers from a friend who lurks on this list, but the more
>> data points I have, the better I can justify my power requirements for a
>> new cluster purchase I'm working on.
>>
> _______________________________________________
> Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
> To change your subscription (digest mode or unsubscribe) visit
> http://www.beowulf.org/mailman/listinfo/beowulf


------------------------------

Message: 4
Date: Mon, 28 Jul 2014 14:13:48 -0400 (EDT)
From: Mark Hahn <h...@mcmaster.ca>
To: Prentice Bisbal <prentice.bis...@rutgers.edu>
Cc: "beowulf@beowulf.org" <beowulf@beowulf.org>
Subject: Re: [Beowulf] Power draw of cluster nodes under heavy load
Message-ID:
        <alpine.lfd.2.02.1407281220230.30...@coffee.psychology.mcmaster.ca>
Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed

> Are any of you monitoring the power draw on your clusters? If so, can any of 
> you provide me with some statistics on your power draw under heavy load?

good question; it's something that deserves more attention and coverage.

ATM, I can only provide one non-answer:

http://www.sharcnet.ca/~hahn/saw-power-by-node.png

this is active mixed-user load (45 unrelated users, approximately 85%
CPU utilization due to memory scheduling and job layout constraints). 
this an older cluster, HP dual-socket E5440 (2.833G) whose IPMI happens to
return nice power measures.


> Ideally, I'm looking for the power load for a worst-case scenario, such as 
> running HPL, on a per-rack basis.

I don't understand the "per-rack" part - aren't you interested in per-node?


> I have some numbers from a friend who lurks on this list, but the more data 
> points I have, the better I can justify my power requirements for a new 
> cluster purchase I'm working on.

my experience is that vendors are useless in this regard: they always want
to quote the PSU max rating, and then often don't even use the number right.
(ie, put all the low-dissipation stuff like networking together, etc.)

has anyone tried to rate the accuracy of vendor power calculators?
at least a few years ago, they were absurdly inflated.

regards, mark hahn.


------------------------------

Message: 5
Date: Mon, 28 Jul 2014 14:53:05 -0400
From: Prentice Bisbal <prentice.bis...@rutgers.edu>
To: Mark Hahn <h...@mcmaster.ca>
Cc: "beowulf@beowulf.org" <beowulf@beowulf.org>
Subject: Re: [Beowulf] Power draw of cluster nodes under heavy load
Message-ID: <53d69c11.6000...@rutgers.edu>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed


On 07/28/2014 02:13 PM, Mark Hahn wrote:
>> Are any of you monitoring the power draw on your clusters? If so, can 
>> any of you provide me with some statistics on your power draw under 
>> heavy load?
>
> good question; it's something that deserves more attention and coverage.
>
> ATM, I can only provide one non-answer:
>
> http://www.sharcnet.ca/~hahn/saw-power-by-node.png
>
> this is active mixed-user load (45 unrelated users, approximately 85%
> CPU utilization due to memory scheduling and job layout constraints). 
> this an older cluster, HP dual-socket E5440 (2.833G) whose IPMI 
> happens to
> return nice power measures.

Thanks. That image is more helpful than you think - I didn't even think 
of using IPMI to report power consumption. Using that, I could run HPL 
on some nodes here and get measurements.
>
>
>> Ideally, I'm looking for the power load for a worst-case scenario, 
>> such as running HPL, on a per-rack basis.
>
> I don't understand the "per-rack" part - aren't you interested in 
> per-node?

Ideally, per-node is even better, but I figured most measurements would 
be at the PDU or circuit level, with one or two PDUs/Circuits per rack. 
I figured this is the granularity most people are measuring at, which is 
why I asked that way.
>
>
>> I have some numbers from a friend who lurks on this list, but the 
>> more data points I have, the better I can justify my power 
>> requirements for a new cluster purchase I'm working on.
>
> my experience is that vendors are useless in this regard: they always 
> want
> to quote the PSU max rating, and then often don't even use the number 
> right.
> (ie, put all the low-dissipation stuff like networking together, etc.)
>
> has anyone tried to rate the accuracy of vendor power calculators?
> at least a few years ago, they were absurdly inflated.

This is why I'm asking for actual, measured numbers. I read a whitepaper 
by APC or Raritan that said that if you go with the nameplate on a PDU, 
you can oversize your power requirements by a factor of 2x. For HPC, I 
imagine it wouldn't be that extreme, since cluster nodes tend to be at 
100% more of the time and therefore use more power. One vendor said they 
assume 60% - 90% of nameplate ratings when estimating power needs, which 
is still a pretty broad range.
>
> regards, mark hahn.



------------------------------

Message: 6
Date: Mon, 28 Jul 2014 14:55:38 -0400
From: Prentice Bisbal <prentice.bis...@rutgers.edu>
To: beowulf@beowulf.org
Subject: Re: [Beowulf] Power draw of cluster nodes under heavy load
Message-ID: <53d69caa.20...@rutgers.edu>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

On 07/28/2014 01:29 PM, Jeff White wrote:
> Power draw will vary greatly depending on many factors.  Where I am at 
> we currently have 16 racks of HPC equipment (compute nodes, storage, 
> network gear, etc.) using about 140kVA but can use up to 160 kVA.  A 
> single rack with 26 compute nodes each with 64 cores worth of AMD 6276 
> (Supermicro boxes) is using about 18 kW across the PDUs, 3 phase at 
> 240 volts, with most of the nodes at 100% CPU usage.

Agreed there's a lot of variability. Since I don't exactly what's going 
in my new space yet, I'm looking for everyone's input to come up with an 
average, or ballpark amount. the 5 - 10 kW one vendor specified seems 
waaaay too low for a rack of high-density HPC nodes running at or near 
100% utilization.
>
> Jeff White - GNU+Linux Systems Administrator
> University of Pittsburgh - CSSD
>
> On 07/28/2014 10:51 AM, Prentice Bisbal wrote:
>> Beowulfers,
>>
>> Are any of you monitoring the power draw on your clusters? If so, can
>> any of you provide me with some statistics on your power draw under
>> heavy load? Ideally, I'm looking for the power load for a worst-case
>> scenario, such as running HPL, on a per-rack basis. If you can provide
>> me with the power draw and a description of the hardware, that would be
>> great.
>>
>> I have some numbers from a friend who lurks on this list, but the more
>> data points I have, the better I can justify my power requirements for a
>> new cluster purchase I'm working on.
>>
> _______________________________________________
> Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
> To change your subscription (digest mode or unsubscribe) visit 
> http://www.beowulf.org/mailman/listinfo/beowulf



------------------------------

Subject: Digest Footer

_______________________________________________
Beowulf mailing list
Beowulf@beowulf.org
http://www.beowulf.org/mailman/listinfo/beowulf


------------------------------

End of Beowulf Digest, Vol 125, Issue 13
****************************************
_______________________________________________
Beowulf mailing list, Beowulf@beowulf.org sponsored by Penguin Computing
To change your subscription (digest mode or unsubscribe) visit 
http://www.beowulf.org/mailman/listinfo/beowulf

Reply via email to