[torqueusers] reducing energy usage of torque
Dr. Stephan Raub
raub at uni-duesseldorf.de
Tue Feb 28 07:21:30 MST 2012
all of our nodes (compute nodes and service nodes) are equipped with
IPMI-capable BMCs (Baseboard Management Controller) so that we can control
all aspects of power (including measuring the current power consumption,
turning it on or off, power cycles, etc
) from the batch server just by
using the ipmitools-package.
We have used this for controlling nodes within a torque/maui in the context
of a bachelor project. But our clusters is so busy all the time, that we
could not find a dramatic reduction of the over-all power consumption of the
cluster (including water cooling of the racks).
| | Dr. rer. nat. Stephan Raub
| | Dipl. Chem.
| | High-Performance-Computing
| | Zentrum für Informations- und Medientechnologie
| | Heinrich-Heine-Universität Düsseldorf
| | Universitätsstr. 1 / Raum 25.41.O2.25-2
| | 40225 Düsseldorf / Germany
| | Tel: +49-211-811-3911
| | Fax: +49-211-811-2539
Wichtiger Hinweis: Diese E-Mail kann Betriebs- oder Geschäftsgeheimnisse,
sonstige vertrauliche Informationen enthalten. Sollten Sie diese E-Mail
irrtümlich erhalten haben, ist Ihnen eine Kenntnisnahme des Inhalts, eine
Vervielfältigung oder Weitergabe der E-Mail ausdrücklich untersagt. Bitte
benachrichtigen Sie uns und vernichten Sie die empfangene E-Mail. Vielen
Important Note: This e-mail may contain trade secrets or privileged,
undisclosed or otherwise confidential information. If you have received this
e-mail in error, you are hereby notified that any review, copying or
distribution of it is strictly prohibited. Please inform us immediately and
destroy the original transmittal. Thank you for your cooperation.
Von: torqueusers-bounces at supercluster.org
[mailto:torqueusers-bounces at supercluster.org] Im Auftrag von Ryan Golhar
Gesendet: Dienstag, 28. Februar 2012 15:08
An: Torque Users Mailing List
Betreff: Re: [torqueusers] reducing energy usage of torque
What about cycling the power using a PDU?
On Tue, Feb 28, 2012 at 2:43 AM, Daniel Fernando Coimbra
<danielfcoimbra at gmail.com> wrote:
I assume that by "turning off" you mean actually power down the node. I
am just curious on how do you intend to power it up again later. I
suppose you could use something like WakeUp on Lan, but I never actually
got to test this kind of thing and don't know how it would behave on a
high traffic network (I suppose the network card doesn't keep it's IP
once it's in such state).
On 02/26/2012 08:24 PM, Arka Aloke Bhattacharya wrote:
> Hi everyone,
> I am a PhD student at UC Berkeley, and I wanted to add a "turning off
> idle/underutilized servers" feature to our 100 server torque+maui
> deployment. However, I want to implement this feature using only
> existing torque+ maui interfaces and extensions ( i,e _without
> modifying_ the torque or maui source code in any way ).
> My proposed way is to
> 1. monitor the maui queue length , and estimate the number of servers
> I can switch off.
> 2. I would then use "pbsnodes -o <nodename>" command to render a
> certain number of servers offline for scheduling.
> 3. A bash script would turn the servers off.
> The servers would be turned back on (and added to the torque nodes
> list) when the queue length increases beyond a certain threshold.
> I had two questions :
> 1. Is there any existing open source code which already implements the
> "turning off idle servers" functionality in torque ?
> 2. Are there complications that would arise if I implemented the
> "turning-off idle servers" feature in my proposed way ? [ e.g - Is it
> possible that after turning off servers, they would lose some state
> and hence would not get added to the torque <nodes_list> when turned
> back on? Are there long lived TCP connections which need to be
> restarted separately ? , etc ]
> It would be great if anyone could help.
> Thanks a lot,
> torqueusers mailing list
> torqueusers at supercluster.org
torqueusers mailing list
torqueusers at supercluster.org
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the torqueusers