[torqueusers] ha torque

Steve Snelgrove ssnelgrove at clusterresources.com
Wed Apr 9 09:49:50 MDT 2008


The 2.3 release of Torque supports HA by allowing two head node 
servers to access the server_priv files on a shared file system.  See 
http://www.clusterresources.com/torquedocs21/4.3high-availability.shtml 
for more details.
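
Since this setup depends on server_priv living on a shared file system, 
a quick sanity check on each head node can help before starting the 
servers. The following is only a rough sketch in Python, not part of 
Torque: the path /var/spool/torque/server_priv, the list of file system 
types treated as "shared", and the function names are all assumptions 
for illustration. It reads mount-table text in the /proc/mounts format 
and reports which file system type holds the directory.

```python
import os

# Assumption: file system types we treat as "shared"; adjust for your site.
SHARED_FS_TYPES = {"nfs", "nfs4", "gfs2", "ocfs2", "lustre"}


def fs_type_for(path, mounts_text):
    """Return the fs type of the longest mount point containing path."""
    path = os.path.normpath(path)
    best_mount, best_type = "", None
    for line in mounts_text.splitlines():
        fields = line.split()
        if len(fields) < 3:
            continue  # skip malformed lines
        mount_point, fs_type = os.path.normpath(fields[1]), fields[2]
        prefix = mount_point.rstrip("/") + "/"
        inside = path == mount_point or path.startswith(prefix)
        # ">=" lets a later mount on the same point shadow an earlier one.
        if inside and len(mount_point) >= len(best_mount):
            best_mount, best_type = mount_point, fs_type
    return best_type


def on_shared_fs(path, mounts_text):
    """True if path sits on one of the assumed shared file system types."""
    return fs_type_for(path, mounts_text) in SHARED_FS_TYPES


if __name__ == "__main__":
    # Hypothetical location; the real path depends on how Torque was built.
    server_priv = "/var/spool/torque/server_priv"
    with open("/proc/mounts") as f:
        print(server_priv, "on shared fs:", on_shared_fs(server_priv, f.read()))
```

Run it on both head nodes; if either reports a local file system type, 
the two servers are not looking at the same server_priv files.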


Daniel Bourque wrote:
> Hi,
>
>    We're planning on setting up a Torque/Maui cluster. I'm planning on 
> making the head node also a worker node, and a second worker node 
> a failover head node.
>
> My intent is to use heartbeat to control the state of torque, Maui and 
> a service IP.
>
> Is this possible?
>
> What files need to be kept in sync?
>
> If the head node fails, what happens to running jobs?
>
> If the head node fails, when Maui starts on the new head node, will it 
> query the pbs_mom daemons on the worker nodes to get usage info?
>
> Thanks
>


