[torqueusers] ha torque
Daniel Bourque
dbourque at weatherdata.com
Tue Apr 8 10:29:53 MDT 2008
Hi,
We're planning on setting up a torque/Maui cluster. I'm planning on
making the head node also be worker nodes, and for a 2nd worker node to
be a failover headnode.
My intent is to use heartbeat to control the state of torque, Maui and a
service IP.
Is this possible ?
what files need to be kept in sync ?
if the headnode fails, what happens to running jobs ?
if the headnode fails, when Maui start on the new headnode, will it
query the pbs_mom daemons on the worker nodes to get usage info ?
Thanks
--
Daniel Bourque
Sr. Systems Engineer
WeatherData Service Inc
An Accuweather Company
Office (316) 266-8013
Office (316) 265-9127 ext. 3013
Mobile (316) 640-1024
More information about the torqueusers
mailing list