[torqueusers] Torque/Maui kills jobs running on the same node
evgeni.bezus at gmail.com
Thu Feb 18 23:45:43 MST 2010
We are running Maui and Torque on a 14-node cluster. Each node has 8 cores
(2 4-core processors). When running two (or more) jobs from a single
user on the same node, Maui(or Torque?) stops all the jobs when one of them is
finished. The finished job has Exit_status=0, killed jobs -
Exit_status=271. The value of the NODEACCESSPOLICY parameter in
maui.cfg is SHARED. This problem does not occur when running jobs from
a single user on different nodes or when running jobs from different
users on the same node.
Does anyone know how to resolve the problem?
More information about the torqueusers