[torqueusers] Torque/Maui kills jobs running on the same node

Evgeni Bezus evgeni.bezus at gmail.com
Thu Feb 18 23:45:43 MST 2010


Hi all,

We are running Maui and Torque on a 14-node cluster. Each node has 8 cores
(2 4-core processors). When running two (or more) jobs from a single
user on the same node, Maui(or Torque?) stops all the jobs when one of them is
finished. The finished job has Exit_status=0, killed jobs -
Exit_status=271. The value of the NODEACCESSPOLICY parameter in
maui.cfg is SHARED. This problem does not occur when running jobs from
a single user on different nodes or when running jobs from different
users on the same node.

Does anyone know how to resolve the problem?


More information about the torqueusers mailing list