[torqueusers] RM Failure

Bobby Brown bobby.brown at vanderbilt.edu
Fri Mar 4 12:45:58 MST 2005


We started seeing jobs that are blocked when there are plenty of free 
nodes and a checkjob reveals:

Messages:  cannot start job - RM failure, rc: 15041, msg: 'MSG=send 
failed, JOB_SUBSTATE_RUNNING' PE:  1.00 StartPriority: 6234

Any ideas?

Thanks
Bobby



More information about the torqueusers mailing list