[Mauiusers] RM Failure - MOM rejected

Wightman wightman at clusterresources.com
Tue Mar 14 08:36:23 MST 2006


This problem is probably better addressed on the torqueusers list as it
is torque that is failing to start the job not maui.  Be sure and tell
them what version of torque you are using.

- Douglas

On Tue, 2006-03-14 at 05:32 -0800, Gaurav Chopra wrote:
> Hi
> 
> I submitted this job on the cluster and the job is deferred. Using 
> tracejob I get:
> 
> 03/14/2006 05:06:17  S    unable to run job, MOM rejected/rc=1
> _
> Using checkjob $PBS_ID_
> StartDate: -00:06:36  Tue Mar 14 05:06:18
> Total Tasks: 1
> 
> Req[0]  TaskCount: 1  Partition: ALL
> Network: [NONE]  Memory >= 0  Disk >= 0  Swap >= 0
> Opsys: [NONE]  Arch: [NONE]  Features: [NONE]
> 
> 
> IWD: [NONE]  Executable:  [NONE]
> Bypass: 0  StartCount: 2
> PartitionMask: [ALL]
> Flags:       RESTARTABLE
> 
> job is deferred.  Reason:  RMFailure  (cannot start job - RM failure, 
> rc: 15041, msg: 'Execution server rejected request MSG=send failed, 
> STARTING')
> Holds:    Defer  (hold reason:  RMFailure)
> PE:  1.00  StartPriority:  1
> cannot select job 99950 for partition DEFAULT (job hold active)
> 
> Please advice
> 
> Gaurav
> _______________________________________________
> mauiusers mailing list
> mauiusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/mauiusers



More information about the mauiusers mailing list