[torqueusers] Problem with job starts on Linux

Garrick Staples garrick at usc.edu
Thu Jun 28 17:12:40 MDT 2007


On Thu, Jun 28, 2007 at 11:49:53AM -0400, Chad Vizino alleged:
> 06/28/2007 11:39:22;0001;   pbs_mom;Job;6.heidi;phase 2 of job launch 
> successfully completed
> 06/28/2007 11:39:22;0001;   pbs_mom;Job;TMomFinalizeJob3;read start 
> return code=-2 session=26752
> 06/28/2007 11:39:22;0001;   pbs_mom;Job;TMomFinalizeJob3;job not 
> started, Failure job exec failure, after files staged, no retry

This means pbs_mom wasn't able to set up the child process that actually
starts executing the job.  There are a variety of possible reasons, most of
which should be sent to syslog (the child process isn't allowed access to the
MOM logs), so check /var/log/messages.
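
If the syslog is busy, a small filter can narrow things down.  This is only
a rough sketch: it assumes the relevant messages contain the string
"pbs_mom" somewhere on the line (how your syslog daemon tags them may
differ), and the function name and its parameters are made up for
illustration, not part of TORQUE.

```python
def find_mom_lines(lines, tag="pbs_mom", job_id=None):
    """Return the lines that mention `tag` (and `job_id`, if one is given).

    `lines` is any iterable of strings, e.g. an open syslog file.
    Matching on the plain substring "pbs_mom" is an assumption about
    how the messages are tagged on a given system.
    """
    hits = []
    for line in lines:
        if tag not in line:
            continue  # not a pbs_mom message
        if job_id is not None and job_id not in line:
            continue  # pbs_mom message, but for a different job
        hits.append(line.rstrip("\n"))
    return hits
```

Something like find_mom_lines(open("/var/log/messages"), job_id="6.heidi")
would then list only the entries for that job; reading that file usually
requires root.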

-- 
Garrick Staples, GNU/Linux HPCC SysAdmin
University of Southern California

Please avoid sending me Word or PowerPoint attachments.
See http://www.gnu.org/philosophy/no-word-attachments.html
09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0