[torqueusers] Problem with job starts on Linux
Garrick Staples
garrick at usc.edu
Thu Jun 28 17:12:40 MDT 2007
On Thu, Jun 28, 2007 at 11:49:53AM -0400, Chad Vizino alleged:
> 06/28/2007 11:39:22;0001; pbs_mom;Job;6.heidi;phase 2 of job launch
> successfully completed
> 06/28/2007 11:39:22;0001; pbs_mom;Job;TMomFinalizeJob3;read start
> return code=-2 session=26752
> 06/28/2007 11:39:22;0001; pbs_mom;Job;TMomFinalizeJob3;job not
> started, Failure job exec failure, after files staged, no retry
This means pbs_mom wasn't able to set up the child process to actually
start executing the job. There are a variety of reasons, most of which
should be sent to syslog (the child process isn't allowed access to the MOM
logs), so check /var/log/messages.
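
If /var/log/messages is busy, a small filter can help narrow it down. The
following is only a rough sketch: it assumes the relevant syslog entries
contain the string "pbs_mom" (the actual tag depends on your syslog setup),
and the function names are made up for illustration.

```python
def find_daemon_lines(lines, tag="pbs_mom"):
    """Return the lines that mention `tag`, without trailing newlines.

    Assumption: the entries of interest contain the literal string
    "pbs_mom"; adjust `tag` if your syslog labels them differently.
    """
    return [line.rstrip("\n") for line in lines if tag in line]


def scan_file(path="/var/log/messages", tag="pbs_mom"):
    """Print every line in `path` that mentions `tag`.

    Reading the system log usually requires root privileges.
    """
    # errors="replace" keeps stray non-UTF-8 bytes from aborting the scan.
    with open(path, errors="replace") as fh:
        for line in find_daemon_lines(fh, tag):
            print(line)
```

Calling scan_file() as root on the compute node where the job failed should
list the candidate entries; look at the ones whose timestamps match the
failed job start in the MOM log.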
--
Garrick Staples, GNU/Linux HPCC SysAdmin
University of Southern California
Please avoid sending me Word or PowerPoint attachments.
See http://www.gnu.org/philosophy/no-word-attachments.html
09 F9 11 02 9D 74 E3 5B D8 41 56 C5 63 56 88 C0