[torqueusers] Sometimes jobs exit immediately without messages

Garrick Staples garrick at usc.edu
Mon Jul 16 14:37:18 MDT 2007


On Mon, Jul 16, 2007 at 12:15:30PM +0000, Hans Meier alleged:
> I have a strange problem: Most of the time, the users can submit jobs
> and everything is working perfectly OK. However, from time to time the
> cluster does not execute the jobs. The user submits a job with "qsub",
> and it exits so quickly that it doesn't even show up in the list
> displayed by "qstat". It should be in the queue because "qsub" answers
> with the number of the job. There are no error files generated.I hope
> you can help me solve that problem, because this was the reason why I
> upgraded to the latest version of Torque and thought that this
> behavior would disappear :-(.Hans

Depending on exactly where the error occured, TORQUE should syslog
errors on the executing node, or send an email to the submitter.


-- 
Garrick Staples, GNU/Linux HPCC SysAdmin
University of Southern California

Please avoid sending me Word or PowerPoint attachments.
See http://www.gnu.org/philosophy/no-word-attachments.html
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20070716/884db9a7/attachment.bin


More information about the torqueusers mailing list