[torqueusers] job dying immediately, 0 byte output file being produced
garrick at usc.edu
Tue Feb 23 10:13:36 MST 2010
On Tue, Feb 23, 2010 at 10:28:29AM -0600, Sabuj Pattanayek alleged:
> On Tue, Feb 23, 2010 at 10:03 AM, Garrick <garrick at usc.edu> wrote:
> > Check syslog on the node?
> Nothing showing any errors, the drives are not out of space on the
> server or the node.
> Btw, jobs that can't currently run are being queued, because somehow
> jobs that were running are still running. If a job can run, it is
> basically terminated immediately.
So something simple broke: ntp out of sync, passwd/group out of sync, homedirs
not mounting, etc.
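
A rough way to check those suspects on a node is sketched below. This is only an illustration, not a TORQUE tool: the helper names `check_account` and `clock_skew_ok` and the 5-second tolerance are assumptions made up for this example. It looks up the job owner's passwd and group entries and tests whether the home directory is actually present, and it compares two epoch timestamps that you would collect yourself from the server and the node.

```python
import grp
import os
import pwd


def check_account(username):
    """Return a list of problems found for `username` on this host.

    Hypothetical helper: an empty list means the passwd entry, the
    primary group, and the home directory all look usable here.
    """
    try:
        entry = pwd.getpwnam(username)
    except KeyError:
        # The user does not resolve at all (passwd not in sync).
        return [f"no passwd entry for {username!r}"]

    problems = []
    try:
        grp.getgrgid(entry.pw_gid)
    except KeyError:
        # The primary gid has no matching group (group not in sync).
        problems.append(f"primary gid {entry.pw_gid} has no group entry")

    if not os.path.isdir(entry.pw_dir):
        # A missing home directory often means it failed to mount.
        problems.append(f"home directory {entry.pw_dir} is missing or not mounted")

    return problems


def clock_skew_ok(local_epoch, remote_epoch, tolerance_s=5.0):
    """Return True if two epoch timestamps differ by at most tolerance_s.

    The 5-second default is an arbitrary assumption for this sketch.
    """
    return abs(local_epoch - remote_epoch) <= tolerance_s
```

Run `check_account("someuser")` on both the server and the node and compare the results; any difference between the two hosts points at the piece that is out of sync.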
Garrick Staples, GNU/Linux HPCC SysAdmin
University of Southern California
Life is Good!