[torqueusers] Mom spool filesystem full error even though the spool is not full

David Chin chindw at wfu.edu
Tue Jun 11 09:01:58 MDT 2013


Hi everyone:

I have an odd error that popped up this morning. About 8 nodes suddenly
came up with "torque spool filesystem full" even though the spool
filesystem (4 GiB) was completely empty, and there had not been any jobs at
all for > 12 hours prior to the error appearing.

We have $logevent set to 511, and there is the every-5-minute timestamp in
the log with the torque version number, but nothing else for the >12 hours
preceding the appearance of the error.

Restarting pbs_mom clears the problem.

We are running Torque 2.5.12 on RHEL6.

Any hints as to where to look next?

Thanks in advance,
    Dave

--
David Chin, Ph.D.
chindw at wfu.edu                  High Performance Computing Systems Analyst
Office: +1.336.758.2964         Wake Forest University
Mobile: +1.336.608.0793         Winston-Salem, NC
Email-to-txt: 3366080793 at mms.att.net           Google Talk: chindw at wfu.edu
Web: http://users.wfu.edu/chindw/  http://linuxfollies.blogspot.com/
     https://plus.google.com/108169173177119739731/about
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20130611/de60b6d2/attachment.html 


More information about the torqueusers mailing list