[torqueusers] Mom spool filesystem full error even though the spool is not full
chindw at wfu.edu
Tue Jun 11 09:01:58 MDT 2013
I have an odd error that popped up this morning. About 8 nodes suddenly
came up with "torque spool filesystem full" even though the spool
filesystem (4 GiB) was completely empty, and there had not been any jobs at
all for > 12 hours prior to the error appearing.
We have $logevent set to 511, and there is the every-5-minute timestamp in
the log with the torque version number, but nothing else for the >12 hours
preceding the appearance of the error.
Restarting pbs_mom clears the problem.
We are running Torque 2.5.12 on RHEL6.
Any hints as to where to look next?
Thanks in advance,
David Chin, Ph.D.
chindw at wfu.edu High Performance Computing Systems Analyst
Office: +1.336.758.2964 Wake Forest University
Mobile: +1.336.608.0793 Winston-Salem, NC
Email-to-txt: 3366080793 at mms.att.net Google Talk: chindw at wfu.edu
Web: http://users.wfu.edu/chindw/ http://linuxfollies.blogspot.com/
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the torqueusers