[torqueusers] Interaction with NFS caused by high job count
Kevin Van Workum
vanw at sabalcore.com
Thu Mar 17 10:51:13 MDT 2011
Has anybody ever noticed any problems with mounting NFS's on the machine
running pbs_server?
We've seen some issues when the pbs server machine tries to mount NFS shares
if we have a large number of running jobs (700-1000 jobs). The error is:
mount.nfs: input/output error
The error is inconsistent. Sometimes it works, other times not. I'm guessing
I have to many tcp connections open, but it seems like 1000 jobs shouldn't
cause a problem. Any ideas?
--
Kevin Van Workum, PhD
Sabalcore Computing Inc.
Run your code on 500 processors.
Sign up for a free trial account.
www.sabalcore.com
877-492-8027 ext. 11
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20110317/662368bd/attachment-0001.html
More information about the torqueusers
mailing list