[torqueusers] pbs_mom + NFS problems starting

scoggins jscoggins at lbl.gov
Wed Nov 15 16:24:33 MST 2006

I am trying to start pbs_mom with the SPOOLDIR NFS mounted.  I have
included a -d option pointing to a different location for each node,
and I am constantly getting this error:

pbs_mom -d /var/spool/torque/node0000

returns the error:

    -bash-3.00# pwd
-bash-3.00# -bash-3.00# ls -l
total 12
-rw-r--r--  1 root root  139 Jul 15 00:39 config
-rw-r--r--  1 root root  139 Jul 15 00:28 config.save
drwxr-x--x  2 root root 4096 May  5  2006 jobs
-bash-3.00# pbs_mom -d /var/spool/torque/node0000
pbs_mom: No locks available (37) in pbs_mom, cannot lock '/var/spool/torque/node0000/mom_priv/mom.lock' - another mom running
cannot lock '/var/spool/torque/node0000/mom_priv/mom.lock' - another mom running

-bash-3.00# df -k | grep torqu
                        9851328    796832   8554080   9% /var/spool/ 
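
On Linux, errno 37 is ENOLCK ("No locks available"), so one way to
narrow this down is to check whether any process can take a file lock
on the NFS-mounted directory, independent of pbs_mom.  The following is
a minimal Python sketch, not part of TORQUE.  It assumes pbs_mom takes
an fcntl-style lock on mom.lock (an assumption, not verified against
the 2.0.0p4 source), and the file name locktest.tmp and the helper
try_lock are made up for illustration.

```python
import errno
import fcntl
import os
import sys


def try_lock(path):
    """Try a non-blocking exclusive fcntl lock on path.

    Returns (True, None) if the lock was taken and released,
    or (False, errno_value) if the lock call failed.
    """
    fd = os.open(path, os.O_RDWR | os.O_CREAT, 0o644)
    try:
        fcntl.lockf(fd, fcntl.LOCK_EX | fcntl.LOCK_NB)
    except OSError as exc:
        return False, exc.errno
    else:
        fcntl.lockf(fd, fcntl.LOCK_UN)
        return True, None
    finally:
        os.close(fd)


if __name__ == "__main__":
    # Hypothetical scratch file; pass a path on the NFS mount as argv[1].
    target = sys.argv[1] if len(sys.argv) > 1 else "locktest.tmp"
    ok, err = try_lock(target)
    if ok:
        print("lock acquired and released on", target)
    else:
        name = errno.errorcode.get(err, "?")
        print("lock failed on %s: %s (%s, %d)"
              % (target, os.strerror(err), name, err))
```

If the script reports the same ENOLCK failure for a scratch file under
the mount, the problem is more likely in the NFS locking setup on that
cluster than in pbs_mom itself.  If it succeeds, the cause is probably
elsewhere.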

Node and master are running torque-2.0.0p4, which I know I need to
update.  But I have never been able to get this to work on this one
cluster.  All other clusters worked fine.

  Does anyone know what could be the problem?



