[torqueusers] pbs_mom + NFS problems starting

scoggins jscoggins at lbl.gov
Wed Nov 15 16:24:33 MST 2006


I am trying to start pbs_mom with the SPOOLDIR NFS mounted.  I have  
include a -d option poiniting to a different location for each node  
and I am constantly getting this error:

pbs_mom -d /var/spool/torque/node0000

returns the error:

    -bash-3.00# pwd
/var/spool/torque/node0000/mom_priv
-bash-3.00# -bash-3.00# ls -l
total 12
-rw-r--r--  1 root root  139 Jul 15 00:39 config
-rw-r--r--  1 root root  139 Jul 15 00:28 config.save
drwxr-x--x  2 root root 4096 May  5  2006 jobs
-bash-3.00# pbs_mom -d /var/spool/torque/node0000
pbs_mom: No locks available (37) in pbs_mom, cannot lock '/var/spool/ 
torque/node0000/mom_priv/mom.lock' - another mom running
cannot lock '/var/spool/torque/node0000/mom_priv/mom.lock' - another  
mom running

-bash-3.00# df -k | grep torqu
10.0.0.1:/var/spool/torque
                        9851328    796832   8554080   9% /var/spool/ 
torque


Node and master are running torque-2.0.0p4 which I know I need to  
update.  But I could never get this one to work on this cluster  
only.  All other clusters worked fine.


  Does anyone know what could be the problem?

Thanks

Jackie



More information about the torqueusers mailing list