Bugzilla – Bug 191
pbsdsh doesn't work if nodename differs from hostname.
Last modified: 2012-05-09 10:45:00 MDT
You need to log in before you can comment on or make changes to this bug.
If pbs_mom is started with a name that differs from the hostname pbsdsh stops working across multiple nodes. My setup has compute nodes with hostnames like compute-X-Y.local and node names in torque where the .local domain is dropped so the hostlist has entries like compute-X-Y. This used to work fine in torque 2 and 3, but on torque 4.0.1 this will make pbsdsh hang indefinitely (and any mpi launcher using libtm). Is it possible to have the old behaviour back? Regards, Roy.
Using the -A flag solved the problem (As proposed by dbeer). r.
Just to help understand David's question, the -A option was created to help with multi-mom mode. If you use an alias name the alias must be resolvable. In our environments we add the alias to the /etc/hosts file.
Yes, on Rocks compute-X-Y is resolvable in the cluster DNS so this works fine for my torque-roll setup.