[torquedev] [Bug 191] New: pbsdsh doesn't work if nodename differs from hostname.

David Beer dbeer at adaptivecomputing.com
Tue May 8 16:20:20 MDT 2012


Roy,

Sorry, we'll have to fix the documentation. I think I updated the man page
when I originally added the -A option, and then I didn't update it again
when I changed the uses for mom_alias.

David

On Tue, May 8, 2012 at 4:11 PM, Roy Dragseth <roy.dragseth at uit.no> wrote:

> Yes, it works! Thanks.
>
> I didn't realize the -A flag would be usable as the man page talks about
> multi-mom setups.
>
> Again, thanks for the quick reply,
>
> r.
>
> On Tuesday 8. May 2012 09.52.24 you wrote:
> > Roy,
> >
> > Have you tried starting the nodes with an alias? pbs_mom -A <node_name>
> >
> > David
> >
> > On Tue, May 8, 2012 at 8:00 AM, <bugzilla-daemon at supercluster.org> wrote:
> > > http://www.clusterresources.com/bugzilla/show_bug.cgi?id=191
> > >
> > >           Summary: pbsdsh doesn't work if nodename differs from hostname.
> > >           Product: TORQUE
> > >           Version: 4.0.*
> > >
> > >          Platform: PC
> > >
> > >        OS/Version: Linux
> > >
> > >            Status: NEW
> > >
> > >          Severity: normal
> > >          Priority: P5
> > >
> > >         Component: pbs_mom
> > >
> > >        AssignedTo: knielson at adaptivecomputing.com
> > >        ReportedBy: roy.dragseth at uit.no
> > >
> > >                CC: torquedev at supercluster.org
> > >
> > >   Estimated Hours: 0.0
> > >
> > > If pbs_mom is started with a name that differs from the hostname pbsdsh
> > > stops working across multiple nodes.
> > >
> > > My setup has compute nodes with hostnames like compute-X-Y.local and node
> > > names in torque where the .local domain is dropped so the hostlist has
> > > entries like compute-X-Y.
> > >
> > > This used to work fine in torque 2 and 3, but on torque 4.0.1 this will
> > > make pbsdsh hang indefinitely (and any mpi launcher using libtm).
> > >
> > > Is it possible to have the old behaviour back?
> > >
> > > Regards,
> > > Roy.
> > >
> > > _______________________________________________
> > > torquedev mailing list
> > > torquedev at supercluster.org
> > > http://www.supercluster.org/mailman/listinfo/torquedev
> --
>
>  The Computer Center, University of Tromsø, N-9037 TROMSØ Norway.
>              phone:+47 77 64 41 07, fax:+47 77 64 41 00
>        Roy Dragseth, Team Leader, High Performance Computing
>         Direct call: +47 77 64 62 56. email: roy.dragseth at uit.no
>



-- 
David Beer | Software Engineer
Adaptive Computing