[torqueusers] Re: Passwordles login reverted back after reboot of nodes

Gelonia L Dent gdent at amnh.org
Wed May 7 14:29:16 MDT 2008


Thanks for the useful responses to my diskless password issue. I had to
reboot my compute nodes and it seems I'm back to password logins from the
head. Is there any way to make the passwordless login permanent?


--
Gelonia Dent, PhD
Manager of Scientific Computing
Invertebrate Zoology
The American Museum of Natural History
(212) 313-7911




> Send torqueusers mailing list submissions to
> 	torqueusers at supercluster.org
>
> To subscribe or unsubscribe via the World Wide Web, visit
> 	http://www.supercluster.org/mailman/listinfo/torqueusers
> or, via email, send a message with subject or body 'help' to
> 	torqueusers-request at supercluster.org
>
> You can reach the person managing the list at
> 	torqueusers-owner at supercluster.org
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of torqueusers digest..."
>
>
> Today's Topics:
>
>    1. Passwordless_ssh on a diskless cluster (Gelonia L Dent)
>    2. Announcement from Cluster Resources (Rebecca Ventura)
>    3. Re: Passwordless_ssh on a diskless cluster (Daniel Bourque)
>    4. pbs_server: socket_to_handle,	internal socket table full
>       (Chris Samuel)
>    5. Re: pbs_server: socket_to_handle, internal socket	table full
>       (Chris Samuel)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Mon, 28 Apr 2008 11:48:29 -0400 (EDT)
> From: "Gelonia L Dent" <gdent at amnh.org>
> Subject: [torqueusers] Passwordless_ssh on a diskless cluster
> To: torqueusers at supercluster.org
> Message-ID:
> 	<49757.216.73.250.156.1209397709.squirrel at webmail.amnh.org>
> Content-Type: text/plain;charset=iso-8859-1
>
> Dear All,
>
> I am trying to eliminate using password login between the headnode and
> compute nodes on a diskless cluster. I've using the following steps:
>
> ____________________
> On head:
> ssh-keygen -t rsa         (didn't enter a pass phrase)
> ssh-copy-id -i ~/.ssh/id_rsa.pub nodeN   (for a subset of the nodes
> (N=0,126))
>
> On compute nodes
> ssh-keygen -t rsa
> ssh-copy-id -i ~/.ssh/id_rsa.pub master
> ______________________
>
> I have three issues:
> First, I did this on ten nodes, then something happened and I had to
> restart the sshd daemon, now I've lost connection to those ten nodes
> completely. I think my authorized_keys file got overwritten when I
> restarted ssh. How can I reach these nodes again?
>       Error message/       ssh: connect to host node10 port 22: No route
> to host
>
> Second, the first time I attempted this (before I broke those 10 nodes) I
> neglected to copy the same id_rsa.pub file to all the nodes and was
> generating a different one for each node. I've since corrected that step
> and am copying the same *.pub file from the head to all the nodes, then
> generating a *.pub from each node to the head. This gives me
> bi-directional access between each node and the head, but not between
> nodes. Do I need passwordless access between nodes for Moab/Torque to work
> properly? If so, how can this be accomplished.
>
> Third, today I am attempting again to go through this process. I have 128
> nodes. Is there a parameter in the sshd_config  file that is set to use
> passwords across the nodes which I may have overlooked. If so, can it be
> used to set a 'no password option' permanently between the head and
> compute nodes.
>
> Any ideas would be greatly appreciated.
>
> Best,
>
> --
> Gelonia Dent, PhD
> Manager of Scientific Computing
> Invertebrate Zoology
> The American Museum of Natural History
> (212) 313-7911
>
>
>
>
>
> ------------------------------
>
> Message: 2
> Date: Mon, 28 Apr 2008 16:08:02 -0600
> From: Rebecca Ventura <rventura at clusterresources.com>
> Subject: [torqueusers] Announcement from Cluster Resources
> To: torqueusers at supercluster.org
> Message-ID: <48164AC2.5080007 at clusterresources.com>
> Content-Type: text/plain; charset=windows-1252; format=flowed
>
> Due to high demand, the early bird registration discount for Moab-Con
> has been extended to May 5. Please register at:
> http://www.clusterresources.com/moabcon/register.php
>
> Yahoo will give the keynote address the first day of the conference
> (Wednesday, May 28) on “Moab in the Data Center.” Other confirmed
> organizations that will be speaking include Lawrence Livermore National
> Labs, Arizona State University, Oak Ridge National Labs, NCSA and others
> from the commercial, academic and government sectors. Topics cover
> everything from green computing and holistic scheduling to cloud
> computing and the adaptive data center.
>
> The Provo Marriott Hotel and Conference Center, where Moab-Con will be
> held, is offering a discount to conference attendees. The reservation
> deadline to receive the conference rate is May 5^th , so be sure to
> reserve your room by then. For reservation information go to:
> http://clusterresources.com/moabcon/hotel.php
>
> We look forward to seeing you at Moab-Con!
>
> --
> Rebecca Ventura
> Marketing Department
> Cluster Resources, Inc.
> rventura at clusterresources.com
> (801) 717-3728
>
>
>
> ------------------------------
>
> Message: 3
> Date: Mon, 28 Apr 2008 17:34:36 -0500
> From: Daniel Bourque <dbourque at weatherdata.com>
> Subject: Re: [torqueusers] Passwordless_ssh on a diskless cluster
> To: Gelonia L Dent <gdent at amnh.org>
> Cc: torqueusers at supercluster.org
> Message-ID: <481650FC.2070602 at weatherdata.com>
> Content-Type: text/plain; charset=ISO-8859-1; format=flowed
>
> first, if you remotely did a "/etc/init.d/sshd restart" , well you
> kicked yourself out. restart run stop then start, connection drops after
> stop. Always do "nohup /etc/init.d/sshd restart"
>
> since you're using diskless, go on the nfs homedir, run
>
> ssh-keygen -t dsa ( or rsa )
> cd .ssh
> cat id_dsa.pub > authorized_keys
> chmod 600 authorized_keys
>
> I'm assuming that the headnode and compute node shares the same nfs
> mounted home dir, and that you want to allow passwordless logins in
> either directions.
>
> Hope this helps
>
> Daniel Bourque
> Sr. Systems Engineer
> WeatherData Service Inc
> An Accuweather Company
>
>
>
> Gelonia L Dent wrote:
>
>>Dear All,
>>
>>I am trying to eliminate using password login between the headnode and
>>compute nodes on a diskless cluster. I've using the following steps:
>>
>>____________________
>>On head:
>>ssh-keygen -t rsa         (didn't enter a pass phrase)
>>ssh-copy-id -i ~/.ssh/id_rsa.pub nodeN   (for a subset of the nodes
>>(N=0,126))
>>
>>On compute nodes
>>ssh-keygen -t rsa
>>ssh-copy-id -i ~/.ssh/id_rsa.pub master
>>______________________
>>
>>I have three issues:
>>First, I did this on ten nodes, then something happened and I had to
>>restart the sshd daemon, now I've lost connection to those ten nodes
>>completely. I think my authorized_keys file got overwritten when I
>>restarted ssh. How can I reach these nodes again?
>>      Error message/       ssh: connect to host node10 port 22: No route
>>to host
>>
>>Second, the first time I attempted this (before I broke those 10 nodes) I
>>neglected to copy the same id_rsa.pub file to all the nodes and was
>>generating a different one for each node. I've since corrected that step
>>and am copying the same *.pub file from the head to all the nodes, then
>>generating a *.pub from each node to the head. This gives me
>>bi-directional access between each node and the head, but not between
>>nodes. Do I need passwordless access between nodes for Moab/Torque to
>> work
>>properly? If so, how can this be accomplished.
>>
>>Third, today I am attempting again to go through this process. I have 128
>>nodes. Is there a parameter in the sshd_config  file that is set to use
>>passwords across the nodes which I may have overlooked. If so, can it be
>>used to set a 'no password option' permanently between the head and
>>compute nodes.
>>
>>Any ideas would be greatly appreciated.
>>
>>Best,
>>
>>--
>>Gelonia Dent, PhD
>>Manager of Scientific Computing
>>Invertebrate Zoology
>>The American Museum of Natural History
>>(212) 313-7911
>>
>>
>>
>>_______________________________________________
>>torqueusers mailing list
>>torqueusers at supercluster.org
>>http://www.supercluster.org/mailman/listinfo/torqueusers
>>
>>
>>
>
>
> ------------------------------
>
> Message: 4
> Date: Tue, 29 Apr 2008 16:30:50 +1000 (EST)
> From: Chris Samuel <csamuel at vpac.org>
> Subject: [torqueusers] pbs_server: socket_to_handle,	internal socket
> 	table full
> To: torqueusers <torqueusers at supercluster.org>
> Message-ID:
> 	<622895861.50921209450650092.JavaMail.root at zimbra.vpac.org>
> Content-Type: text/plain; charset=utf-8
>
> Anyone seeing these messages popping up in syslog (or in
> your pbs_server logs if you've not got Torque compiled
> to use syslog) ?
>
> pbs_server: socket_to_handle, internal socket table full
>
> cheers,
> Chris
> --
> Christopher Samuel - (03) 9925 4751 - Systems Manager
>  The Victorian Partnership for Advanced Computing
>  P.O. Box 201, Carlton South, VIC 3053, Australia
> VPAC is a not-for-profit Registered Research Agency
>
>
> ------------------------------
>
> Message: 5
> Date: Tue, 29 Apr 2008 17:55:59 +1000 (EST)
> From: Chris Samuel <csamuel at vpac.org>
> Subject: Re: [torqueusers] pbs_server: socket_to_handle, internal
> 	socket	table full
> To: torqueusers <torqueusers at supercluster.org>
> Message-ID:
> 	<1500776757.51991209455759536.JavaMail.root at zimbra.vpac.org>
> Content-Type: text/plain; charset=utf-8
>
>
> ----- "Chris Samuel" <csamuel at vpac.org> wrote:
>
>> Anyone seeing these messages popping up in syslog (or in
>> your pbs_server logs if you've not got Torque compiled
>> to use syslog) ?
>>
>> pbs_server: socket_to_handle, internal socket table full
>
> FWIW I believe that this was with 2.3.0, I don't think
> that anyone had restarted the pbs_server after upgrading
> to the current snapshot of 2.3.1 that Steve pushed out
> with some very welcome fixes in it.
>
> cheers,
> Chris
> --
> Christopher Samuel - (03) 9925 4751 - Systems Manager
>  The Victorian Partnership for Advanced Computing
>  P.O. Box 201, Carlton South, VIC 3053, Australia
> VPAC is a not-for-profit Registered Research Agency
>
>
> ------------------------------
>
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
>
>
> End of torqueusers Digest, Vol 45, Issue 40
> *******************************************
>




More information about the torqueusers mailing list