[torqueusers] hanging jobs/communication error with mixed network
alan at clusterresources.com
Fri Jul 6 04:04:18 MDT 2007
I know it might sound like over-engineering on a small cluster,
but I generally recommend running DNS (BIND 9, for example) with a local
domain configured. Where a mix of public and private IPs appears, you can
then use DNS views and forget about it :)
EMEA Systems Engineer
Cluster Resources, Ltd.
Adrian Knoth wrote:
> On Thu, Jul 05, 2007 at 06:33:00PM +0100, Alan Mosca wrote:
>> I'm taking a bit of a blind guess, but probably the hostnames you have
>> set up in the "server_priv/nodes" file, are pointing to the public
>> interfaces (where available), so maybe fixing that by either changing
> You're right. I just said "racl00" or "racl01", and /etc/hosts contained
> multiple lines (public and private addresses) for these entries.
> I've now dedicated internal names (like racl00-svc, racl01-svc) for the
> private addresses, thus fixing the problem.
> Thanks for clarification.
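
For anyone hitting the same symptom, a quick way to spot this kind of
ambiguity is to look for names that are listed with more than one address
in hosts-file style data. The following is only a minimal sketch, not a
TORQUE tool: the function name ambiguous_hosts and the sample addresses
are made up for illustration, and only the names racl00 and racl01-svc
come from the thread.

```python
from collections import defaultdict


def ambiguous_hosts(hosts_text):
    """Return {name: sorted addresses} for names listed with more than one address."""
    seen = defaultdict(set)
    for line in hosts_text.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        if not line:
            continue
        # hosts format: one address followed by one or more names.
        address, *names = line.split()
        for name in names:
            seen[name].add(address)
    return {name: sorted(addrs) for name, addrs in seen.items() if len(addrs) > 1}


# Hypothetical sample data; the addresses are placeholders, not from the thread.
SAMPLE = """\
127.0.0.1    localhost
192.0.2.10   racl00        # public interface (assumed)
10.0.0.10    racl00        # private interface (assumed)
10.0.0.11    racl01-svc    # dedicated internal name
"""

if __name__ == "__main__":
    for name, addrs in ambiguous_hosts(SAMPLE).items():
        print(name, "->", ", ".join(addrs))
```

To check a real machine, the contents of /etc/hosts could be passed in
instead of SAMPLE. Any name this reports is a candidate for a dedicated
internal alias like the -svc names above.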