[torqueusers] my new pbs server is not working

shibo kuang s.b.kuang at gmail.com
Sat Mar 6 07:09:21 MST 2010


Hi,
I just fix the problem using password  free between the computing node and
the master.
But now i got another problem:
in r8.e19, it says
/home/kuang/sharpbend/s1/r8: No such file or directory.
if only one computer is used, the sever can work normally.
Where is missed by me when I install the torque?
Your help would be greatly appreciated.
Cheers,
Shibo Kuang



On Sun, Mar 7, 2010 at 12:46 AM, shibo kuang <s.b.kuang at gmail.com> wrote:

> Hi all,
> I tried to install a pbs server for my two centos linux computers (each
> have 8 cores), but failed..
> Here is my problem:
> if i treat one computer as master for runnig pbs_server, as well as a
> computing node. I can submit jobs using script without any problem. All jobs
> give the exact results.
> However, when one computer is treated as a master, and another is a
> compting node. jobs ara never submitted sucessfully.
> I would appreciate your hints and suggestions according the
> following prompts i got.
> Regards,
> Shibo Kuang
>
> Return-Path: <adm at master>
> Received: from master (localhost [127.0.0.1])
>         by master (8.13.1/8.13.1) with ESMTP id o26DwKF9006310
>         for <kuang at master>; Sun, 7 Mar 2010 00:28:20 +1030
> Received: (from root at localhost)
>         by master (8.13.1/8.13.1/Submit) id o26DwKpZ006293
>         for kuang at master; Sun, 7 Mar 2010 00:28:20 +1030
> Date: Sun, 7 Mar 2010 00:28:20 +1030
> From: adm <adm at master>
> Message-Id: <201003061358.o26DwKpZ006293 at master>
> To: kuang at master
> Subject: PBS JOB 18.master
> Precedence: bulk
> PBS Job Id: 18.master
> Job Name:   r8
> Exec host:  par1/0
> An error has occurred processing your job, see below.
> Post job file processing error; job 18.master on host par1/0
> Unable to copy file /var/spool/torque/spool/18.master.OU to
> kuang at master:/home/kuang/sharpbend/s1/r8/r8.o18
> *** error from copy
> Permission denied (publickey,gssapi-with-mic,password).
> lost connection
> *** end error output
> Output retained on that host in: /var/spool/torque/undelivered/18.master.OU
> Unable to copy file /var/spool/torque/spool/18.master.ER<http://18.master.er/>to
> kuang at master:/home/kuang/sharpbend/s1/r8/r8.e18
> *** error from copy
> Permission denied (publickey,gssapi-with-mic,password).
> lost connection
> *** end error output
> Output retained on that host in: /var/spool/torque/undelivered/
> 18.master.ER <http://18.master.er/>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20100307/a63672ca/attachment-0001.html 


More information about the torqueusers mailing list