[torqueusers] my new pbs server is not working

shibo kuang s.b.kuang at gmail.com
Sat Mar 6 07:12:24 MST 2010


"/home/kuang/sharpbend/s1/r8: No such file or directory."
My compute node does not have this directory, but my master does.
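
A minimal sketch of how to confirm this from inside a job, assuming Python is available on the compute node. PBS_O_WORKDIR is the variable Torque sets to the directory qsub was run from; the helper name resolve_workdir is hypothetical and only for illustration.

```python
import os


def resolve_workdir(env=None):
    """Return the submission directory if it exists on this host, else None.

    PBS_O_WORKDIR is set by Torque to the directory where qsub was run.
    resolve_workdir is a hypothetical helper, not part of Torque.
    """
    env = os.environ if env is None else env
    workdir = env.get("PBS_O_WORKDIR")
    if workdir and os.path.isdir(workdir):
        return workdir
    return None


# Run on the execution host: report whether the directory is visible there.
workdir = resolve_workdir()
print(workdir or "working directory is missing on this host")
```

If this reports the directory as missing on the node, the path exists only on the master. One common arrangement is to share /home between the machines (for example over NFS) so the same path is valid on both, but that is a guess about this setup, not something the error message confirms.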



On Sun, Mar 7, 2010 at 1:09 AM, shibo kuang <s.b.kuang at gmail.com> wrote:

> Hi,
> I just fixed the problem by setting up password-free login between the
> compute node and the master.
> But now I have another problem:
> r8.e19 says
> /home/kuang/sharpbend/s1/r8: No such file or directory.
> If only one computer is used, the server works normally.
> What did I miss when installing Torque?
> Your help would be greatly appreciated.
> Cheers,
> Shibo Kuang
>
>
>
> On Sun, Mar 7, 2010 at 12:46 AM, shibo kuang <s.b.kuang at gmail.com> wrote:
>
>> Hi all,
>> I tried to set up a PBS server for my two CentOS Linux computers (each
>> has 8 cores), but failed.
>> Here is my problem:
>> If I use one computer both as the master running pbs_server and as a
>> compute node, I can submit jobs with a script without any problem. All jobs
>> give the correct results.
>> However, when one computer is the master and the other is the
>> compute node, jobs are never submitted successfully.
>> I would appreciate any hints or suggestions based on the
>> following message I received.
>> Regards,
>> Shibo Kuang
>>
>> Return-Path: <adm at master>
>> Received: from master (localhost [127.0.0.1])
>>         by master (8.13.1/8.13.1) with ESMTP id o26DwKF9006310
>>         for <kuang at master>; Sun, 7 Mar 2010 00:28:20 +1030
>> Received: (from root at localhost)
>>         by master (8.13.1/8.13.1/Submit) id o26DwKpZ006293
>>         for kuang at master; Sun, 7 Mar 2010 00:28:20 +1030
>> Date: Sun, 7 Mar 2010 00:28:20 +1030
>> From: adm <adm at master>
>> Message-Id: <201003061358.o26DwKpZ006293 at master>
>> To: kuang at master
>> Subject: PBS JOB 18.master
>> Precedence: bulk
>> PBS Job Id: 18.master
>> Job Name:   r8
>> Exec host:  par1/0
>> An error has occurred processing your job, see below.
>> Post job file processing error; job 18.master on host par1/0
>> Unable to copy file /var/spool/torque/spool/18.master.OU to
>> kuang at master:/home/kuang/sharpbend/s1/r8/r8.o18
>> *** error from copy
>> Permission denied (publickey,gssapi-with-mic,password).
>> lost connection
>> *** end error output
>> Output retained on that host in:
>> /var/spool/torque/undelivered/18.master.OU
>> Unable to copy file /var/spool/torque/spool/18.master.ER to
>> kuang at master:/home/kuang/sharpbend/s1/r8/r8.e18
>> *** error from copy
>> Permission denied (publickey,gssapi-with-mic,password).
>> lost connection
>> *** end error output
>> Output retained on that host in:
>> /var/spool/torque/undelivered/18.master.ER
>>
>
>
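
The "Permission denied (publickey,gssapi-with-mic,password)" lines in the message above come from the copy of the job's output files from par1 back to kuang at master. A rough sketch of how to test whether that login works without a password, assuming the copy goes over ssh and that Python is available on the node. The function names are hypothetical; BatchMode=yes is a standard OpenSSH option that makes ssh fail instead of prompting.

```python
import subprocess


def ssh_check_command(user, host):
    """Build an ssh command that fails instead of prompting for a password."""
    # BatchMode=yes disables password prompts, so a zero exit status
    # means key-based login works for this user and host.
    return ["ssh", "-o", "BatchMode=yes", f"{user}@{host}", "true"]


def can_login_without_password(user, host, timeout=10):
    """Return True if `user` can reach `host` over ssh without a password."""
    try:
        result = subprocess.run(
            ssh_check_command(user, host),
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout,
        )
    except (OSError, subprocess.TimeoutExpired):
        # ssh is not installed, or the connection hung.
        return False
    return result.returncode == 0
```

Run it on the compute node as the job owner, for example can_login_without_password("kuang", "master"). The check matters in that direction because the node is the side that pushes the .OU and .ER files back to the master.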