[torqueusers] .OU and .ER files not being sent to user

Steve Young chemadm at hamilton.edu
Thu Feb 28 07:51:53 MST 2008


Hi Adil,
	Have you verified that you can use passwordless ssh to copy files  
between hosts? Just a thought.

-Steve

On Feb 25, 2008, at 9:42 AM, Adil Mughal wrote:

> Dear Experts
>
> At the moment I am unable to have the .OU and .ER files copied to the
> directory I would like them to go to.
>
> Let me try to explain as carefully as possible how I have things  
> set up.
>
> (1) I have nfs running and if from the master I type
>
>> df
>
> then this is what I get
>
> Filesystem           1K-blocks      Used Available Use% Mounted on
> /dev/sda1              9920592    670548   8737976   8% /
> /dev/sda4            249215768  13355476 222996648   6% /data
> /dev/sda2             39674224   3358824  34267516   9% /usr
> tmpfs                  1032344         0   1032344   0% /dev/shm
> dphpc1001.dph.aber.ac.uk:/data
>                      249216000   1285120 235067136   1% /data01
> dphpc1002.dph.aber.ac.uk:/data
>                      249216000   1377792 234974464   1% /data02
>
>
> (2) In my mom_priv/config file I have the following:
>
> $pbsserver dphpc1011.dph.aber.ac.uk
> $usecp dphpc1011.dph.aber.ac.uk:/users/guest1 /users/guest1
> $logevent       255
>
>
> (3) I have the following symbollic links set up under the  
> directory /users
>
> lrwxrwxrwx 1 root root   12 2008-01-28 14:03 guest1 -> /data/guest1
> lrwxrwxrwx 1 root root   12 2008-01-28 14:03 guest2 -> /data/guest2
>
> (4) I get the following types of error messages mailed to me
>
> PBS Job Id: 168.dphpc1011.dph.aber.ac.uk
> Job Name:   STDIN
> An error has occurred processing your job, see below.
> Post job file processing error; job 168.dphpc1011.dph.aber.ac.uk on
> host dphpc1002.dph.aber.ac.uk/1
>
> Unable to copy file /var/spool/torque/spool/168.dphpc10.OU to
> guest1 at dphpc1011.dph.aber.ac.uk:/data01/guest1/STDIN.o168
>>>> error from copy
> Host key verification failed.
> lost connection
>>>> end error output
> Output retained on that host in: /var/spool/torque/undelivered/ 
> 168.dphpc10.OU
>
> Unable to copy file /var/spool/torque/spool/168.dphpc10.ER to
> guest1 at dphpc1011.dph.aber.ac.uk:/data01/guest1/STDIN.e168
>>>> error from copy
> Host key verification failed.
> lost connection
>>>> end error output
> Output retained on that host in: /var/spool/torque/undelivered/ 
> 168.dphpc10.ER
>
> Why is it saying "Host key verification failed" when I am using (?)
> and nfs system??
>
> Many thanks in advance
>
> adil
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20080228/bcd3cfb2/attachment.html


More information about the torqueusers mailing list