[torqueusers] .OU and .ER files not being sent to user
Steve Young
chemadm at hamilton.edu
Thu Feb 28 07:51:53 MST 2008
Hi Adil,
Have you verified that you can use passwordless ssh to copy files
between hosts? Just a thought.
-Steve
On Feb 25, 2008, at 9:42 AM, Adil Mughal wrote:
> Dear Experts
>
> At the moment I am unable to have the .OU and .ER files copied to the
> directory I would like them to go to.
>
> Let me try to explain as carefully as possible how I have things
> set up.
>
> (1) I have nfs running and if from the master I type
>
>> df
>
> then this is what I get
>
> Filesystem 1K-blocks Used Available Use% Mounted on
> /dev/sda1 9920592 670548 8737976 8% /
> /dev/sda4 249215768 13355476 222996648 6% /data
> /dev/sda2 39674224 3358824 34267516 9% /usr
> tmpfs 1032344 0 1032344 0% /dev/shm
> dphpc1001.dph.aber.ac.uk:/data
> 249216000 1285120 235067136 1% /data01
> dphpc1002.dph.aber.ac.uk:/data
> 249216000 1377792 234974464 1% /data02
>
>
> (2) In my mom_priv/config file I have the following:
>
> $pbsserver dphpc1011.dph.aber.ac.uk
> $usecp dphpc1011.dph.aber.ac.uk:/users/guest1 /users/guest1
> $logevent 255
>
>
> (3) I have the following symbollic links set up under the
> directory /users
>
> lrwxrwxrwx 1 root root 12 2008-01-28 14:03 guest1 -> /data/guest1
> lrwxrwxrwx 1 root root 12 2008-01-28 14:03 guest2 -> /data/guest2
>
> (4) I get the following types of error messages mailed to me
>
> PBS Job Id: 168.dphpc1011.dph.aber.ac.uk
> Job Name: STDIN
> An error has occurred processing your job, see below.
> Post job file processing error; job 168.dphpc1011.dph.aber.ac.uk on
> host dphpc1002.dph.aber.ac.uk/1
>
> Unable to copy file /var/spool/torque/spool/168.dphpc10.OU to
> guest1 at dphpc1011.dph.aber.ac.uk:/data01/guest1/STDIN.o168
>>>> error from copy
> Host key verification failed.
> lost connection
>>>> end error output
> Output retained on that host in: /var/spool/torque/undelivered/
> 168.dphpc10.OU
>
> Unable to copy file /var/spool/torque/spool/168.dphpc10.ER to
> guest1 at dphpc1011.dph.aber.ac.uk:/data01/guest1/STDIN.e168
>>>> error from copy
> Host key verification failed.
> lost connection
>>>> end error output
> Output retained on that host in: /var/spool/torque/undelivered/
> 168.dphpc10.ER
>
> Why is it saying "Host key verification failed" when I am using (?)
> and nfs system??
>
> Many thanks in advance
>
> adil
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20080228/bcd3cfb2/attachment.html
More information about the torqueusers
mailing list