[torqueusers] problem getting files copied

Christopher Samuel samuel at unimelb.edu.au
Thu Jul 8 19:57:51 MDT 2010


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On 08/07/10 23:05, Andreas Davour wrote:

> Looking in /var/spool/torque I find nothing looking like
> ER or OU files or any uncopied files in the undelivered
> directory.

Your job isn't getting far enough for that to happen
would be my guess.

> On the only node inline the mom log say:
> 07/08/2010 14:52:09;0001;   pbs_mom;Job;TMomFinalizeJob3;start failed, 
> improper sid

In your syslog you will likely see something like:

"read of pipe for sid failed for job %s (%d of %d bytes)"

as that's the error right before the "improper sid" report.

The test that is failing is:

        if (ReadSize != sizeof(sjr))

where ReadSize is passed into the function TMomFinalizeJob3
and sjr is defined as:

        struct startjob_rtn sjr;

I don't know this area of code, so would need to dig a bit
deeper, but I presume the problem is occuring before that
to cause ReadSize to not match it.

cheers,
Chris
- -- 
 Christopher Samuel - Senior Systems Administrator
 VLSCI - Victorian Life Sciences Computational Initiative
 Email: samuel at unimelb.edu.au Phone: +61 (0)3 903 55545
         http://www.vlsci.unimelb.edu.au/

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iEYEARECAAYFAkw2gh8ACgkQO2KABBYQAh80jgCfcLZW6RFQeI9eHNMEYWLfj8wL
olEAoJYeNjlL7ESzNbovZTxqkiWmBMHF
=2cm/
-----END PGP SIGNATURE-----
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20100709/5ccdab32/attachment.html 


More information about the torqueusers mailing list