[torqueusers] problem getting files copied
Christopher Samuel
samuel at unimelb.edu.au
Thu Jul 8 19:57:51 MDT 2010
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
On 08/07/10 23:05, Andreas Davour wrote:
> Looking in /var/spool/torque I find nothing looking like
> ER or OU files or any uncopied files in the undelivered
> directory.
Your job isn't getting far enough for that to happen
would be my guess.
> On the only node inline the mom log say:
> 07/08/2010 14:52:09;0001; pbs_mom;Job;TMomFinalizeJob3;start failed,
> improper sid
In your syslog you will likely see something like:
"read of pipe for sid failed for job %s (%d of %d bytes)"
as that's the error right before the "improper sid" report.
The test that is failing is:
if (ReadSize != sizeof(sjr))
where ReadSize is passed into the function TMomFinalizeJob3
and sjr is defined as:
struct startjob_rtn sjr;
I don't know this area of code, so would need to dig a bit
deeper, but I presume the problem is occuring before that
to cause ReadSize to not match it.
cheers,
Chris
- --
Christopher Samuel - Senior Systems Administrator
VLSCI - Victorian Life Sciences Computational Initiative
Email: samuel at unimelb.edu.au Phone: +61 (0)3 903 55545
http://www.vlsci.unimelb.edu.au/
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
iEYEARECAAYFAkw2gh8ACgkQO2KABBYQAh80jgCfcLZW6RFQeI9eHNMEYWLfj8wL
olEAoJYeNjlL7ESzNbovZTxqkiWmBMHF
=2cm/
-----END PGP SIGNATURE-----
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20100709/5ccdab32/attachment.html
More information about the torqueusers
mailing list