[torqueusers] problem getting files copied
samuel at unimelb.edu.au
Thu Jul 8 19:57:51 MDT 2010
-----BEGIN PGP SIGNED MESSAGE-----
On 08/07/10 23:05, Andreas Davour wrote:
> Looking in /var/spool/torque I find nothing looking like
> ER or OU files or any uncopied files in the undelivered
Your job isn't getting far enough for that to happen
would be my guess.
> On the only node inline the mom log say:
> 07/08/2010 14:52:09;0001; pbs_mom;Job;TMomFinalizeJob3;start failed,
> improper sid
In your syslog you will likely see something like:
"read of pipe for sid failed for job %s (%d of %d bytes)"
as that's the error right before the "improper sid" report.
The test that is failing is:
if (ReadSize != sizeof(sjr))
where ReadSize is passed into the function TMomFinalizeJob3
and sjr is defined as:
struct startjob_rtn sjr;
I don't know this area of code, so would need to dig a bit
deeper, but I presume the problem is occuring before that
to cause ReadSize to not match it.
Christopher Samuel - Senior Systems Administrator
VLSCI - Victorian Life Sciences Computational Initiative
Email: samuel at unimelb.edu.au Phone: +61 (0)3 903 55545
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
-----END PGP SIGNATURE-----
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the torqueusers