[torqueusers] Corrupt cookie running Torque 1.2.0p5

Cliff Kirby ckirby3 at colsa.com
Fri Oct 14 09:53:37 MDT 2005


I’m seeing these messages repeated in my mom logs that I believe are causing
my job to fail.  We are seeing intermittent LDAP problems so my guess is
they have something to do with authentication based on the messages at the
end.  Any other ideas?

 

10/14/2005 10:08:49;0008;   pbs_mom;Job;29134.mach5c.mach5.roc;ERROR:
received request 'ERROR' from 172.16.15.5:15003 for job
'29134.mach5c.mach5.roc' (job has corrupt cookie -
'4D44890364FBDE1D4FD75EAB380328DA' != 'B2F9CC4034B418E459814AB93AC8EBD4')

10/14/2005 10:08:49;0008;   pbs_mom;Job;29134.mach5c.mach5.roc;ERROR:
received request 'ERROR' from 172.16.15.4:15003 for job
'29134.mach5c.mach5.roc' (job has corrupt cookie -
'4D44890364FBDE1D4FD75EAB380328DA' != 'B2F9CC4034B418E459814AB93AC8EBD4')

10/14/2005 10:08:49;0008;   pbs_mom;Job;29134.mach5c.mach5.roc;ERROR:
received request 'ERROR' from 172.16.15.3:15003 for job
'29134.mach5c.mach5.roc' (job has corrupt cookie -
'4D44890364FBDE1D4FD75EAB380328DA' != 'B2F9CC4034B418E459814AB93AC8EBD4')

10/14/2005 10:08:49;0008;   pbs_mom;Job;29134.mach5c.mach5.roc;ERROR:
received request 'ERROR' from 172.16.15.2:15003 for job
'29134.mach5c.mach5.roc' (job has corrupt cookie -
'4D44890364FBDE1D4FD75EAB380328DA' != 'B2F9CC4034B418E459814AB93AC8EBD4')

10/14/2005 10:08:49;0008;   pbs_mom;Job;29134.mach5c.mach5.roc;ERROR:
received request 'ERROR' from 172.16.14.40:15003 for job
'29134.mach5c.mach5.roc' (job has corrupt cookie -
'4D44890364FBDE1D4FD75EAB380328DA' != 'B2F9CC4034B418E459814AB93AC8EBD4')

…

10/14/2005 10:09:29;0001;   pbs_mom;Svr;pbs_mom;Bad UID for job execution
(15023) in 29134.mach5c.mach5.roc, job_start_error from node
172.16.27.7:15003 in job_start_error

10/14/2005 10:09:29;0001;   pbs_mom;Svr;pbs_mom;Bad UID for job execution
(15023) in 29134.mach5c.mach5.roc, abort attempted 16 times in
job_start_error.  ignoring abort request from node 172.16.27.7:15003

10/14/2005 10:09:29;0008;   pbs_mom;Req;send_sisters;sending ABORT to
sisters

10/14/2005 10:09:29;0001;   pbs_mom;Svr;pbs_mom;Bad UID for job execution
(15023) in 29134.mach5c.mach5.roc, job_start_error from node
172.16.24.35:15003 in job_start_error

10/14/2005 10:09:29;0001;   pbs_mom;Svr;pbs_mom;Bad UID for job execution
(15023) in 29134.mach5c.mach5.roc, abort attempted 16 times in
job_start_error.  ignoring abort request from node 172.16.24.35:15003

10/14/2005 10:09:29;0008;   pbs_mom;Req;send_sisters;sending ABORT to
sisters

 

 


-- 
No virus found in this outgoing message.
Checked by AVG Anti-Virus.
Version: 7.0.344 / Virus Database: 267.11.14/129 - Release Date: 10/11/2005
 
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20051014/81039c25/attachment.html


More information about the torqueusers mailing list