[torqueusers] strange messages in pbs logs
Brock Palen
brockp at umich.edu
Thu Jan 17 09:00:50 MST 2008
I am seeing some messages in my torque server and mom logs that
confuse me:
server_logs
01/17/2008 10:54:36;0001;PBS_Server;Svr;PBS_Server;sync_node_jobs, stray job 787610.nyx.engin.umich.edu found on nyx051
01/17/2008 10:54:36;0001;PBS_Server;Svr;PBS_Server;sync_node_jobs, stray job 787678.nyx.engin.umich.edu found on nyx051
01/17/2008 10:54:36;0001;PBS_Server;Svr;PBS_Server;sync_node_jobs, stray job 787721.nyx.engin.umich.edu found on nyx051
At around the same time, these messages appear in the mom_log on nyx051:
01/17/2008 10:56:56;0080; pbs_mom;Req;req_reject;Reject reply code=15001(Unknown Job Id REJHOST=nyx051.engin.umich.edu MSG=cannot locate job to delete), aux=0, type=DeleteJob, from PBS_Server@nyx.engin.umich.edu
01/17/2008 10:56:56;0080; pbs_mom;Req;req_reject;Reject reply code=15001(Unknown Job Id REJHOST=nyx051.engin.umich.edu MSG=cannot locate job to delete), aux=0, type=DeleteJob, from PBS_Server@nyx.engin.umich.edu
The job IDs Torque is referring to all belong to the same user, and
the jobs are valid:
787610.nyx.engin.umi zcwang cac n1_y2000m7 3232 1 -- -- 03:00 R 01:39
The process is running. Do I need to worry about this?
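
To see at a glance which jobs the server considers stray on each node,
the server log can be scanned with a short script. This is only a
minimal sketch in Python: it assumes the message text has exactly the
"stray job <id> found on <node>" form shown in the excerpt above, and
the function name stray_jobs_by_node is made up for illustration.

```python
import re
from collections import defaultdict

# Pattern taken from the sync_node_jobs messages quoted above
# (assumption: other Torque versions may word the message differently).
STRAY_RE = re.compile(r"stray job (\S+) found on (\S+)")


def stray_jobs_by_node(log_text):
    """Return {node: [job ids]} in order of first appearance."""
    # Collapse whitespace so records wrapped across lines still match.
    flat = " ".join(log_text.split())
    found = defaultdict(list)
    for job_id, node in STRAY_RE.findall(flat):
        if job_id not in found[node]:
            found[node].append(job_id)
    return dict(found)


sample = """\
01/17/2008 10:54:36;0001;PBS_Server;Svr;PBS_Server;sync_node_jobs,
stray job 787610.nyx.engin.umich.edu found on nyx051
01/17/2008 10:54:36;0001;PBS_Server;Svr;PBS_Server;sync_node_jobs,
stray job 787678.nyx.engin.umich.edu found on nyx051
01/17/2008 10:54:36;0001;PBS_Server;Svr;PBS_Server;sync_node_jobs,
stray job 787721.nyx.engin.umich.edu found on nyx051
"""

for node, jobs in stray_jobs_by_node(sample).items():
    print(node, len(jobs), "stray job(s):", ", ".join(jobs))
```

The resulting job IDs could then be compared by hand against what qstat
reports for the same user, as was done for 787610 above.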
Brock Palen
Center for Advanced Computing
brockp at umich.edu
(734)936-1985