AW: [torqueusers] non-existing job

Schulz, Henrik h.schulz at fzd.de
Wed Jan 23 23:29:10 MST 2008


Thanks Garrick,

but unfortunately it does not work. The messages still appear...

Henrik

-----Ursprüngliche Nachricht-----
Von: torqueusers-bounces at supercluster.org [mailto:torqueusers-bounces at supercluster.org] Im Auftrag von Garrick Staples
Gesendet: Mittwoch, 23. Januar 2008 20:36
An: torqueusers at supercluster.org
Betreff: Re: [torqueusers] non-existing job

On Wed, Jan 23, 2008 at 02:39:35PM +0100, Schulz, Henrik alleged:
> Dear all,
> 
> I am running TORQUE 2.1.2 together with MAUI 3.2.6p16. On a couple of nodes I have such a line every 45 seconds in my mom_logs:
> 
> 01/23/2008 00:00:27;0008;   pbs_mom;Job;34422.master;job was terminated
> 
> On other nodes job 34422 produces this line:
> 
> 01/23/2008 14:33:17;0008;   pbs_mom;Job;34422.master;ERROR:    received request 'KILL_JOB' from 10.0.0.91:1023 for job '34422.master' (job does not exist locally)
> 
> The job with number 34422 really existed, but this was about 4 months ago. Maybe there was a problem with this job concerning MPI communication.
> 
> Is there any chance to suppress these messages?

momctl -c 34422 -h <momhostname>






More information about the torqueusers mailing list