[Mauiusers] running jobs & restarting maui

Chris Samuel csamuel at vpac.org
Tue Nov 8 15:26:02 MST 2005


On Tue, 8 Nov 2005 08:20 pm, Thomas Dargel wrote:

> 11/08 09:20:41 ALERT:    job '561' in state 'Running' has exceeded its
> wallclock limit (0+S:0) by 16:43:00 (job will be cancelled)
> 11/08 09:20:41 MSysRegEvent(JOBWCVIOLATION:  job '561' in state 'Running'
> has exceeded its wallclock limit (0) by 16:43:00 (job will be
> cancelled)  job start 

Bingo - the jobs are running with a walltime of 0 (i.e. not set) and for some 
reason whilst Maui considers this to be infinite if the job is submitted 
whilst it's running when it restarts it sees this as 0 and so kills the 
job. :-(

> Do I have to set a 'wallclock limit' in maui.cfg or when the job is
> submitted? 

When the job is submitted - you can set default walltimes on queues, that's
what we do here (primarily for a commercial package where the frontend
GUI doesn't permit the user to specify a walltime, but does let them select a 
queue to run in).

Queue            Memory CPU Time Walltime Node Run Que Lm  State
---------------- ------ -------- -------- ---- --- --- --  -----
[...]
run_1_hour         --      --    01:00:00  --    0   0 --   E R
run_4_hours        --      --    04:00:00  --    0   0 --   E R
run_12_hours       --      --    12:00:00  --    0   0 --   E R
run_1_day          --      --    24:00:00  --    0   0 --   E R
run_3_days         --      --    73:00:00  --    0   0 --   E R
run_1_week         --      --    168:00:0  --    2   0 --   E R
run_1_month        --      --    744:00:0  --    3   0 --   E R
run_2_months       --      --    1488:00:  --    0   0 --   E R
run_3_months       --      --    2232:00:  --    0   0 --   E R

cheers!
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/mauiusers/attachments/20051109/ac5bf71f/attachment.bin


More information about the mauiusers mailing list