[Mauiusers] running jobs & restarting maui
Chris Samuel
csamuel at vpac.org
Tue Nov 8 15:26:02 MST 2005
On Tue, 8 Nov 2005 08:20 pm, Thomas Dargel wrote:
> 11/08 09:20:41 ALERT: job '561' in state 'Running' has exceeded its
> wallclock limit (0+S:0) by 16:43:00 (job will be cancelled)
> 11/08 09:20:41 MSysRegEvent(JOBWCVIOLATION: job '561' in state 'Running'
> has exceeded its wallclock limit (0) by 16:43:00 (job will be
> cancelled) job start
Bingo - the jobs are running with a walltime of 0 (i.e. not set) and for some
reason whilst Maui considers this to be infinite if the job is submitted
whilst it's running when it restarts it sees this as 0 and so kills the
job. :-(
> Do I have to set a 'wallclock limit' in maui.cfg or when the job is
> submitted?
When the job is submitted - you can set default walltimes on queues, that's
what we do here (primarily for a commercial package where the frontend
GUI doesn't permit the user to specify a walltime, but does let them select a
queue to run in).
Queue Memory CPU Time Walltime Node Run Que Lm State
---------------- ------ -------- -------- ---- --- --- -- -----
[...]
run_1_hour -- -- 01:00:00 -- 0 0 -- E R
run_4_hours -- -- 04:00:00 -- 0 0 -- E R
run_12_hours -- -- 12:00:00 -- 0 0 -- E R
run_1_day -- -- 24:00:00 -- 0 0 -- E R
run_3_days -- -- 73:00:00 -- 0 0 -- E R
run_1_week -- -- 168:00:0 -- 2 0 -- E R
run_1_month -- -- 744:00:0 -- 3 0 -- E R
run_2_months -- -- 1488:00: -- 0 0 -- E R
run_3_months -- -- 2232:00: -- 0 0 -- E R
cheers!
Chris
--
Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
Victorian Partnership for Advanced Computing http://www.vpac.org/
Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/mauiusers/attachments/20051109/ac5bf71f/attachment.bin
More information about the mauiusers
mailing list