[Mauiusers] Job Reservation Slipping

Ansgar Esztermann aeszter at gwdg.de
Tue Dec 16 10:16:57 MST 2008


Hello everyone,


we have a strange problem with maui: with backfill enabled,  
reservations for high-priority four-node jobs are slipping forward in  
time; thus, low-priority two-node jobs manage to occupy free nodes,  
effectively bypassing the queue.
A reservation spanning both free and occupied nodes would be created,  
keeping the free nodes free. Then, the other nodes that are part of  
the reservation would become available, and the reservation would be  
moved to a later point in time. Consequently, low-priority jobs would  
be backfilled into the now-freed nodes.
At first, I could not make heads or tails of this behaviour, but  
looking through maui.log, I suspect something may be amiss with our  
partitions: it seems the reservation for job 2645 is created within  
the default partition rather than one of IBA, IBB, IBC.

However, I am at a loss as to how to change the configuration.




12/16 18:01:17  
MPBSJobUpdate(2645,2645.master1.beowulf.cluster,TaskList,0)
12/16 18:01:17 INFO:     192 feasible tasks found for job 2645:0 in  
partition DEFAULT (32 Needed)
12/16 18:01:17 MJobPReserve(2645,DEFAULT,ResCount,ResCountRej)
12/16 18:01:17 INFO:     192 feasible tasks found for job 2645:0 in  
partition DEFAULT (32 Needed)
12/16 18:01:17 INFO:     192 feasible tasks found for job 2645:0 in  
partition IBA (32 Needed)
12/16 18:01:17 INFO:     192 feasible tasks found for job 2645:0 in  
partition IBB (32 Needed)
12/16 18:01:17 INFO:     192 feasible tasks found for job 2645:0 in  
partition DEFAULT (32 Needed)
12/16 18:01:17 INFO:     located resources for 32 tasks (32) in best  
partition DEFAULT for job 2645 at time 00:07:26
12/16 18:01:17 INFO:     tasks located for job 2645:  32 of 32  
required (0 feasible)
12/16 18:01:17 INFO:     job '2645' reserved 32 tasks (partition  
DEFAULT) to start in 00:07:26 on Tue Dec 16 18:08:43
12/16 18:01:19  
MPBSJobUpdate(2645,2645.master1.beowulf.cluster,TaskList,0)
12/16 18:01:19 INFO:     192 feasible tasks found for job 2645:0 in  
partition DEFAULT (32 Needed)
12/16 18:01:19 MJobPReserve(2645,DEFAULT,ResCount,ResCountRej)
12/16 18:01:19 INFO:     192 feasible tasks found for job 2645:0 in  
partition DEFAULT (32 Needed)
12/16 18:01:19 INFO:     192 feasible tasks found for job 2645:0 in  
partition IBA (32 Needed)
12/16 18:01:19 INFO:     192 feasible tasks found for job 2645:0 in  
partition IBB (32 Needed)
12/16 18:01:19 INFO:     192 feasible tasks found for job 2645:0 in  
partition IBB (32 Needed)
12/16 18:01:19 INFO:     located resources for 32 tasks (32) in best  
partition IBB for job 2645 at time 00:12:43
12/16 18:01:19 INFO:     tasks located for job 2645:  32 of 32  
required (0 feasible)
12/16 18:01:19 INFO:     job '2645' reserved 32 tasks (partition IBB)  
to start in 00:12:43 on Tue Dec 16 18:14:02


Looking forward to any suggestions,


A.

-- 
Ansgar Esztermann
DV-Systemadministration
Max-Planck-Institut für biophysikalische Chemie, Abteilung 105



More information about the mauiusers mailing list