[Mauiusers] Multiple standing reservations with TASKCOUNT mutually exclude nodes from *all* users

Chris Samuel csamuel at vpac.org
Sun Sep 26 23:35:48 MDT 2004


On Sat, 25 Sep 2004 04:01 am, Wightman wrote:

An update to the message I forgot to send to the list:

> You are correct, our testing was not complete.  However, our testing in
> Moab does show correct behavior (Moab 4.2.0).
>
> Were you testing Moab 4.0.4 or Moab 4.2.0?  The fix was rolled into
> 4.2.0 but not into 4.0.4 (until today).

Moab 4.0.4; I can't see 4.2.0 in the usual places.

Just tried it with moab-4.0.4p8-snap.1096046762, still no luck. :-(

# showres -n | fgrep .0
node084                    User     sque.0.1053946        N/A    1   -00:01:03     4:34:09  Mon Sep 27 15:25:51
node085                    User      sr1.0.1053951        N/A    1   -00:00:11    INFINITE  Mon Sep 27 15:26:43
node085                    User      sr2.0.1053951        N/A    1   -00:00:11    INFINITE  Mon Sep 27 15:26:43
node085                    User     sque.0.1053946        N/A    1   -00:01:03     4:34:09  Mon Sep 27 15:25:51
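For anyone wanting to spot this kind of overlap quickly, here is a rough sketch (not part of Moab or Maui) that groups reservation IDs by node from `showres -n` output like the above. It assumes only that the first three whitespace-separated columns are node, reservation type and reservation ID, as in this sample; the function names are made up for illustration.

```python
from collections import defaultdict

# Sample copied from the showres -n output above.
SAMPLE = """\
node084                    User     sque.0.1053946        N/A    1   -00:01:03     4:34:09  Mon Sep 27 15:25:51
node085                    User      sr1.0.1053951        N/A    1   -00:00:11    INFINITE  Mon Sep 27 15:26:43
node085                    User      sr2.0.1053951        N/A    1   -00:00:11    INFINITE  Mon Sep 27 15:26:43
node085                    User     sque.0.1053946        N/A    1   -00:01:03     4:34:09  Mon Sep 27 15:25:51
"""


def reservations_by_node(text):
    """Map each node name to the reservation IDs listed against it.

    Assumption: columns 1-3 are node, reservation type, reservation ID.
    """
    by_node = defaultdict(list)
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        node, _rsv_type, rsv_id = fields[:3]
        by_node[node].append(rsv_id)
    return dict(by_node)


def shared_nodes(by_node):
    """Keep only the nodes that carry more than one reservation."""
    return {node: rsvs for node, rsvs in by_node.items() if len(rsvs) > 1}


if __name__ == "__main__":
    for node, rsvs in sorted(shared_nodes(reservations_by_node(SAMPLE)).items()):
        print(node, ", ".join(rsvs))
```

On the sample above this reports node085 as the only node carrying more than one reservation.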

SRCFG[sr1] PERIOD=INFINITY
SRCFG[sr1] TASKCOUNT=1 FLAGS=SPACEFLEX
SRCFG[sr1] ACCESS=DEDICATEDRESOURCE
SRCFG[sr1] USERLIST=csamuel

SRCFG[sr2] PERIOD=INFINITY
SRCFG[sr2] TASKCOUNT=1 FLAGS=SPACEFLEX
SRCFG[sr2] ACCESS=DEDICATEDRESOURCE
SRCFG[sr2] USERLIST=dbannon

SRCFG[sque] STARTTIME=08:00:00 ENDTIME=20:00:00
SRCFG[sque] PERIOD=DAY DAYS=MON,TUE,WED,THU,FRI
SRCFG[sque] PROCLIMIT<=4 DEPTH=7
SRCFG[sque] HOSTLIST=node084,node085
SRCFG[sque] MAXTIME=00:15:00+*
SRCFG[sque] ACCESS=DEDICATEDRESOURCE

Changing the ACCESS settings back to DEDICATED showed that none of the
standing reservations could get onto the cluster, even though there were
nodes free. :-(

09/27 15:29:59 MReqCreate(218774,SrcRQ,DstRQ,FALSE)
09/27 15:29:59 INFO:     processing node request line '1'
09/27 15:29:59 INFO:     job '218774' loaded:   1   doehme    users   1800       Idle   0 1096262998   [NONE] [NONE] [NONE] >=      0 >=      0 [NONE] 1096262999
09/27 15:29:59 INFO:     137 PBS jobs detected on RM base
09/27 15:29:59 INFO:     jobs detected: 137
09/27 15:29:59 MRMQueueQuery(QCount,EMsg,SC)
09/27 15:29:59 MPBSLoadQueueInfo(base,NULL,SC)
09/27 15:29:59 INFO:     0 PBS jobs detected on RM base
09/27 15:29:59 INFO:     no queues detected
09/27 15:29:59 ALERT:    job sr1.0.1053951 cannot run in any partition
09/27 15:29:59 ALERT:    cannot select 2 procs in partition '[ALL]' for rsv 'sr1.0.1053951'
09/27 15:29:59 ALERT:    cannot create standing reservation 'sr1'
09/27 15:29:59 ALERT:    job sr2.0.1053951 cannot run in any partition
09/27 15:29:59 ALERT:    cannot select 2 procs in partition '[ALL]' for rsv 'sr2.0.1053951'
09/27 15:29:59 ALERT:    cannot create standing reservation 'sr2'
09/27 15:29:59 ALERT:    job sque.0.1053951 cannot run in any partition
09/27 15:29:59 ALERT:    cannot select 4 procs in partition '[ALL]' for rsv 'sque.0.1053951'
09/27 15:29:59 ALERT:    cannot create standing reservation 'sque'
09/27 15:29:59 ALERT:    job sque.1.1053951 cannot run in any partition
09/27 15:29:59 ALERT:    cannot select 4 procs in partition '[ALL]' for rsv 'sque.1.1053951'
09/27 15:29:59 ALERT:    cannot create standing reservation 'sque'
09/27 15:29:59 ALERT:    job sque.2.1053951 cannot run in any partition
09/27 15:29:59 ALERT:    cannot select 4 procs in partition '[ALL]' for rsv 'sque.2.1053951'
09/27 15:29:59 ALERT:    cannot create standing reservation 'sque'
09/27 15:29:59 ALERT:    job sque.3.1053951 cannot run in any partition
09/27 15:29:59 ALERT:    cannot select 4 procs in partition '[ALL]' for rsv 'sque.3.1053951'
09/27 15:29:59 ALERT:    cannot create standing reservation 'sque'
09/27 15:29:59 ALERT:    job sque.4.1053951 cannot run in any partition
09/27 15:29:59 ALERT:    cannot select 4 procs in partition '[ALL]' for rsv 'sque.4.1053951'
09/27 15:29:59 ALERT:    cannot create standing reservation 'sque'

Changing them back to DEDICATEDRESOURCE got them working again, this time
with just sr1 and sr2 sharing the same node.

cheers,
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Systems & Network Admin
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia
