[Mauiusers] Maui/SLURM-wiki and consumable resources other
than processors
Dave Jackson
jacksond at clusterresources.com
Tue Jan 18 15:43:06 MST 2005
Susanne,
> What does SLURM need to provide Maui for this to work?
SLURM needs to provide per job memory requirement or per node memory
utilization information. Maui should be able to manage memory over
subscription if either of these pieces of information are available.
These are specified via the 'DMEM' job attribute and the 'AMEMORY' node
attribute. NOTE: if DMEM is specified, Maui can prevent
oversubscription from occuring. If only AMEMORY is specified, Maui can
only keep it from getting worse once it has occurred.
A level 7 log should indicate exactly what information is being sent
from SLURM to Maui. The 'per job' dedicated memory may need to be
specified within the SLURM job at submission time.
Please let us know what you find.
Dave
On Fri, 2005-01-14 at 16:52 -0500, Balle, Susanne wrote:
> Hi
>
> I am trying to use the "consumable resources" feature in Maui.
>
> I did a test to see if Maui register the amount of memory used
> when running a job with srun (slurm) as it does with processors and
> it doesn't.
>
> I am trying to use the "consumable resource" feature to allow jobs
> to be scheduled more efficiently. I tested this with processors and it
> works as expected. I didn't get any nodes overallocated. In the case of
> memory Maui overallocate my nodes.
>
> As you can see the job run by "test" is running is using %MEM 36.3%.
> Something is wrong with these numbers as well but the basic idea
> is that the program uses a non negligeable amount of memory.
> This usage is not recorded in the output from "diagnose -n".
>
> >From the output from "diagnose -n" we can see that we are using one
> processor. On xc14n16 but the amount of memory usage is not updated.
>
> This point is further highlighed by the output from "checknode xc14n16"
> enclosed below. Only processors are tracked.
>
> Is this a bug? a limitation in the Maui/Slurm-wiki integration?
>
> What does SLURM need to provide Maui for this to work?
>
> Thanks for any help,
>
> Regards,
>
> Susannne
>
> -----------------------------
>
> Output from top:
> ----------------
> Mem: 3905352k av, 2813060k used, 1092292k free, 0k shrd, 188356k
> buff
> 2239404k active, 165076k inactive
> Swap: 6291288k av, 0k used, 6291288k free 325164k
> cached
>
> PID USER PRI NI SIZE RSS SHARE STAT %CPU %MEM TIME CPU
> COMMAND
> 6655 test 25 0 1385M 1.4G 252 R 24.9 36.3 2:44 1
> matmut2
> 1 root 15 0 528 528 452 S 0.0 0.0 0:44 3 init
> 2 root RT 0 0 0 0 SW 0.0 0.0 0:00 0
> migration/0
> 3 root RT 0 0 0 0 SW 0.0 0.0 0:00 1
> migration/1
> 4 root RT 0 0 0 0 SW 0.0 0.0 0:00 2
> migration/2
> 5 root RT 0 0 0 0 SW 0.0 0.0 0:00 3
> migration/3
>
> [root at xc14n16 etc]# diagnose -n
> -------------------------------
> diagnosing node table (5120 slots)
> Name State Procs Memory Disk
> [snip]
> xc14n13 Idle 2:2 2981:2981 12283:12283
> ] [NONE] [NONE]
> xc14n14 Idle 2:2 2981:2981 12283:12283
> ] [NONE] [NONE]
> xc14n15 Idle 2:2 2981:2981 12283:12283
> ] [NONE] [NONE]
> xc14n16 Running 3:4 3813:3813 7867:7867
> ] [NONE] [NONE]
> ----- --- 9:10 12756:12756 44716:44716
> Total Nodes: 4 (Active: 1 Idle: 3 Down: 0)
>
> [root at xc14n16 etc]# checknode xc14n16
>
> checking node xc14n16
>
> State: Running (in current state for 00:00:00)
> Configured Resources: PROCS: 4 MEM: 3813M DISK: 7867M
> Utilized Resources: [NONE]
> Dedicated Resources: PROCS: 1
> Opsys: [NONE] Arch: [NONE]
> Speed: 1.00 Load: 0.000
> Features: [NONE]
> Attributes: [Batch]
> Classes: [NONE]
>
> _______________________________________________
> mauiusers mailing list
> mauiusers at supercluster.org
> http://supercluster.org/mailman/listinfo/mauiusers
More information about the mauiusers
mailing list