[Mauiusers] Maui/SLURM-wiki and consumable resources other than processors

Dave Jackson jacksond at clusterresources.com
Tue Jan 18 15:43:06 MST 2005


Susanne,

>  What does SLURM need to provide Maui for this to work?

  SLURM needs to provide per job memory requirement or per node memory
utilization information.  Maui should be able to manage memory over
subscription if either of these pieces of information are available.
These are specified via the 'DMEM' job attribute and the 'AMEMORY' node
attribute.  NOTE:  if DMEM is specified, Maui can prevent
oversubscription from occuring.  If only AMEMORY is specified, Maui can
only keep it from getting worse once it has occurred.

  A level 7 log should indicate exactly what information is being sent
from SLURM to Maui.  The 'per job' dedicated memory may need to be
specified within the SLURM job at submission time.  

  Please let us know what you find.

Dave
 
On Fri, 2005-01-14 at 16:52 -0500, Balle, Susanne wrote:
> Hi 
> 
> I am trying to use the "consumable resources" feature in Maui. 
> 
> I did a test to see if Maui register the amount of memory used 
> when running a job with srun (slurm) as it does with processors and 
> it doesn't.
> 
> I am trying to use the "consumable resource" feature to allow jobs 
> to be scheduled more efficiently. I tested this with processors and it 
> works as expected. I didn't get any nodes overallocated. In the case of 
> memory Maui overallocate my nodes.
> 
> As you can see the job run by "test" is running is using %MEM 36.3%. 
> Something is wrong with these numbers as well but the basic idea 
> is that the program uses a non negligeable amount of memory. 
> This usage is not recorded in the output from "diagnose -n".
> 
> >From the output from "diagnose -n" we can see that we are using one 
> processor. On xc14n16 but the amount of memory usage is not updated.
> 
> This point is further highlighed by the output from "checknode xc14n16" 
> enclosed below. Only processors are tracked.
> 
> Is this a bug? a limitation in the Maui/Slurm-wiki integration?
> 
> What does SLURM need to provide Maui for this to work?
> 
> Thanks for any help,
> 
> Regards,
> 
> Susannne
> 
> -----------------------------
> 
> Output from top:
> ----------------
> Mem:  3905352k av, 2813060k used, 1092292k free,       0k shrd,  188356k
> buff
>       2239404k active,             165076k inactive
> Swap: 6291288k av,       0k used, 6291288k free                  325164k
> cached
> 
>   PID USER     PRI  NI  SIZE  RSS SHARE STAT %CPU %MEM   TIME CPU
> COMMAND
>  6655 test      25   0 1385M 1.4G   252 R    24.9 36.3   2:44   1
> matmut2
>     1 root      15   0   528  528   452 S     0.0  0.0   0:44   3 init
>     2 root      RT   0     0    0     0 SW    0.0  0.0   0:00   0
> migration/0
>     3 root      RT   0     0    0     0 SW    0.0  0.0   0:00   1
> migration/1
>     4 root      RT   0     0    0     0 SW    0.0  0.0   0:00   2
> migration/2
>     5 root      RT   0     0    0     0 SW    0.0  0.0   0:00   3
> migration/3
> 
> [root at xc14n16 etc]# diagnose -n
> -------------------------------
> diagnosing node table (5120 slots)
> Name                    State  Procs     Memory         Disk
> [snip]
> xc14n13                  Idle   2:2     2981:2981    12283:12283      
> ]                         [NONE]                         [NONE]
> xc14n14                  Idle   2:2     2981:2981    12283:12283      
> ]                         [NONE]                         [NONE]
> xc14n15                  Idle   2:2     2981:2981    12283:12283      
> ]                         [NONE]                         [NONE]
> xc14n16               Running   3:4     3813:3813     7867:7867       
> ]                         [NONE]                         [NONE]
> -----                     ---   9:10   12756:12756   44716:44716      
> Total Nodes: 4  (Active: 1  Idle: 3  Down: 0)
> 
> [root at xc14n16 etc]# checknode xc14n16
> 
> checking node xc14n16
> 
> State:   Running  (in current state for 00:00:00)
> Configured Resources: PROCS: 4  MEM: 3813M  DISK: 7867M
> Utilized   Resources: [NONE]
> Dedicated  Resources: PROCS: 1
> Opsys:        [NONE]  Arch:      [NONE]
> Speed:      1.00  Load:       0.000
> Features:   [NONE]
> Attributes: [Batch]
> Classes:    [NONE]
> 
> _______________________________________________
> mauiusers mailing list
> mauiusers at supercluster.org
> http://supercluster.org/mailman/listinfo/mauiusers



More information about the mauiusers mailing list