[torqueusers] managing consumable resources

Jerry Smith jdsmit at sandia.gov
Mon Aug 25 14:40:26 MDT 2008


Brian,

What does checknode -v GLOBAL give you?  If I add the GRES line I get:

node GLOBAL

State:      Idle  (in current state for 00:00:02)
Configured Resources: bogus: 2
Utilized   Resources: ---
Dedicated  Resources: ---
Opsys:      ---       Arch:      ---
Speed:      0.00     CPULoad:   0.000
Partition:  SHARED  Rack/Slot:  ---
EffNodeAccessPolicy: SINGLEJOB

Reservations:  ---
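
If you would rather check this from a script than by eye, here is a minimal Python sketch that pulls the "Configured Resources" line out of checknode output. It assumes the output looks like the listing above (name, colon, count). The function name configured_gres is made up for illustration, and it parses pasted text rather than running checknode itself.

```python
import re

def configured_gres(checknode_output):
    """Return {name: count} parsed from the 'Configured Resources:' line.

    Hypothetical helper: assumes the 'name: count' layout shown in the
    checknode listing above; other Maui versions may print it differently.
    """
    for line in checknode_output.splitlines():
        if line.startswith("Configured Resources:"):
            value = line.split(":", 1)[1].strip()
            if value == "---":  # checknode prints --- when nothing is set
                return {}
            pairs = re.findall(r"(\w+):\s*(\d+)", value)
            return {name: int(count) for name, count in pairs}
    return {}  # no 'Configured Resources:' line found

# Sample text copied from the checknode -v GLOBAL listing above.
sample = """node GLOBAL

State:      Idle  (in current state for 00:00:02)
Configured Resources: bogus: 2
Utilized   Resources: ---
"""

print(configured_gres(sample))
```

An empty result would suggest that Maui never picked up the GRES line from maui.cfg.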



--Jerry

Brian O'Connor wrote:
> Hi
>
> We have a 32p Altix running SLES10sp1. It has a 5-user node-locked licensed
> application, let's call it "bogus" ;)
>
> We want torque-2.3.0/maui to manage the allocation of the licenses as a
> consumable resource, in the same way that it manages cpu allocation. There
> is no other interactive use of bogus, and so it should just be a node-locked
> generic consumable resource. The jobs should wait in the queue until a
> license becomes available.
>
> I am having a bit of trouble understanding the somewhat confusing
> torque/maui documentation.
>
> I first tried without maui (i.e. using pbs_sched), adding the following to
> <torque>mom_priv/config:
>
> bogus 5
>
> and then "qsub -l other=bogus script.sh" and
> "qsub -l software=bogus script.sh"
>
> but these jobs ran regardless. The bogus resource was not decremented.
>
> This is discussed at
>
> http://www.clusterresources.com/torquedocs21/a.cmomconfig.shtml
>
> I then tried using maui. I have used maui before in simple
> configurations.
>
> The maui.cfg is
>
> --------------------
> # maui.cfg 3.2.6p19
>
> SERVERHOST            mel-dem1.melbourne.sgi.com
> # primary admin must be first in list
> ADMIN1                root sgi
> ADMIN3                wwwrun
>
> # Resource Manager Definition
>
> RMCFG[base] TYPE=PBS
>
> # Allocation Manager Definition
>
> AMCFG[bank]  TYPE=NONE
>
> # full parameter docs at http://supercluster.org/mauidocs/a.fparameters.html
> # use the 'schedctl -l' command to display current configuration
>
> RMPOLLINTERVAL        00:00:30
>
> SERVERPORT            42559
>
> # Admin: http://supercluster.org/mauidocs/a.esecurity.html
>
>
> LOGFILE               maui.log
> LOGFILEMAXSIZE        10000000
> LOGLEVEL              3
>
> # Job Priority: http://supercluster.org/mauidocs/5.1jobprioritization.html
>
> QUEUETIMEWEIGHT       1
>
> # FairShare: http://supercluster.org/mauidocs/6.3fairshare.html
>
> FSPOLICY              PSDEDICATED
> FSDEPTH               7
> FSINTERVAL            604800
> FSDECAY               0.80
>
> USERCFG[DEFAULT] MAXIJOB=20 MAXJOB=10,20
>
> NODEMAXLOAD 40
>
> # allow jobs to share node
> NODEACCESSPOLICY  SHARED
>
> # track background load
> NODELOADPOLICY            ADJUSTPROCS
> NODEUNTRACKEDLOADFACTOR   1.2
>
> # favor short jobs, disfavor large jobs
> QUEUETIMEWEIGHT   0
> RESOURCEWEIGHT    -10
> PROCWEIGHT        128
> MEMWEIGHT         1
> XFACTOR           1000
>
>
> # kill resource hogs
> RESOURCEUTILIZATIONPOLICY ALWAYS
> RESOURCEUTILIZATIONACTION CANCEL
> # I added
>
> NODECFG[GLOBAL]  GRES=bogus:5
> ---------------------------------
>
> and used "qsub -W x=GRES:bogus+1 script.sh"
>
> but these also ran regardless.
>
> What am I missing?
>
> Brian O'Connor
> -----------------------------------------------------------------------
> SGI Consulting
> Email: briano at sgi.com, Mobile +61 417 746 452
> Phone: +61 3 9963 1900, Fax:  +61 3 9963 1902
> 357 Camberwell Road, Camberwell, Victoria, 3124
> AUSTRALIA
> http://www.sgi.com/support/services
> -----------------------------------------------------------------------


