From akshar.bhosale at gmail.com Wed Sep 12 13:08:36 2012
From: akshar.bhosale at gmail.com (akshar bhosale)
Date: Thu, 13 Sep 2012 00:38:36 +0530
Subject: [Mauiusers] reservation diagnosesis
Message-ID:

Hi,

We have torque and maui on a RHEL 5.2 cluster. When one of the users tried
to create a reservation on 25 nodes, showres showed N/P as 25/1600 instead
of 25/200. What could be the issue?

From akshar.bhosale at gmail.com Wed Sep 12 22:24:06 2012
From: akshar.bhosale at gmail.com (akshar bhosale)
Date: Thu, 13 Sep 2012 09:54:06 +0530
Subject: [Mauiusers] Fwd: reservation diagnosesis
In-Reply-To:
References:
Message-ID:

Hi,

I found that torque sees 1600, whereas maui (according to the maui logs)
reports 200.

---------- Forwarded message ----------
From: akshar bhosale
Date: Thu, 13 Sep 2012 00:38:36 +0530
Subject: reservation diagnosesis
To: torqueusers at supercluster.org, mauiusers

Hi,

We have torque and maui on a RHEL 5.2 cluster. When one of the users tried
to create a reservation on 25 nodes, showres showed N/P as 25/1600 instead
of 25/200. What could be the issue?

From Cyril.Pennanech at lisa.u-pec.fr Tue Sep 4 02:08:05 2012
From: Cyril.Pennanech at lisa.u-pec.fr (Cyril Pennanech)
Date: Tue, 04 Sep 2012 08:08:05 -0000
Subject: [Mauiusers] Priority
Message-ID: <5045B6D2.6010107@lisa.u-pec.fr>

Hi,

I have tried to define priorities in my queue definitions.
I have 3 specific queues plus a general (default) queue.
For each specific queue I have defined the nodes.
For each queue I have also tried to define the priority in maui through QOS
and CLASSCFG, so that the other queues have higher priority than the default
queue, but maui (I think) does not apply my priority value to the job
correctly (as seen with checkjob -v job_id).
I don't know why the job does not inherit the priority that I specified in
my config (see the extract from my maui config below).
The only part that works is that jobs run on the nodes assigned to their
queue.
So, with my maui configuration, jobs in the default queue get higher priority
than jobs submitted to the other queues...
I don't know why, or where in my config the priority goes wrong.
Does anyone have an idea?

Thanks
Cyril P.

---- extract maui.cfg file ---
BACKFILLPOLICY BESTFIT
RESERVATIONPOLICY CURRENTHIGHEST
NODEALLOCATIONPOLICY MINRESOURCE

QOSCFG[hi] PRIORITY=1000 XFTARGET=100 FLAGS=PREEMPTOR:IGNMAXJOB
QOSCFG[low] PRIORITY=-1000 FLAGS=PREEMPTEE
QOSCFG[fast] PRIORITY=10000 QFLAGS=IGNSYSTEM

NODECFG[node01] FEATURE=queue1
NODECFG[node02] FEATURE=queue1
NODECFG[node03] FEATURE=queue1
NODECFG[node04] FEATURE=queue1
NODECFG[node05] FEATURE=queue2
NODECFG[node06] FEATURE=queue2
NODECFG[node07] FEATURE=queue2
NODECFG[node08] FEATURE=queue2
NODECFG[node09] FEATURE=queue3
NODECFG[node10] FEATURE=queue3

CREDWEIGHT 1
USERWEIGHT 1
GROUPWEIGHT 1
QOSWEIGHT 1

USERCFG[DEFAULT] PRIORITY=1
GROUPCFG[DEFAULT] PRIORITY=1

CLASSCFG[queue1] PRIORITY=1000 HOSTLIST=node05,node06,node07,node08 QDEF=hi
CLASSCFG[queue2] PRIORITY=1000 HOSTLIST=node01,node02,node03,node04 QDEF=hi
CLASSCFG[queue3] PRIORITY=1000 HOSTLIST=node09,node10 QDEF=hi
CLASSCFG[defaultqueue] PRIORITY=1 QDEF=low

From s.prabhakaran at grs-sim.de Mon Sep 10 18:12:07 2012
From: s.prabhakaran at grs-sim.de (Suraj Prabhakaran)
Date: Tue, 11 Sep 2012 00:12:07 -0000
Subject: [Mauiusers] Maui/Torque with node properties
Message-ID: <00DF894B-F5A2-4F73-BD12-1BFC0C6A562A@grs-sim.de>

Dear all,

I have 4 nodes with the following properties:

node1 fast
node2 fast
node3 slow
node4 slow

Traditionally, torque allows requesting nodes with different properties:

qsub -l nodes=1:fast+1:slow

The above should allocate one fast node and one slow node, and this works
perfectly fine when pbs_sched is used. But when I use maui as my scheduler,
the nodes are never assigned and the job waits indefinitely. Is this feature
supported in maui? So far I have not read anywhere that it is unsupported.
Or am I just missing something here?
Best,
Suraj

From adaptivecomputing at bridgemailsystem.com Tue Sep 11 08:12:40 2012
From: adaptivecomputing at bridgemailsystem.com (Adaptive Computing)
Date: Tue, 11 Sep 2012 14:12:40 -0000
Subject: [Mauiusers] You missed it at VMWorld
Message-ID: <14965881.1347372506080.JavaMail.root@mail2.bms.local>

An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/mauiusers/attachments/20120911/c3a72742/attachment-0001.html

From adaptivecomputing at bridgemailsystem.com Wed Sep 12 08:17:14 2012
From: adaptivecomputing at bridgemailsystem.com (Adaptive Computing)
Date: Wed, 12 Sep 2012 14:17:14 -0000
Subject: [Mauiusers] VOTE for Moab at HPCwire Readers' Choice Awards
Message-ID: <31682242.1347458808975.JavaMail.root@mail2.bms.local>

An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/mauiusers/attachments/20120912/2fbbe688/attachment-0001.html

From adaptivecomputing at bridgemailsystem.com Wed Sep 19 08:27:48 2012
From: adaptivecomputing at bridgemailsystem.com (Adaptive Computing)
Date: Wed, 19 Sep 2012 07:27:48 -0700 (PDT)
Subject: [Mauiusers] You missed it at VMWorld
Message-ID: <646745.1348064870329.JavaMail.root@mail4.bridgemailsystem.com>

An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/mauiusers/attachments/20120919/0d52d8a4/attachment-0001.html

From Cyril.Pennanech at lisa.u-pec.fr Fri Sep 21 01:45:12 2012
From: Cyril.Pennanech at lisa.u-pec.fr (Cyril Pennanech)
Date: Fri, 21 Sep 2012 09:45:12 +0200
Subject: [Mauiusers] Jobs priority
Message-ID: <505C1B08.4060301@lisa.u-pec.fr>

Hi all,

I have tried to define priorities in my queue definitions.
I have 3 specific queues plus a general (default) queue.
For each specific queue I have defined the nodes.
For each queue I have also tried to define the priority in maui through QOS
and CLASSCFG, so that the other queues have higher priority than the default
queue, but maui (I think) does not apply my priority value to the job
correctly (as seen with checkjob -v job_id).
I don't know why the job does not inherit the priority that I specified in
my config (see the extract from my maui config below).
The only part that works is that jobs run on the nodes assigned to their
queue.
So, with my maui configuration, jobs in the default queue get higher priority
than jobs submitted to the other queues...
I don't know why, or where in my config the priority goes wrong.
Does anyone have an idea?

Thanks for your help
Cyril

---- extract maui.cfg file ---
BACKFILLPOLICY BESTFIT
RESERVATIONPOLICY CURRENTHIGHEST
NODEALLOCATIONPOLICY MINRESOURCE

QOSCFG[hi] PRIORITY=1000 XFTARGET=100 FLAGS=PREEMPTOR:IGNMAXJOB
QOSCFG[low] PRIORITY=-1000 FLAGS=PREEMPTEE
QOSCFG[fast] PRIORITY=10000 QFLAGS=IGNSYSTEM

NODECFG[node01] FEATURE=queue1
NODECFG[node02] FEATURE=queue1
NODECFG[node03] FEATURE=queue1
NODECFG[node04] FEATURE=queue1
NODECFG[node05] FEATURE=queue2
NODECFG[node06] FEATURE=queue2
NODECFG[node07] FEATURE=queue2
NODECFG[node08] FEATURE=queue2
NODECFG[node09] FEATURE=queue3
NODECFG[node10] FEATURE=queue3

CREDWEIGHT 1
USERWEIGHT 1
GROUPWEIGHT 1
QOSWEIGHT 1

USERCFG[DEFAULT] PRIORITY=1
GROUPCFG[DEFAULT] PRIORITY=1

CLASSCFG[queue1] PRIORITY=1000 HOSTLIST=node05,node06,node07,node08 QDEF=hi
CLASSCFG[queue2] PRIORITY=1000 HOSTLIST=node01,node02,node03,node04 QDEF=hi
CLASSCFG[queue3] PRIORITY=1000 HOSTLIST=node09,node10 QDEF=hi
CLASSCFG[defaultqueue] PRIORITY=1 QDEF=low

From danield at igb.uiuc.edu Fri Sep 21 13:54:29 2012
From: danield at igb.uiuc.edu (Daniel Davidson)
Date: Fri, 21 Sep 2012 14:54:29 -0500
Subject: [Mauiusers] procs= with torque 3.05-1 and maui 3.3.1-1
Message-ID: <505CC5F5.1040505@igb.uiuc.edu>

I am working on finalizing our cluster setup, and part of that is nailing
down the torque/maui config.
I have been looking at what happens in maui when someone submits a script
with

qsub -l procs=x blah.sh

Right now, it looks like maui is ignoring the procs request. Here is an
example:

bash-4.1$ qsub -I -q test_queue -l procs=6
qsub: waiting for job 76338.biocluster.igb.illinois.edu to start
qsub: job 76338.biocluster.igb.illinois.edu ready

-bash-4.1$

However, when I run tracejob:

[root at biocluster init.d]# tracejob -v 76338
/var/spool/torque/server_priv/accounting/20120921: Successfully located matching job records
/var/spool/torque/server_logs/20120921: Successfully located matching job records
/var/spool/torque/mom_logs/20120921: No such file or directory
/var/spool/torque/sched_logs/20120921: No such file or directory

Job: 76338.biocluster.igb.illinois.edu

09/21/2012 14:47:16 S enqueuing into test_queue, state 1 hop 1
09/21/2012 14:47:16 S Job Queued at request of danield at biocluster.igb.illinois.edu,
                      owner = danield at biocluster.igb.illinois.edu,
                      job name = STDIN, queue = test_queue
09/21/2012 14:47:16 A queue=test_queue
09/21/2012 14:47:17 S Job Run at request of maui at biocluster.igb.illinois.edu
09/21/2012 14:47:17 S Not sending email: User does not want mail of this type.
09/21/2012 14:47:17 A user=danield group=danield jobname=STDIN queue=test_queue
                      ctime=1348256836 qtime=1348256836 etime=1348256836
                      start=1348256837 owner=danield at biocluster.igb.illinois.edu
                      exec_host=compute-0-1/0 Resource_List.mem=3gb
                      Resource_List.ncpus=1 Resource_List.neednodes=1
                      Resource_List.nodect=1 Resource_List.nodes=1
                      Resource_List.procs=6

So it looks like only one processor is reserved.
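The mismatch in the accounting record above can also be checked mechanically. What follows is only a minimal sketch in Python, assuming the record is available as one string of space-separated key=value pairs as tracejob prints it; the helper names parse_record and allocated_slots are hypothetical and are not part of Torque or Maui.

```python
# Fields copied from the accounting ("A") record of job 76338 above.
RECORD = (
    "user=danield group=danield jobname=STDIN queue=test_queue "
    "exec_host=compute-0-1/0 Resource_List.mem=3gb Resource_List.ncpus=1 "
    "Resource_List.neednodes=1 Resource_List.nodect=1 "
    "Resource_List.nodes=1 Resource_List.procs=6"
)


def parse_record(record: str) -> dict:
    """Split a record into a key -> value dict (hypothetical helper)."""
    fields = {}
    for token in record.split():
        if "=" in token:
            # Split on the first '=' only, so values such as
            # "1:ppn=6" are kept intact.
            key, value = token.split("=", 1)
            fields[key] = value
    return fields


def allocated_slots(fields: dict) -> int:
    """Count the host/slot entries listed in exec_host."""
    exec_host = fields.get("exec_host", "")
    return len([slot for slot in exec_host.split("+") if slot])


if __name__ == "__main__":
    fields = parse_record(RECORD)
    requested = fields.get("Resource_List.procs", "not set")
    print(f"requested procs={requested}, allocated slots={allocated_slots(fields)}")
```

For this record the sketch reports 6 requested procs against 1 allocated slot, which matches the observation that only one processor was reserved.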
If I change procs=6 to nodes=1:ppn=6, it works correctly:

[root at biocluster init.d]# tracejob -v 76340
/var/spool/torque/server_priv/accounting/20120921: Successfully located matching job records
/var/spool/torque/server_logs/20120921: Successfully located matching job records
/var/spool/torque/mom_logs/20120921: No such file or directory
/var/spool/torque/sched_logs/20120921: No such file or directory

Job: 76340.biocluster.igb.illinois.edu

09/21/2012 14:50:12 S enqueuing into test_queue, state 1 hop 1
09/21/2012 14:50:12 S Job Queued at request of danield at biocluster.igb.illinois.edu,
                      owner = danield at biocluster.igb.illinois.edu,
                      job name = STDIN, queue = test_queue
09/21/2012 14:50:12 A queue=test_queue
09/21/2012 14:50:13 S Job Run at request of maui at biocluster.igb.illinois.edu
09/21/2012 14:50:13 S Not sending email: User does not want mail of this type.
09/21/2012 14:50:13 A user=danield group=danield jobname=STDIN queue=test_queue
                      ctime=1348257012 qtime=1348257012 etime=1348257012
                      start=1348257013 owner=danield at biocluster.igb.illinois.edu
                      exec_host=compute-0-1/5+compute-0-1/4+compute-0-1/3+compute-0-1/2+compute-0-1/1+compute-0-1/0
                      Resource_List.mem=3gb Resource_List.ncpus=1
                      Resource_List.neednodes=1:ppn=6 Resource_List.nodect=1
                      Resource_List.nodes=1:ppn=6

Can someone let me know why this would be, and why ncpus isn't set correctly
in the last job? If I am mistaken about what the procs field means, please
let me know.

Dan

From adaptivecomputing at bridgemailsystem.com Mon Sep 24 16:44:20 2012
From: adaptivecomputing at bridgemailsystem.com (Adaptive Computing)
Date: Mon, 24 Sep 2012 15:44:20 -0700 (PDT)
Subject: [Mauiusers] You missed it at VMWorld - Avoid the Firing Line
Message-ID: <5628799.1348526663924.JavaMail.root@mail4.bridgemailsystem.com>

An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/mauiusers/attachments/20120924/f6d86f4a/attachment-0001.html
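
Editorial note on the maui.cfg extract in the "Jobs priority" message above: that extract associates queues with nodes in two places, NODECFG FEATURE and CLASSCFG HOSTLIST. The following is a minimal sketch in Python that compares the two for the lines as posted. The function names are hypothetical, the parsing assumes one directive per line, and whether a difference between the two lists matters to Maui is not stated in the message.

```python
import re

# Node and class lines copied from the posted maui.cfg extract.
CONFIG = """\
NODECFG[node01] FEATURE=queue1
NODECFG[node02] FEATURE=queue1
NODECFG[node03] FEATURE=queue1
NODECFG[node04] FEATURE=queue1
NODECFG[node05] FEATURE=queue2
NODECFG[node06] FEATURE=queue2
NODECFG[node07] FEATURE=queue2
NODECFG[node08] FEATURE=queue2
NODECFG[node09] FEATURE=queue3
NODECFG[node10] FEATURE=queue3
CLASSCFG[queue1] PRIORITY=1000 HOSTLIST=node05,node06,node07,node08 QDEF=hi
CLASSCFG[queue2] PRIORITY=1000 HOSTLIST=node01,node02,node03,node04 QDEF=hi
CLASSCFG[queue3] PRIORITY=1000 HOSTLIST=node09,node10 QDEF=hi
CLASSCFG[defaultqueue] PRIORITY=1 QDEF=low
"""


def nodes_by_feature(text: str) -> dict:
    """Map each FEATURE value to the set of nodes that carry it."""
    result = {}
    pattern = r"^NODECFG\[(\w+)\]\s+FEATURE=(\w+)"
    for node, feature in re.findall(pattern, text, re.M):
        result.setdefault(feature, set()).add(node)
    return result


def nodes_by_class(text: str) -> dict:
    """Map each class that has a HOSTLIST to its set of hosts."""
    result = {}
    pattern = r"^CLASSCFG\[(\w+)\].*?HOSTLIST=(\S+)"
    for cls, hosts in re.findall(pattern, text, re.M):
        result[cls] = set(hosts.split(","))
    return result


def mismatches(text: str) -> dict:
    """Return classes whose HOSTLIST differs from the same-named feature."""
    features = nodes_by_feature(text)
    return {
        cls: (features.get(cls, set()), hosts)
        for cls, hosts in nodes_by_class(text).items()
        if features.get(cls, set()) != hosts
    }


if __name__ == "__main__":
    for cls, (by_feature, by_hostlist) in sorted(mismatches(CONFIG).items()):
        print(cls, "FEATURE:", sorted(by_feature), "HOSTLIST:", sorted(by_hostlist))
```

On the lines as posted, queue1 and queue2 are flagged: FEATURE=queue1 is set on node01 to node04 while CLASSCFG[queue1] lists node05 to node08, and the reverse holds for queue2. queue3 is consistent.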