From s.prabhakaran at grs-sim.de Tue Sep 4 06:14:01 2012 From: s.prabhakaran at grs-sim.de (Suraj Prabhakaran) Date: Tue, 04 Sep 2012 14:14:01 +0200 Subject: [torquedev] Influencing the order of nodes in PBS_NODEFILE Message-ID: <81443EB8-04E8-4ABA-8A09-779461D8F05F@grs-sim.de> Hi all, I have some nodes with 2 different properties, example: node1 fast node2 fast node3 slow node4 slow When I request nodes in this following fashion qsub -l nodes=2:fast+2:slow I observe the $PBS_NODEFILE does not strictly have the nodes listed in the order that was requested but its random. For instance, a strict ordering of the nodes in the order of request would lead to $PBS_NODEFILE with content node1 node2 node3 node4 But I observed something like node3 node2 node4 node1 -------- The question is, is it possible to influence a strict ordering of nodes according to the request? If not, could someone please show me some pointers to the torque code where this can be influenced? Best regards, Suraj From samuel at unimelb.edu.au Tue Sep 4 21:02:32 2012 From: samuel at unimelb.edu.au (Christopher Samuel) Date: Wed, 05 Sep 2012 13:02:32 +1000 Subject: [torquedev] Influencing the order of nodes in PBS_NODEFILE In-Reply-To: <81443EB8-04E8-4ABA-8A09-779461D8F05F@grs-sim.de> References: <81443EB8-04E8-4ABA-8A09-779461D8F05F@grs-sim.de> Message-ID: <5046C0C8.8080208@unimelb.edu.au> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 04/09/12 22:14, Suraj Prabhakaran wrote: > The question is, is it possible to influence a strict ordering of > nodes according to the request? If not, could someone please show > me some pointers to the torque code where this can be influenced? I'm not sure, but I think the ordering of the nodes is done by the scheduler, not Torque itself (well, unless you use pbs_sched). I could be wrong though.. - -- Christopher Samuel Senior Systems Administrator VLSCI - Victorian Life Sciences Computation Initiative Email: samuel at unimelb.edu.au Phone: +61 (0)3 903 55545 http://www.vlsci.org.au/ http://twitter.com/vlsci -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.11 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://www.enigmail.net/ iEYEARECAAYFAlBGwMgACgkQO2KABBYQAh9p1QCgkmfqeZdb0r6Yuq8Kz67Docns oW4An2lXl+Pc0dMIXWc43S5usf2ERV8L =Yjgx -----END PGP SIGNATURE----- From samuel at unimelb.edu.au Tue Sep 4 21:05:32 2012 From: samuel at unimelb.edu.au (Christopher Samuel) Date: Wed, 05 Sep 2012 13:05:32 +1000 Subject: [torquedev] [torqueusers] torque IRC channel In-Reply-To: <5022773D.1040507@cyf-kr.edu.pl> References: <5022773D.1040507@cyf-kr.edu.pl> Message-ID: <5046C17C.4030401@unimelb.edu.au> -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 09/08/12 00:27, Lukasz Flis wrote: > Is there any IRC channel for Torque community and/or developers > available? I've not used it for decades now, but nothing to stop those that do from congregating in #torque if they so want to. cheers, Chris - -- Christopher Samuel Senior Systems Administrator VLSCI - Victorian Life Sciences Computation Initiative Email: samuel at unimelb.edu.au Phone: +61 (0)3 903 55545 http://www.vlsci.org.au/ http://twitter.com/vlsci -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.11 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://www.enigmail.net/ iEYEARECAAYFAlBGwXwACgkQO2KABBYQAh/lhACfZvaJGzhm18CkzSEA8nYdrccZ sa0AnRKl6gqeec6Nw+b0XZfheYJNPyOC =tTRX -----END PGP SIGNATURE----- From s.prabhakaran at grs-sim.de Mon Sep 10 12:40:17 2012 From: s.prabhakaran at grs-sim.de (Suraj Prabhakaran) Date: Mon, 10 Sep 2012 20:40:17 +0200 Subject: [torquedev] Maui/Torque with node properties Message-ID: <00DF894B-F5A2-4F73-BD12-1BFC0C6A562A@grs-sim.de> Dear all, I have 4 nodes with the following properties node1 fast node2 fast node3 slow node4 slow Traditionally, torque allows to request nodes with different properties by qsub -l nodes=1:fast+1:slow The above should allocate one fast node and one slow node and this works perfectly fine when pbs_sched is used. But when I use maui as my scheduler, I never get the nodes assigned and end up waiting infinitely. Is this feature supported in maui? Until now, I haven't read anywhere that this feature is not supported in maui. Or, am I just missing something here? Best, Suraj From bugzilla-daemon at supercluster.org Tue Sep 11 02:19:48 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Tue, 11 Sep 2012 02:19:48 -0600 (MDT) Subject: [torquedev] [Bug 215] New: if qdel is used with "-t" option, qdel should add "[]" to JOBID. Message-ID: http://www.clusterresources.com/bugzilla/show_bug.cgi?id=215 Summary: if qdel is used with "-t" option, qdel should add "[]" to JOBID. Product: TORQUE Version: 4.0.* Platform: PC OS/Version: Linux Status: NEW Severity: minor Priority: P5 Component: clients AssignedTo: knielson at adaptivecomputing.com ReportedBy: sstcosaka01 at gmail.com CC: torquedev at supercluster.org Estimated Hours: 0.0 Created an attachment (id=118) --> (http://www.clusterresources.com/bugzilla/attachment.cgi?id=118) qdel.c modification We can not cancel jobs in a job array with qdel -t array_range JOBID with errors like qdel: Unauthorized Request MSG=must have operator or manager privilege to use -m parameter 446.torque02 If we modify torque-4.1.1/src/cmds/qdel.c with patch.qdel.c so that "[]" is added to job_id, we can cancel jobs in a job array with qdel -t. There is a workaround that we can cancel jobs with qdel -t adding "[]" to jobid at command line. (Good) qdel -t 4-8 479[] (Bad) qdel -t 4-8 479 We have checked this with torque-4.1.1 and torque-4.1.0. ((qdel -t with modified qdel)) /dev/shm/usr/local/torque/bin/qdel: modified qdel /usr/local/torque/bin/qdel: original qdel [test01 at torque02 ~]$ qstat -t Job id Name User Time Use S Queue ------------------------- ---------------- --------------- -------- - ----- 445[5].torque02 STDIN-5 test01 00:00:00 C batch 445[6].torque02 STDIN-6 test01 00:00:00 C batch 445[7].torque02 STDIN-7 test01 00:00:00 C batch 445[8].torque02 STDIN-8 test01 00:00:00 C batch 446[1].torque02 STDIN-1 test01 0 R batch 446[2].torque02 STDIN-2 test01 0 R batch 446[3].torque02 STDIN-3 test01 0 R batch 446[4].torque02 STDIN-4 test01 0 R batch 446[5].torque02 STDIN-5 test01 0 R batch 446[6].torque02 STDIN-6 test01 0 R batch 446[7].torque02 STDIN-7 test01 0 R batch 446[8].torque02 STDIN-8 test01 0 R batch [test01 at torque02 ~]$ /dev/shm/usr/local/torque/bin/qdel -t 5-8 446 [test01 at torque02 ~]$ qdel -t 1-4 446 qdel: Unauthorized Request MSG=must have operator or manager privilege to use -m parameter 446.torque02 ((workarround)) [test01 at torque02 ~]$ qstat -t Job id Name User Time Use S Queue ------------------------- ---------------- --------------- -------- - ----- 479[1].torque02 NPB01-1 test01 0 Q batch 479[2].torque02 NPB01-2 test01 0 Q batch 479[3].torque02 NPB01-3 test01 0 Q batch 479[4].torque02 NPB01-4 test01 0 Q batch 479[5].torque02 NPB01-5 test01 0 Q batch 479[6].torque02 NPB01-6 test01 0 Q batch 479[7].torque02 NPB01-7 test01 0 Q batch 479[8].torque02 NPB01-8 test01 0 Q batch [test01 at torque02 ~]$ qdel -t 4-8 479 qdel: Unauthorized Request MSG=must have operator or manager privilege to use -m parameter 479.torque02.ahoaho [test01 at torque02 ~]$ qdel -t 4-8 479[] [test01 at torque02 ~]$ qstat -t Job id Name User Time Use S Queue ------------------------- ---------------- --------------- -------- - ----- 479[1].torque02 NPB01-1 test01 0 R batch 479[2].torque02 NPB01-2 test01 0 R batch 479[3].torque02 NPB01-3 test01 0 Q batch 479[4].torque02 NPB01-4 test01 0 C batch 479[5].torque02 NPB01-5 test01 0 C batch 479[6].torque02 NPB01-6 test01 0 C batch 479[7].torque02 NPB01-7 test01 0 C batch 479[8].torque02 NPB01-8 test01 0 C batch thank you go --- -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Tue Sep 11 02:34:21 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Tue, 11 Sep 2012 02:34:21 -0600 (MDT) Subject: [torquedev] [Bug 215] if qdel is used with "-t" option, qdel should add "[]" to JOBID. In-Reply-To: References: Message-ID: <20120911083421.C29453EA8299@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=215 sstcosaka01 at gmail.com changed: What |Removed |Added ---------------------------------------------------------------------------- Attachment #118|qdel.c modification |patch.qdel.c description| | -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Thu Sep 13 19:30:37 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Thu, 13 Sep 2012 19:30:37 -0600 (MDT) Subject: [torquedev] [Bug 209] pbs_server rejects Obits with cray_enabled In-Reply-To: References: Message-ID: <20120914013037.2E0A567810D@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=209 Matt Ezell changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED Resolution| |FIXED --- Comment #3 from Matt Ezell 2012-09-13 19:30:34 MDT --- I haven't seen this again. Closing. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Thu Sep 13 19:42:02 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Thu, 13 Sep 2012 19:42:02 -0600 (MDT) Subject: [torquedev] [Bug 112] 'qstat -a' 'tasks' column width too small In-Reply-To: References: Message-ID: <20120914014202.5653E4121140@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=112 Matt Ezell changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED CC| |ezellma at ornl.gov Resolution| |FIXED --- Comment #1 from Matt Ezell 2012-09-13 19:42:01 MDT --- It looks like this has been fixed. Right now, we have '#define DEFTASKSIZE 6'. Closing. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Sun Sep 16 06:08:28 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Sun, 16 Sep 2012 06:08:28 -0600 (MDT) Subject: [torquedev] [Bug 216] New: array job crashes server if display_job_server_suffix = False Message-ID: http://www.clusterresources.com/bugzilla/show_bug.cgi?id=216 Summary: array job crashes server if display_job_server_suffix = False Product: TORQUE Version: 2.5.x Platform: PC OS/Version: Linux Status: NEW Severity: enhancement Priority: P5 Component: pbs_server AssignedTo: dbeer at adaptivecomputing.com ReportedBy: bunk at physik.hu-berlin.de CC: torquedev at supercluster.org Estimated Hours: 0.0 Hi, after setting set server display_job_server_suffix = False (such that JobID = job_number), any attempt to submit an array job with qsub -t