From bugzilla-daemon at supercluster.org Wed Aug 1 00:25:43 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Wed, 1 Aug 2012 00:25:43 -0600 (MDT) Subject: [torquedev] [Bug 211] pbs_sched does not read TRQ_IFNAME In-Reply-To: References: Message-ID: <20120801062543.A164C412151C@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=211 Chris Samuel changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |chris at csamuel.org --- Comment #1 from Chris Samuel 2012-08-01 00:25:43 MDT --- There's not a lot of change between the 2.5 pbs_sched.c and the 4.x, it introduces some locks via mutex's but the place where this goes appears to be outside of those. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Wed Aug 1 04:26:12 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Wed, 1 Aug 2012 04:26:12 -0600 (MDT) Subject: [torquedev] [Bug 211] pbs_sched does not read TRQ_IFNAME In-Reply-To: References: Message-ID: <20120801102612.145A84121533@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=211 --- Comment #2 from Taras 2012-08-01 04:26:11 MDT --- Created an attachment (id=115) --> (http://www.clusterresources.com/bugzilla/attachment.cgi?id=115) Using TRQ_IFNAME in pbs_sched -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Wed Aug 1 04:27:01 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Wed, 1 Aug 2012 04:27:01 -0600 (MDT) Subject: [torquedev] [Bug 211] pbs_sched does not read TRQ_IFNAME In-Reply-To: References: Message-ID: <20120801102701.AA0BE4121536@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=211 --- Comment #3 from Taras 2012-08-01 04:27:01 MDT --- Created an attachment (id=116) --> (http://www.clusterresources.com/bugzilla/attachment.cgi?id=116) Using TRQ_IFNAME in pbs_sched -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Wed Aug 1 04:28:29 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Wed, 1 Aug 2012 04:28:29 -0600 (MDT) Subject: [torquedev] [Bug 211] pbs_sched does not read TRQ_IFNAME In-Reply-To: References: Message-ID: <20120801102829.71865412333F@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=211 --- Comment #4 from Taras 2012-08-01 04:28:29 MDT --- (In reply to comment #1) > There's not a lot of change between the 2.5 pbs_sched.c and the 4.x, it > introduces some locks via mutex's but the place where this goes appears to be > outside of those. Ok, I've updated the patch for v2 and v3 and created the new one for v4. The patches are attached. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Wed Aug 1 08:58:30 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Wed, 1 Aug 2012 08:58:30 -0600 (MDT) Subject: [torquedev] [Bug 211] pbs_sched does not read TRQ_IFNAME In-Reply-To: References: Message-ID: <20120801145830.5CA0841219BF@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=211 Ken Nielson changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |knielson at adaptivecomputing. | |com --- Comment #5 from Ken Nielson 2012-08-01 08:58:30 MDT --- Before we do anything with this patch it needs to be understood that torque.cfg is intended to be used for qsub only. By allowing a torque.cfg parameter to used outside of qsub changes the paradigm. Comments? -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Wed Aug 1 14:03:31 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Wed, 1 Aug 2012 14:03:31 -0600 (MDT) Subject: [torquedev] [Bug 210] Cannot configure 4.1-fixes on Ubuntu In-Reply-To: References: Message-ID: <20120801200331.BA11F41213F4@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=210 --- Comment #3 from Michael Jennings 2012-08-01 14:03:31 MDT --- Oops, sorry. Fixed patch: Index: configure.ac =================================================================== --- configure.ac (revision 6611) +++ configure.ac (working copy) @@ -702,10 +702,10 @@ dnl dnl we need libxml2 dnl -xmlLib=`xml2-config --libs` +xmlLib=`xml2-config --libs | sed 's/-L@<:@^@<:@:space:@:>@@:>@* //g;s/-l//'` dnl skip the first two chars because its -l -AC_CHECK_LIB(${xmlLib:2}, xmlGetProp, +AC_CHECK_LIB(${xmlLib}, xmlGetProp, [], [AC_MSG_ERROR([TORQUE needs libxml2-devel in order to build]) ]) Darn autoSPLATisms. >:I -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Wed Aug 1 22:20:42 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Wed, 1 Aug 2012 22:20:42 -0600 (MDT) Subject: [torquedev] [Bug 210] Cannot configure 4.1-fixes on Ubuntu In-Reply-To: References: Message-ID: <20120802042042.4EC3F412297F@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=210 --- Comment #4 from Chris Samuel 2012-08-01 22:20:42 MDT --- Thanks Michael, that works! Oh well, at least it's not cmake or scons.. ;-) -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Wed Aug 1 22:24:25 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Wed, 1 Aug 2012 22:24:25 -0600 (MDT) Subject: [torquedev] [Bug 211] pbs_sched does not read TRQ_IFNAME In-Reply-To: References: Message-ID: <20120802042426.001894122F2B@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=211 --- Comment #6 from Chris Samuel 2012-08-01 22:24:25 MDT --- Good catch Ken, that was a subtlety that had passed me by completely! It's been almost a decade since we used pbs_sched so I can't recall if it has any config options of its own.. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Fri Aug 3 11:25:36 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Fri, 3 Aug 2012 11:25:36 -0600 (MDT) Subject: [torquedev] [Bug 208] pbs_mom segfaults in tm_request In-Reply-To: References: Message-ID: <20120803172536.DE371412313C@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=208 --- Comment #1 from Ken Nielson 2012-08-03 11:25:36 MDT --- Martin, Do you happen to have a back trace of the core for this? -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Mon Aug 6 04:55:37 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Mon, 6 Aug 2012 04:55:37 -0600 (MDT) Subject: [torquedev] [Bug 212] New: server spins on select() with expired sockets Message-ID: http://www.clusterresources.com/bugzilla/show_bug.cgi?id=212 Summary: server spins on select() with expired sockets Product: TORQUE Version: 4.0.* Platform: PC OS/Version: Linux Status: NEW Severity: major Priority: P5 Component: pbs_server AssignedTo: dbeer at adaptivecomputing.com ReportedBy: viktor.stujber at stuba.sk CC: torquedev at supercluster.org Estimated Hours: 0.0 Our torque 4.1.0 server often goes into a cpu-consuming loop. Here's information I gathered so far. > strace -p 28590 select(1024, [8 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 33 34 35 36 37 38 39 41 43 44], NULL, NULL, {5, 0}) = 31 (in [8 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 33 34 35 36 37 38 39 41 43 44], left {4, 999992}) nanosleep({0, 100000}, NULL) = 0 tgkill(28590, 28592, SIG_0) = 0 tgkill(28590, 28593, SIG_0) = 0 (gdb) p svr_conn[8] $34 = {cn_addr = 2477722413, cn_handle = 0, cn_port = 15002, cn_authen = 1, cn_socktype = 2, cn_active = ToServerDIS, cn_lasttime = 1344247871, cn_func = 0, cn_oncl = 0, cn_mutex = 0x2a12ce0, cn_stay_open = 0} I shut down all the connected clients, and all of the abovementioned socket IDs are in the CLOSE_WAIT state. The select() call signals activity on all 31 sockets and returns immediately, but all of them are of type 'ToServerDIS' (not 'Idle'), and none of them have a cn_func assigned, so the code in lib/Libnet/net_server.c wait_request() just keeps spinning over all 10240 blank connection slots with no sleep, causing significant cpu usage. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Mon Aug 6 22:52:19 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Mon, 6 Aug 2012 22:52:19 -0600 (MDT) Subject: [torquedev] [Bug 206] Nodes start with state FREE when starting pbs_server In-Reply-To: References: Message-ID: <20120807045219.EE2AB3EA81F5@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=206 --- Comment #1 from Craig West 2012-08-06 22:52:19 MDT --- After looking through the code I believe I have found some useful information. The file affected is server/node_manager.c It appears to have happened around r4798 or r4799, but I can't be sure. For simplicity I will attach a patch for "my" fix, which has not been extensively tested. /* ** The only state that carries forward is if the ** node has been marked offline. */ while ((np = next_host(&allnodes,&iter,NULL)) != NULL) { if (!(np->nd_state & INUSE_OFFLINE)) { fprintf(nstatef, fmt, np->nd_name, np->nd_state & savemask); } If I remove the "!" on the line with "np->nd_state" it seems to work, and it follows the statement above the code block now. As stated in my previous description of the problem the earlier versions of Torque put a "1" in the server_priv/node_state file for nodes that ARE offline. The current code puts a "0" if the node is NOT offline. If the node is not in the server_priv/node_state file then problem doesn't appear. Craig. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Mon Aug 6 22:55:55 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Mon, 6 Aug 2012 22:55:55 -0600 (MDT) Subject: [torquedev] [Bug 206] Nodes start with state FREE when starting pbs_server In-Reply-To: References: Message-ID: <20120807045555.B0BCF3EA81F6@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=206 --- Comment #2 from Craig West 2012-08-06 22:55:55 MDT --- Created an attachment (id=117) --> (http://www.clusterresources.com/bugzilla/attachment.cgi?id=117) Makes nodes appear in server_priv if the are OFFLINE Nodes will appear in server_priv if they are offline. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From l.flis at cyf-kr.edu.pl Wed Aug 8 08:27:09 2012 From: l.flis at cyf-kr.edu.pl (Lukasz Flis) Date: Wed, 08 Aug 2012 16:27:09 +0200 Subject: [torquedev] torque IRC channel Message-ID: <5022773D.1040507@cyf-kr.edu.pl> Hi, Is there any IRC channel for Torque community and/or developers available? I've checked #torque on freenode which is registered but empty. I think it could be useful to have more interactive, low latency :) contact with torque community Cheers -- Lukasz Flis From s.prabhakaran at grs-sim.de Tue Aug 7 05:43:23 2012 From: s.prabhakaran at grs-sim.de (Suraj Prabhakaran) Date: Tue, 07 Aug 2012 13:43:23 +0200 Subject: [torquedev] Torque database In-Reply-To: References: <20120707025158.GA31089@stikine.sfu.ca> <20120710000706.GA14070@stikine.sfu.ca> Message-ID: Dear Ken, Thank you very much for your reply. Does that mean that Torque does not store the information internally (in a structure or something) but only directly in the file? In the same lines, if one would edit/delete the JB file at the headnode, or at the mom, what would the behavior be like? Thanks, Suraj On Jul 12, 2012, at 10:38 PM, Ken Nielson wrote: > > > On Tue, Jul 10, 2012 at 5:25 AM, Suraj Prabhakaran wrote: > Dear all, > > I am new to torque and I would like to know a few basic things. Could someone please tell me how the job information is stored in torque? In the sense, are active and queued job infos stored in files or only stored internally? > > Best regards, > Suraj > _______________________________________________ > torquedev mailing list > torquedev at supercluster.org > http://www.supercluster.org/mailman/listinfo/torquedev > Suraj, > > Job files are stored under $TORQUE_HOME/server_priv/jobs. If you submit a job you will see the job id with the extension .JB and also a file with a .SC extension. When a MOM has a job running it also has a copy of these files in the $TORQUE_HOME/mom_priv/jobs directory. > > The .JB file is opaque but the .SC file will contain the contents of the script file that was entered for the job. > > Ken > _______________________________________________ > torquedev mailing list > torquedev at supercluster.org > http://www.supercluster.org/mailman/listinfo/torquedev -------------------------- Suraj Prabhakaran German Research School for Simulation Sciences GmbH Laboratory for Parallel Progreamming 52062 Aachen | Germany Tel +49 241 80 99743 Fax +49 241 80 92742 EMail s.prabhakaran at grs-sim.de Web www.grs-sim.de Members: Forschungszentrum J?lich GmbH | RWTH Aachen University Registered in the commercial register of the local court of D?ren (Amtsgericht D?ren) under registration number HRB 5268 Registered office: J?lich Executive board: Prof. Marek Behr Ph.D. | Dr. Norbert Drewes -------------- next part -------------- An HTML attachment was scrubbed... URL: http://www.supercluster.org/pipermail/torquedev/attachments/20120807/7c41ded4/attachment-0001.html From bugzilla-daemon at supercluster.org Fri Aug 10 04:13:22 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Fri, 10 Aug 2012 04:13:22 -0600 (MDT) Subject: [torquedev] [Bug 211] pbs_sched does not read TRQ_IFNAME In-Reply-To: References: Message-ID: <20120810101322.126B1678053@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=211 --- Comment #7 from Taras 2012-08-10 04:13:21 MDT --- I wonder is there any reason why pbs_sched does not listening to any address (as PBS Pro pbs_sched does)? -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Fri Aug 10 06:38:23 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Fri, 10 Aug 2012 06:38:23 -0600 (MDT) Subject: [torquedev] [Bug 213] New: sendmail path in torque.spec influenced by _prefix Message-ID: http://www.clusterresources.com/bugzilla/show_bug.cgi?id=213 Summary: sendmail path in torque.spec influenced by _prefix Product: TORQUE Version: 2.5.x Platform: PC OS/Version: Linux Status: NEW Severity: normal Priority: P5 Component: pbs_server AssignedTo: dbeer at adaptivecomputing.com ReportedBy: viktor.stujber at stuba.sk CC: torquedev at supercluster.org Estimated Hours: 0.0 When using rpmbuild, the torque.spec file supplies its own --with-sendmail=%{sendmail_path} parameter to configure (instead of letting configure autodetect it).It expands to %{!?sendmail_path:%global sendmail_path %{_sbindir}/sendmail}. So it's either an user-supplied path, or _sbindir (which is already an incorrect assumption). However, this is referring to the build's sbindir, not the system's. We use --define "_prefix /opt/torque" to change the install path, and apparently _prefix influences _sbindir, because all Makefiles get SENDMAIL_CMD = /opt/torque/sbin/sendmail which is a nonexistent path. My suggestion is to make the entire --with-sendmail configure parameter optional, only instantiate it when the user provides a specific path, and leave it up to the default configure behavior otherwise. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Fri Aug 10 12:29:28 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Fri, 10 Aug 2012 12:29:28 -0600 (MDT) Subject: [torquedev] [Bug 213] sendmail path in torque.spec influenced by _prefix In-Reply-To: References: Message-ID: <20120810182928.32C81412127E@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=213 Michael Jennings changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |mej at lbl.gov --- Comment #1 from Michael Jennings 2012-08-10 12:29:28 MDT --- The spec file behavior is as expected. To override the default assumptions, simply pass the following parameters to rpmbuild: --define 'sendmail_path /path/to/your/sendmail' -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Fri Aug 10 13:39:18 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Fri, 10 Aug 2012 13:39:18 -0600 (MDT) Subject: [torquedev] [Bug 213] sendmail path in torque.spec influenced by _prefix In-Reply-To: References: Message-ID: <20120810193918.40F5A41213BB@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=213 --- Comment #2 from Viktor ?tujber 2012-08-10 13:39:18 MDT --- So you consider torque trying to run /var/spool/torque/sbin/sendmail to be normal? The workaround you posted does work, but I'd prefer to use the defaults... and the default is flawed - it's hardcoded to SBIN/sendmail, however in our SL6 install it's /usr/lib/sendmail which symlinks to mta-postfix. The default configure script locates this correctly, but torque.spec inserts its own value and breaks it. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Fri Aug 10 15:38:34 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Fri, 10 Aug 2012 15:38:34 -0600 (MDT) Subject: [torquedev] [Bug 213] sendmail path in torque.spec influenced by _prefix In-Reply-To: References: Message-ID: <20120810213834.37C53412184B@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=213 --- Comment #3 from Michael Jennings 2012-08-10 15:38:34 MDT --- (In reply to comment #2) > So you consider torque trying to run /var/spool/torque/sbin/sendmail to be > normal? Your attempt to be snidely witty is noted and appreciated, but --prefix/%{_prefix} and --with-server-home/%{torque_home} are entirely separate and very different settings/variables. :-) But the point is worth making that there's a reason prefix and server home directory are distinct settings, and you've crystallized it quite well in the above. > The workaround you posted does work, but I'd prefer to use the defaults... It's not a "workaround." It's the correct way to override the default assumptions, one of those being that your system is properly configured, and that sendmail is in the correct location mandated by relevant standards. > and the default is flawed - it's hardcoded to SBIN/sendmail, however in our SL6 > install it's /usr/lib/sendmail which symlinks to mta-postfix. $ ls -Fla /usr/*/sendmail* lrwxrwxrwx. 1 root root 30 Jun 21 2011 /usr/lib/sendmail -> /etc/alternatives/mta-sendmail* lrwxrwxrwx. 1 root root 24 Jun 21 2011 /usr/lib/sendmail.postfix -> ../sbin/sendmail.postfix* lrwxrwxrwx. 1 root root 21 Jun 21 2011 /usr/sbin/sendmail -> /etc/alternatives/mta* -rwxr-xr-x. 1 root root 213616 Jun 2 2011 /usr/sbin/sendmail.postfix* That's on SL6, by the way. As you can see, the canonical location is, indeed /usr/sbin/sendmail. > configure script locates this correctly, but torque.spec inserts its own value > and breaks it. If you do not have /usr/sbin/sendmail on your system, your installation is broken and needs to be fixed. :-) -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Sun Aug 12 23:13:33 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Sun, 12 Aug 2012 23:13:33 -0600 (MDT) Subject: [torquedev] [Bug 213] sendmail path in torque.spec influenced by _prefix In-Reply-To: References: Message-ID: <20120813051333.93354412176C@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=213 Chris Samuel changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |chris at csamuel.org --- Comment #4 from Chris Samuel 2012-08-12 23:13:33 MDT --- Whilst we don't use RPMs for Torque here I believe that building Torque RPMs with --prefix should not change where it looks for the sendmail binary. IMHO prefix should only affect where the package you are building is putting files, not where it is expecting to find system binaries. Regarding the correct location for sendmail, this FHS document explains it well: http://www.pathname.com/fhs/2.2/fhs-4.7.html # For historical reasons, /usr/lib/sendmail must be a symbolic link # to /usr/sbin/sendmail if the latter exists. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Mon Aug 13 07:57:36 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Mon, 13 Aug 2012 07:57:36 -0600 (MDT) Subject: [torquedev] [Bug 214] New: qsub weird behavior when using filter script Message-ID: http://www.clusterresources.com/bugzilla/show_bug.cgi?id=214 Summary: qsub weird behavior when using filter script Product: TORQUE Version: 2.5.x Platform: PC OS/Version: Windows Status: NEW Severity: enhancement Priority: P5 Component: clients AssignedTo: knielson at adaptivecomputing.com ReportedBy: tgray26 at umiacs.umd.edu CC: torquedev at supercluster.org Estimated Hours: 0.0 When using a filter script I've found that qsub will not honor the resource list if the -I (interactive) flag appears before the -l (resource list) in the arguments. I.E. if I have a filter (even with just #!/bin/bash and nothing else) and I run "qsub -I -l mem=12GB" I will get the queue defaults instead of 12GB, but if I run "qsub -l mem=12GB -I" I will get the 12GB. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Mon Aug 13 07:58:02 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Mon, 13 Aug 2012 07:58:02 -0600 (MDT) Subject: [torquedev] [Bug 214] qsub weird behavior when using filter script In-Reply-To: References: Message-ID: <20120813135802.9B3E1412298F@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=214 tgray26 changed: What |Removed |Added ---------------------------------------------------------------------------- Platform|PC |All OS/Version|Windows |Linux -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From knielson at adaptivecomputing.com Mon Aug 13 11:59:01 2012 From: knielson at adaptivecomputing.com (Ken Nielson) Date: Mon, 13 Aug 2012 11:59:01 -0600 Subject: [torquedev] TORQUE mailing lists back up Message-ID: Hi all, You may have noticed the TORQUE mailing list has been pretty quiet the last few days. The problems have been fixed and we are back up. Regards Ken Nielson Adaptive Computing -------------- next part -------------- An HTML attachment was scrubbed... URL: http://www.supercluster.org/pipermail/torquedev/attachments/20120813/7c0a4231/attachment.html From bugzilla-daemon at supercluster.org Mon Aug 13 13:12:48 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Mon, 13 Aug 2012 13:12:48 -0600 (MDT) Subject: [torquedev] [Bug 213] sendmail path in torque.spec influenced by _prefix In-Reply-To: References: Message-ID: <20120813191248.71ED34121479@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=213 --- Comment #5 from Michael Jennings 2012-08-13 13:12:48 MDT --- (In reply to comment #4) > Whilst we don't use RPMs for Torque here I believe that building Torque RPMs > with --prefix should not change where it looks for the sendmail binary. IMHO > prefix should only affect where the package you are building is putting files, > not where it is expecting to find system binaries. That's certainly a reasonable comment. Perhaps it should be changed to "/usr/sbin/sendmail" instead of "%{_prefix}/sendmail" to avoid this. Unfortunately, the auto-detection is unreliable and failure not properly flagged which is why the setting exists to begin with. Ultimately, the most correct solution might be to allow --with-sendmail to be specified without a path so that auto-detection can occur but make failure a fatal error. Too many times, mysterious behavior has eventually been tracked to non-fatal sendmail binary detection failures at RPM build time. :-/ -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Mon Aug 13 19:03:27 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Mon, 13 Aug 2012 19:03:27 -0600 (MDT) Subject: [torquedev] [Bug 214] qsub weird behavior when using filter script In-Reply-To: References: Message-ID: <20120814010327.753A54122AFF@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=214 Chris Samuel changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |chris at csamuel.org --- Comment #1 from Chris Samuel 2012-08-13 19:03:27 MDT --- That appears to be a regression in 2.5 from 2.4 then as it works OK here. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Tue Aug 14 13:01:40 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Tue, 14 Aug 2012 13:01:40 -0600 (MDT) Subject: [torquedev] [Bug 214] qsub weird behavior when using filter script In-Reply-To: References: Message-ID: <20120814190140.102CC4121944@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=214 tgray26 changed: What |Removed |Added ---------------------------------------------------------------------------- Severity|enhancement |major -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Wed Aug 15 07:56:18 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Wed, 15 Aug 2012 07:56:18 -0600 (MDT) Subject: [torquedev] [Bug 202] trqauthd patches for daemon mode In-Reply-To: References: Message-ID: <20120815135618.61A8E41210A6@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=202 --- Comment #2 from Bas van der Vlies 2012-08-15 07:56:18 MDT --- Any comment on this 'simple' patch -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Wed Aug 15 07:59:02 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Wed, 15 Aug 2012 07:59:02 -0600 (MDT) Subject: [torquedev] [Bug 200] pbs_mkdirs chk_tree for checkpoint dir is wrong In-Reply-To: References: Message-ID: <20120815135902.4EE6541212F0@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=200 --- Comment #1 from Bas van der Vlies 2012-08-15 07:59:02 MDT --- One part of this patch is committed. So any comment on ths 'simple' patch? -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Wed Aug 15 11:29:55 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Wed, 15 Aug 2012 11:29:55 -0600 (MDT) Subject: [torquedev] [Bug 212] server spins on select() with expired sockets In-Reply-To: References: Message-ID: <20120815172955.15D3441211D8@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=212 David Beer changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED CC| |dbeer at adaptivecomputing.com Resolution| |FIXED --- Comment #1 from David Beer 2012-08-15 11:29:54 MDT --- I checked a fix into 4.1-fixes. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From s.prabhakaran at grs-sim.de Thu Aug 16 07:55:18 2012 From: s.prabhakaran at grs-sim.de (Suraj Prabhakaran) Date: Thu, 16 Aug 2012 15:55:18 +0200 Subject: [torquedev] Torque Maui Communication during job submission Message-ID: <076A035A-3C13-4021-94A3-DD263A1CB0E0@grs-sim.de> Hello, I have been looking into torque and maui communication for some days. I have a question regarding job submission. During a qsub command, does Maui get the information about the qsub only from torque or does it also get directly from the client? Again, any pointers to torque-maui documentation with more descriptions could be very helpful! Best regards, Suraj -------------------------- Suraj Prabhakaran German Research School for Simulation Sciences GmbH Laboratory for Parallel Progreamming 52062 Aachen | Germany Tel +49 241 80 99743 Fax +49 241 80 92742 EMail s.prabhakaran at grs-sim.de Web www.grs-sim.de Members: Forschungszentrum J?lich GmbH | RWTH Aachen University Registered in the commercial register of the local court of D?ren (Amtsgericht D?ren) under registration number HRB 5268 Registered office: J?lich Executive board: Prof. Marek Behr Ph.D. | Dr. Norbert Drewes -------------- next part -------------- An HTML attachment was scrubbed... URL: http://www.supercluster.org/pipermail/torquedev/attachments/20120816/d76aaec0/attachment.html From bugzilla-daemon at supercluster.org Thu Aug 16 16:06:01 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Thu, 16 Aug 2012 16:06:01 -0600 (MDT) Subject: [torquedev] [Bug 214] qsub weird behavior when using filter script In-Reply-To: References: Message-ID: <20120816220601.DC1CB2578003@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=214 tgray26 changed: What |Removed |Added ---------------------------------------------------------------------------- Status|NEW |RESOLVED Resolution| |FIXED --- Comment #2 from tgray26 2012-08-16 16:06:01 MDT --- This behavior was being exhibited in 2.5.8, but I just upgraded one of our clusters to 2.5.12 and the problem was gone. Looking in the change logs I believe this was fixed in 2.5.10 by this change: b - Removed a check for Interactive jobs in qsub and the -l flag. This check appeared to be code that was never completed and it prevented the passing of resource arguments. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Thu Aug 16 19:59:22 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Thu, 16 Aug 2012 19:59:22 -0600 (MDT) Subject: [torquedev] [Bug 209] pbs_server rejects Obits with cray_enabled In-Reply-To: References: Message-ID: <20120817015922.21F676780C1@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=209 --- Comment #1 from Matt Ezell 2012-08-16 19:59:21 MDT --- It looks like this might be fixed by revision 6675. Next time I get a chance, I'll upgrade and report back. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Fri Aug 17 09:05:09 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Fri, 17 Aug 2012 09:05:09 -0600 (MDT) Subject: [torquedev] [Bug 209] pbs_server rejects Obits with cray_enabled In-Reply-To: References: Message-ID: <20120817150509.311BD4122011@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=209 David Beer changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |dbeer at adaptivecomputing.com --- Comment #2 from David Beer 2012-08-17 09:05:09 MDT --- Matt, Sorry I neglected to update this bug. Yes, this should be fixed as of revision 6675. I'll wait for you to verify before I close the bug. David (In reply to comment #1) > It looks like this might be fixed by revision 6675. Next time I get a chance, > I'll upgrade and report back. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Tue Aug 21 09:13:33 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Tue, 21 Aug 2012 09:13:33 -0600 (MDT) Subject: [torquedev] [Bug 211] pbs_sched does not read TRQ_IFNAME In-Reply-To: References: Message-ID: <20120821151333.EEE6241213C8@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=211 --- Comment #8 from Taras 2012-08-21 09:13:33 MDT --- Any comments about listening to any address? -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Tue Aug 21 20:17:06 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Tue, 21 Aug 2012 20:17:06 -0600 (MDT) Subject: [torquedev] [Bug 211] pbs_sched does not read TRQ_IFNAME In-Reply-To: References: Message-ID: <20120822021706.BBD9941218BD@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=211 --- Comment #9 from Chris Samuel 2012-08-21 20:17:06 MDT --- (In reply to comment #7) > I wonder is there any reason why pbs_sched does not listening to any address > (as PBS Pro pbs_sched does)? I suspect that's just something that the PBSPro people added after they took it closed source. I don't see any reason why pbs_sched shouldn't be able to do this but as Ken said the question is whether torque.cfg is the right place. I'd have thought so if Ken hadn't pointed out it's only for qsub (but then I'd have expected it to be called qsub.cfg). Reading the manual page for pbs_sched_cc (we didn't have it installed as we don't use it here) it does say it takes a -c option for a config file, but it won't read one at all if it's not passed to it at startup. I'd say that'd be the place to define it.. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Wed Aug 22 03:23:55 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Wed, 22 Aug 2012 03:23:55 -0600 (MDT) Subject: [torquedev] [Bug 211] pbs_sched does not read TRQ_IFNAME In-Reply-To: References: Message-ID: <20120822092355.170A94122D3C@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=211 --- Comment #10 from Taras 2012-08-22 03:23:54 MDT --- Ok, Chris, I got it about torque.cfg. We also don't use pbs_sched, but some our customers are using it on complex network configurations. So, my second question was why pbs_sched can not listen to ANY address ALL the time. In this case no config parameter is needed at all. Maybe there are any security reasons to not listen to any address? -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Wed Aug 22 09:41:48 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Wed, 22 Aug 2012 09:41:48 -0600 (MDT) Subject: [torquedev] [Bug 211] pbs_sched does not read TRQ_IFNAME In-Reply-To: References: Message-ID: <20120822154148.DF7334121FEB@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=211 Michael Jennings changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |mej at lbl.gov --- Comment #11 from Michael Jennings 2012-08-22 09:41:48 MDT --- (In reply to comment #10) > So, my second question was why pbs_sched can not listen to ANY address ALL the > time. In this case no config parameter is needed at all. Maybe there are any > security reasons to not listen to any address? That would be an understatement. :-) I'm sure Bright Computing has security people on staff. I bet one of them could go into more detail for you, but the short answer is that you never want private services listening publicly. It's an unnecessary risk. -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Wed Aug 22 10:28:02 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Wed, 22 Aug 2012 10:28:02 -0600 (MDT) Subject: [torquedev] [Bug 211] pbs_sched does not read TRQ_IFNAME In-Reply-To: References: Message-ID: <20120822162802.1E8AA4121FFA@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=211 --- Comment #12 from Taras 2012-08-22 10:28:01 MDT --- Hi Michael, > I'm sure Bright Computing has security people on staff. I bet one of them > could go into more detail for you, but the short answer is that you never want > private services listening publicly. It's an unnecessary risk. Sure, of cause I agree (and secure people will agree as well) that this is a bad practice in general. But: 1. On computing clusters usually all ports (except several) are closed for external interfaces. 2. PBS Pro pbs_sched and pbs_mom listen to any (ok, lets suppose for now there are no security people in PBS Pro team). 3. TORQUE pbs_server listens to any: tcp 0 0 0.0.0.0:15001 0.0.0.0:* LISTEN 3682/pbs_server Could you, please, explain if you follow the rule "do not listen to any" then why TORQUE pbs_server does not follow this rule as well? -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From bugzilla-daemon at supercluster.org Wed Aug 22 13:18:21 2012 From: bugzilla-daemon at supercluster.org (bugzilla-daemon at supercluster.org) Date: Wed, 22 Aug 2012 13:18:21 -0600 (MDT) Subject: [torquedev] [Bug 211] pbs_sched does not read TRQ_IFNAME In-Reply-To: References: Message-ID: <20120822191821.3586941221F9@http.supercluster.org> http://www.clusterresources.com/bugzilla/show_bug.cgi?id=211 --- Comment #13 from Michael Jennings 2012-08-22 13:18:21 MDT --- (In reply to comment #12) > > the short answer is that you never want > > private services listening publicly. It's an unnecessary risk. > > Sure, of cause I agree (and secure people will agree as well) that this is a > bad practice in general. > > But: > > 1. On computing clusters usually all ports (except several) are closed for > external interfaces. Closed by what? By definition, the ports aren't closed if there's something listening on them. And if you're referring to a firewall...well, they fail. :-) > 2. PBS Pro pbs_sched and pbs_mom listen to any (ok, lets suppose for now there > are no security people in PBS Pro team). Note the word "private" in my previous comment. pbs_mom is not a private service. In TORQUE, pbs_sched is. Maybe it's not in PBSPro; I have no idea. The two diverged a long time ago, and just because they share an ancestry doesn't mean one can make assumptions about commonalities of current behavior. If one could, we'd all be climbing trees and slinging poo like the other primates. ;-) > 3. TORQUE pbs_server listens to any: > tcp 0 0 0.0.0.0:15001 0.0.0.0:* LISTEN 3682/pbs_server See above. pbs_server needs to listen to other hosts. pbs_sched doesn't AFAIK. > Could you, please, explain if you follow the rule "do not listen to any" then > why TORQUE pbs_server does not follow this rule as well? See above. :-) As with most products, the defaults are configured for the general case. They won't cover every possible use case for every possible user. For the majority of users, pbs_sched listening on localhost only is the correct choice. Same for trqauthd. Sure, they could default to listening on 0.0.0.0, but that would violate the Principle of Least Privilege (see https://developer.apple.com/library/mac/#documentation/Security/Conceptual/Security_Overview/SecuritySvcs/SecuritySvcs.html#//apple_ref/doc/uid/TP40002650-SW4 for more). -- Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug. From knielson at adaptivecomputing.com Thu Aug 30 14:34:11 2012 From: knielson at adaptivecomputing.com (Ken Nielson) Date: Thu, 30 Aug 2012 14:34:11 -0600 Subject: [torquedev] TORQUE 4.1.1 available Message-ID: TORQUE version 4.1.1 is now available for general download. There were several bugs fixed in this version of TORQUE. Several deadlock issues were fixed around the combination of job arrays and routing queues. x11-forwarding was fixed for interactive jobs. There were fixes for memory corruption and double free. There were 5 memory leaks that were fixed. The mail feature we re-enabled. It had been removed in earlier versions of TORQUE 4.x For a complete list of fixes see the CHANGELOG. We want to thank The University of Michigan, NOAA, University of Florida, LBNL and Cray for their help in finding and fixing many of the bugs for this release. We also appreciate the contributions made by others to the code base. The tar ball for this release can be downloaded at the following URL. http://www.adaptivecomputing.com/support/download-center/torque-download/torque-4.1.1.tar.gz Thanks again for all of the help. The feedback from the community is what makes TORQUE the best it can be. Regards Ken Nielson Adaptive Computing -------------- next part -------------- An HTML attachment was scrubbed... URL: http://www.supercluster.org/pipermail/torquedev/attachments/20120830/eb3da18a/attachment.html From glen.beane at gmail.com Thu Aug 30 15:03:26 2012 From: glen.beane at gmail.com (Glen Beane) Date: Thu, 30 Aug 2012 17:03:26 -0400 Subject: [torquedev] TORQUE 4.1.1 available In-Reply-To: References: Message-ID: On Thu, Aug 30, 2012 at 4:34 PM, Ken Nielson wrote: > TORQUE version 4.1.1 is now available for general download. > > There were several bugs fixed in this version of TORQUE. Several deadlock > issues were fixed around the combination of job arrays and routing queues. > x11-forwarding was fixed for interactive jobs. > There were fixes for memory corruption and double free. > There were 5 memory leaks that were fixed. > The mail feature we re-enabled. It had been removed in earlier versions of > TORQUE 4.x > > For a complete list of fixes see the CHANGELOG. Hi Ken, is there a direct URL for the CHANGELOG or do I need to download the whole .tar.gz? From go-yoshimura at sstc.co.jp Fri Aug 31 06:03:29 2012 From: go-yoshimura at sstc.co.jp (Go Yoshimura) Date: Fri, 31 Aug 2012 12:03:29 -0000 Subject: [torquedev] mistake in torque-4.1.1/src/server/node_manager.c Message-ID: <201208311204.AA14038@winxp-pc.sstc.co.jp> Hello, I found mistake in torque-4.1.1/src/server/node_manager.c. [test01 at torque03 server]$ gcc -DHAVE_CONFIG_H -I. -I../../src/include -I../../src/include -DPBS_SERVER_HOME=\"/var/torque\" -DPBS_ENVIRON=\"/var/torque/pbs_environment\" `xml2-config --cflags` -g - O2 -W -Wall -Wno-unused-parameter -Wno-long-long -pedantic -Werror -MT node_manager.o -MD -MP -MF .deps/node_manager.Tpo -c -o node_manager.o node_manager.c cc1: warnings being treated as errors node_manager.c: In function ?is_reserved_property?: node_manager.c:3814: warning: comparison of unsigned expression < 0 is always false [test01 at torque03 server]$ cp -p node_manager.c node_manager.c.orig [test01 at torque03 server]$ vi node_manager.c [test01 at torque03 server]$ diff -u node_manager.c node_manager.c.orig --- node_manager.c 2012-08-31 20:42:57.000000000 +0900 +++ node_manager.c.orig 2012-08-25 05:42:59.000000000 +0900 @@ -3811,7 +3811,7 @@ { if ((strncmp(prop, "ppn", strlen("ppn")) == 0) || - (strncmp(prop, "gpus", strlen("gpus")) == 0) || + (strncmp(prop, "gpus", strlen("gpus") == 0)) || (strncasecmp(prop, "exclusive_thread", strlen("exclusive_thread")) == 0) || (strncasecmp(prop, "exclusive", strlen("exclusive")) == 0) || (strncasecmp(prop, "exclusive_process", strlen("exclusive_process")) == 0) || [test01 at torque03 server]$ gcc -DHAVE_CONFIG_H -I. -I../../src/include -I../../src/include -DPBS_SERVER_HOME=\"/var/torque\" -DPBS_ENVIRON=\"/var/torque/pbs_environment\" `xml2-config --cflags` -g - O2 -W -Wall -Wno-unused-parameter -Wno-long-long -pedantic -Werror -MT node_manager.o -MD -MP -MF .deps/node_manager.Tpo -c -o node_manager.o node_manager.c [test01 at torque03 server]$ pwd /usr/local/src/torque-4.1.1/src/server ((reproduce)) (1) with "--enable-gcc-warnings" ./configure --prefix=/usr/local/torque --with-server-home=/var/torque --enable-docs --with-rcp=rcp --enable-gcc-warnings make --jobs=4 gcc -DHAVE_CONFIG_H -I. -I../../src/include -I../../src/include -DPBS_SERVER_HOME=\"/var/torque\" -DPBS_ENVIRON=\"/var/torque/pbs_environment\" `xml2-config --cflags` -g -O2 -W -Wall -Wno-unused- parameter -Wno-long-long -pedantic -Werror -MT dis_read.o -MD -MP -MF .deps/dis_read.Tpo -c -o dis_read.o dis_read.c cc1: warnings being treated as errors array_func.c: In function ?modify_array_range?: array_func.c:1545: warning: dereferencing type-punned pointer will break strict-aliasing rules make[3]: *** [array_func.o] Error 1 make[3]: *** Waiting for unfinished jobs.... mv -f .deps/accounting.Tpo .deps/accounting.Po mv -f .deps/attr_recov.Tpo .deps/attr_recov.Po mv -f .deps/dis_read.Tpo .deps/dis_read.Po make[3]: Leaving directory `/usr/local/src/torque-4.1.1/src/server' make[2]: *** [all-recursive] Error 1 make[2]: Leaving directory `/usr/local/src/torque-4.1.1/src/server' make[1]: *** [all-recursive] Error 1 make[1]: Leaving directory `/usr/local/src/torque-4.1.1/src' make: *** [all-recursive] Error 1 (2)skip above error with -fno-strict-aliasing [test01 at torque03 torque-4.1.1]$ cd src/server/ [test01 at torque03 server]$ gcc -DHAVE_CONFIG_H -I. -I../../src/include -I../../src/include -DPBS_SERVER_HOME=\"/var/torque\" -DPBS_ENVIRON=\"/var/torque/pbs_environment\" `xml2-config --cflags` -g - O2 -W -Wall -Wno-unused-parameter -Wno-long-long -pedantic -Werror -MT array_func.o -MD -MP -MF .deps/array_func.Tpo -c -o array_func.o array_func.c cc1: warnings being treated as errors array_func.c: In function ?modify_array_range?: array_func.c:1545: warning: dereferencing type-punned pointer will break strict-aliasing rules [test01 at torque03 server]$ gcc -DHAVE_CONFIG_H -I. -I../../src/include -I../../src/include -DPBS_SERVER_HOME=\"/var/torque\" -DPBS_ENVIRON=\"/var/torque/pbs_environment\" `xml2-config --cflags` -g - O2 -W -Wall -Wno-unused-parameter -Wno-long-long -pedantic -Werror -MT array_func.o -MD -MP -MF .deps/array_func.Tpo -c -o array_func.o array_func.c -fno-strict-aliasing (3)back to the top of src and re-run make [test01 at torque03 server]$ cd .. [test01 at torque03 src]$ cd .. [test01 at torque03 torque-4.1.1]$ make gcc -DHAVE_CONFIG_H -I. -I../../src/include -I../../src/include -DPBS_SERVER_HOME=\"/var/torque\" -DPBS_ENVIRON=\"/var/torque/pbs_environment\" `xml2-config --cflags` -g -O2 -W -Wall -Wno-unused- parameter -Wno-long-long -pedantic -Werror -MT node_manager.o -MD -MP -MF .deps/node_manager.Tpo -c -o node_manager.o node_manager.c cc1: warnings being treated as errors node_manager.c: In function ?is_reserved_property?: node_manager.c:3814: warning: comparison of unsigned expression < 0 is always false make[3]: *** [node_manager.o] Error 1 make[3]: Leaving directory `/usr/local/src/torque-4.1.1/src/server' make[2]: *** [all-recursive] Error 1 make[2]: Leaving directory `/usr/local/src/torque-4.1.1/src/server' make[1]: *** [all-recursive] Error 1 make[1]: Leaving directory `/usr/local/src/torque-4.1.1/src' make: *** [all-recursive] Error 1 ---- Go Yoshimura Scalable Systems Co., Ltd. Osaka Office HONMACHI-COLLABO Bldg. 4F, 4-4-2 Kita-kyuhoji-machi, Chuo-ku, Osaka 541-0057 Japan Tel: 81-6-6224-4115 Tokyo Kojimachi Office BUREX Kojimachi 11F, 3-5-2 Kojimachi, Chiyoda-ku, Tokyo 102-0083 Japan Tel: 81-3-5875-4718 Fax: 81-3-3237-7612