[torquedev] [Bug 113] New: munge support (AlternateUserAuthentication) has still some issues on torque-2.5.4
bugzilla-daemon at supercluster.org
Wed Feb 16 08:38:23 MST 2011
http://www.clusterresources.com/bugzilla/show_bug.cgi?id=113
Summary: munge support (AlternateUserAuthentication) has still
some issues on torque-2.5.4
Product: TORQUE
Version: 2.5.x
Platform: All
OS/Version: All
Status: NEW
Severity: normal
Priority: P5
Component: pbs_server
AssignedTo: dbeer at adaptivecomputing.com
ReportedBy: juergen.hennerich at gmail.com
CC: torquedev at supercluster.org
Estimated Hours: 0.0
When munged is not running on a host or a node, torque creates an empty file in
the credentials directory and does not close its open fd. Over time this may
leave the torque process out of file descriptors.
The error seems to be in req_getcred.c, line 257+: I guess a
"close(fd);unlink(mungeFileName);" is missing.
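The cleanup suggested above can be sketched as follows. This is only an illustration of the close-and-unlink pattern, not the actual code from req_getcred.c: the function write_cred_file, the callback type fill_cred_fn, and the helper fill_always_fails are hypothetical names, and the callback merely stands in for whatever step fails when munged is unavailable.

```c
#include <assert.h>
#include <fcntl.h>
#include <stddef.h>
#include <unistd.h>

/* Hypothetical stand-in for the step that fills the credential file
 * (for example by running the munge client). Returns 0 on success,
 * -1 on failure. This is not a TORQUE function. */
typedef int (*fill_cred_fn)(int fd);

/* Create `path`, let `fill` write the credential, and clean up on every
 * exit path. On failure the descriptor is closed and the empty file is
 * removed, so neither descriptors nor stale files accumulate. */
int write_cred_file(const char *path, fill_cred_fn fill)
{
    int fd;

    assert(path != NULL && fill != NULL);

    fd = open(path, O_CREAT | O_EXCL | O_WRONLY, 0600);
    if (fd < 0)
        return -1;

    if (fill(fd) != 0) {
        close(fd);    /* release the descriptor ... */
        unlink(path); /* ... and remove the empty file */
        return -1;
    }

    return close(fd) == 0 ? 0 : -1;
}

/* Simulates a host where munged is not running: the fill step fails. */
int fill_always_fails(int fd)
{
    (void)fd;
    return -1;
}
```

Calling write_cred_file(path, fill_always_fails) repeatedly should leave no file at path and should not raise the number of open descriptors, which is the behaviour the missing close/unlink would restore.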
The bigger problem is that even when munge is disabled on a node (with node
submit allowed), a user can still submit or query:
[zrshj01 at n010102 ~]$ qsub -l nodes=1:ppn=24 test.sh
munge: Error: Unable to access "/var/run/munge/munge.socket.2": No such file or directory
2484465.icmu03
02/16/2011 16:32:09;0100;PBS_Server;Req;;Type AlternateUserAuthentication request received from zrshj01 at n010102, sock=12
02/16/2011 16:32:09;0080;PBS_Server;Req;req_reject;Reject reply code=15021(Invalid credential MSG=cannot authenticate), aux=0, type=AlternateUserAuthentication, from zrshj01 at n010102
02/16/2011 16:32:09;0100;PBS_Server;Req;;Type QueueJob request received from zrshj01 at n010102, sock=12
02/16/2011 16:32:09;0100;PBS_Server;Req;;Type JobScript request received from zrshj01 at n010102, sock=12
02/16/2011 16:32:09;0100;PBS_Server;Req;;Type Commit request received from zrshj01 at n010102, sock=12
02/16/2011 16:32:09;0100;PBS_Server;Job;2484465.icmu03;enqueuing into user, state 1 hop 1
02/16/2011 16:32:09;0100;PBS_Server;Job;2484465.icmu03;dequeuing from user, state QUEUED
02/16/2011 16:32:09;0100;PBS_Server;Job;2484465.icmu03;enqueuing into tue-short, state 1 hop 1
02/16/2011 16:32:09;0008;PBS_Server;Job;2484465.icmu03;Job Queued at request of zrshj01 at n010102, owner = zrshj01 at n010102, job name = test.sh, queue = tue-short
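In the log above, the AlternateUserAuthentication request is rejected, yet the QueueJob, JobScript and Commit requests on the same socket are still served. The behaviour one would expect instead can be sketched as a per-connection gate. This is a simplified model under stated assumptions, not pbs_server code: struct conn, enum req_type, enum verdict and handle_request are all hypothetical names invented for illustration.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical request types, named after the entries in the log. */
enum req_type { REQ_ALT_AUTH, REQ_QUEUE_JOB, REQ_JOB_SCRIPT, REQ_COMMIT };

enum verdict { ACCEPT, REJECT };

/* Hypothetical per-connection state: has this socket authenticated? */
struct conn {
    bool authenticated;
};

/* An authentication request records its outcome on the connection.
 * Every other request is served only if that outcome was a success. */
enum verdict handle_request(struct conn *c, enum req_type type, bool cred_valid)
{
    assert(c != NULL);

    if (type == REQ_ALT_AUTH) {
        c->authenticated = cred_valid;
        return cred_valid ? ACCEPT : REJECT;
    }

    /* Assumption: a failed or missing authentication blocks later requests. */
    return c->authenticated ? ACCEPT : REJECT;
}
```

With such a gate, the sequence from the log (a failed authentication followed by QueueJob) would end in a second reject instead of a queued job.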
Juergen Hennerich