[torquedev] pbs_mom changing UID and GID

Glen Beane glen.beane at gmail.com
Thu Oct 13 12:06:37 MDT 2011


In July Chris Samuel reported a bug on this list regarding torque
2.4.13 - 2.4.15 where the UID and GID of pbs_mom could get set to a
user if the $tmpdir option were being used.  In this email thread Ken
Nielson originally said this bug was present in 2.5 (and 3.0?), and
then later stated it affected 2.4 only.


I am running TORQUE 2.5.6 on my cluster, and we are seeing certain
cases where the UID of pbs_mom changes after cleaning up a temporary
directory. It seems this may be the same problem.  Was this fixed in a
later 2.5 release?


here is a snippet of the pbs_mom logs:

10/12/2011 08:41:14;0080;
pbs_mom;Job;32765.scyld.localdomain;removing transient job directory
/scratch/32765.s
cyld.localdomain
10/12/2011 08:43:15;0002;   pbs_mom;Svr;pbs_mom;Torque Mom Version =
2.5.6, loglevel = 010/12/2011 08:48:01;0001;
pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in task_save,
error on open
10/12/2011 08:48:01;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission
denied (13) in task_save, error on open
10/12/2011 08:48:01;0080;
pbs_mom;Job;32764.scyld.localdomain;scan_for_terminated: job
32764.scyld.localdomain task 1 terminated, sid=9489210/12/2011
08:48:01;0008;   pbs_mom;Job;32764.scyld.localdomain;job was
terminated
10/12/2011 08:48:01;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission
denied (13) in job_save, cannot open file '
/var/spool/torque/mom_priv/jobs/32764.scyld.localdomain.JB' for job
32764.scyld.localdomain in state EXITING (quick)
10/12/2011 08:48:01;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission
denied (13) in task_save, error on open10/12/2011 08:48:01;0001;
pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in
scan_for_exiting, cannot bin
d to reserved port in client_to_svr - errno: 13 Permission
denied10/12/2011 08:48:01;0001;
pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in
scan_for_exiting, cannot bin
d to reserved port in client_to_svr - errno: 13 Permission
denied10/12/2011 08:48:01;0001;
pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in
scan_for_exiting, cannot bin
d to reserved port in client_to_svr - errno: 13 Permission
denied10/12/2011 08:48:01;0001;
pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in
scan_for_exiting, cannot bind to reserved port in client_to_svr -
errno: 13 Permission denied
10/12/2011 08:48:02;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission
denied (13) in scan_for_exiting, cannot bind to reserved port in
client_to_svr - errno: 13 Permission denied
10/12/2011 08:48:03;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission
denied (13) in scan_for_exiting, cannot bind to reserved port in
client_to_svr - errno: 13 Permission denied
10/12/2011 08:48:03;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission
denied (13) in scan_for_exiting, cannot bind to reserved port in
client_to_svr - errno: 13 Permission denied
10/12/2011 08:48:03;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission
denied (13) in scan_for_exiting, cannot bind to reserved port in
client_to_svr - errno: 13 Permission denied
10/12/2011 08:48:03;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission
denied (13) in scan_for_exiting, cannot bind to reserved port in
client_to_svr - errno: 13 Permission denied
10/12/2011 08:48:03;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission
denied (13) in scan_for_exiting, cannot bind to reserved port in
client_to_svr - errno: 13 Permission denied
10/12/2011 08:48:03;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission
denied (13) in scan_for_exiting, cannot bind to reserved port in
client_to_svr - errno: 13 Permission denied
10/12/2011 08:48:04;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission
denied (13) in scan_for_exiting, cannot bind to reserved port in
client_to_svr - errno: 13 Permission denied
10/12/2011 08:48:05;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission
denied (13) in scan_for_exiting, cannot bind to reserved port in
client_to_svr - errno: 13 Permission denied


More information about the torquedev mailing list