[torquedev] pbs_mom changing UID and GID

Glen Beane glen.beane at gmail.com
Mon Oct 17 08:33:32 MDT 2011


Has anyone else seen this with 2.5.6?  Has upgrading helped?
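
For anyone who wants to check their own nodes, here is a minimal sketch of one way to confirm the symptom on Linux. It assumes pbs_mom is supposed to be running as root (UID and GID 0) and reads the Uid/Gid lines from /proc/<pid>/status, which list the real, effective, saved, and filesystem IDs in that order. The function names parse_ids and is_still_root are hypothetical helpers made up for this example, not part of TORQUE.

```python
from pathlib import Path


def parse_ids(status_text):
    """Extract (real, effective) UID and GID from /proc/<pid>/status text."""
    ids = {}
    for line in status_text.splitlines():
        key, _, rest = line.partition(":")
        if key in ("Uid", "Gid"):
            # Fields are: real, effective, saved, filesystem
            real, effective = rest.split()[:2]
            ids[key.lower()] = (int(real), int(effective))
    return ids


def is_still_root(pid):
    """Return True if the process has real and effective UID/GID of 0."""
    # Assumption: a Linux /proc filesystem is mounted and pid exists.
    ids = parse_ids(Path(f"/proc/{pid}/status").read_text())
    return ids.get("uid") == (0, 0) and ids.get("gid") == (0, 0)
```

Calling is_still_root() with the PID of pbs_mom before and after a job's transient directory is removed should show whether the daemon's IDs changed.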



On Thu, Oct 13, 2011 at 2:06 PM, Glen Beane <glen.beane at gmail.com> wrote:
> In July Chris Samuel reported a bug on this list regarding TORQUE
> 2.4.13 - 2.4.15 where the UID and GID of pbs_mom could get set to
> those of a user if the $tmpdir option was in use.  In that email
> thread Ken Nielson originally said this bug was present in 2.5 (and
> 3.0?), and then later stated it affected 2.4 only.
>
>
> I am running TORQUE 2.5.6 on my cluster, and we are seeing certain
> cases where the UID of pbs_mom changes after cleaning up a temporary
> directory. It seems this may be the same problem.  Was this fixed in a
> later 2.5 release?
>
>
> Here is a snippet of the pbs_mom logs:
>
> 10/12/2011 08:41:14;0080;   pbs_mom;Job;32765.scyld.localdomain;removing transient job directory /scratch/32765.scyld.localdomain
> 10/12/2011 08:43:15;0002;   pbs_mom;Svr;pbs_mom;Torque Mom Version = 2.5.6, loglevel = 0
> 10/12/2011 08:48:01;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in task_save, error on open
> 10/12/2011 08:48:01;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in task_save, error on open
> 10/12/2011 08:48:01;0080;   pbs_mom;Job;32764.scyld.localdomain;scan_for_terminated: job 32764.scyld.localdomain task 1 terminated, sid=94892
> 10/12/2011 08:48:01;0008;   pbs_mom;Job;32764.scyld.localdomain;job was terminated
> 10/12/2011 08:48:01;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in job_save, cannot open file '/var/spool/torque/mom_priv/jobs/32764.scyld.localdomain.JB' for job 32764.scyld.localdomain in state EXITING (quick)
> 10/12/2011 08:48:01;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in task_save, error on open
> 10/12/2011 08:48:01;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in scan_for_exiting, cannot bind to reserved port in client_to_svr - errno: 13 Permission denied
> 10/12/2011 08:48:01;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in scan_for_exiting, cannot bind to reserved port in client_to_svr - errno: 13 Permission denied
> 10/12/2011 08:48:01;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in scan_for_exiting, cannot bind to reserved port in client_to_svr - errno: 13 Permission denied
> 10/12/2011 08:48:01;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in scan_for_exiting, cannot bind to reserved port in client_to_svr - errno: 13 Permission denied
> 10/12/2011 08:48:02;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in scan_for_exiting, cannot bind to reserved port in client_to_svr - errno: 13 Permission denied
> 10/12/2011 08:48:03;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in scan_for_exiting, cannot bind to reserved port in client_to_svr - errno: 13 Permission denied
> 10/12/2011 08:48:03;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in scan_for_exiting, cannot bind to reserved port in client_to_svr - errno: 13 Permission denied
> 10/12/2011 08:48:03;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in scan_for_exiting, cannot bind to reserved port in client_to_svr - errno: 13 Permission denied
> 10/12/2011 08:48:03;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in scan_for_exiting, cannot bind to reserved port in client_to_svr - errno: 13 Permission denied
> 10/12/2011 08:48:03;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in scan_for_exiting, cannot bind to reserved port in client_to_svr - errno: 13 Permission denied
> 10/12/2011 08:48:03;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in scan_for_exiting, cannot bind to reserved port in client_to_svr - errno: 13 Permission denied
> 10/12/2011 08:48:04;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in scan_for_exiting, cannot bind to reserved port in client_to_svr - errno: 13 Permission denied
> 10/12/2011 08:48:05;0001;   pbs_mom;Svr;pbs_mom;LOG_ERROR::Permission denied (13) in scan_for_exiting, cannot bind to reserved port in client_to_svr - errno: 13 Permission denied
>

