[torqueusers] Priv Port errors after upgrade to 4.1.5

David Beer dbeer at adaptivecomputing.com
Mon Mar 4 10:15:16 MST 2013


On Mon, Mar 4, 2013 at 9:24 AM, Joerg Blank <j.blank at fz-juelich.de> wrote:

> Hello everyone,
>
> I just tried to to upgrade from 4.1.4 to 4.1.5 and got a lot of priv
> port errors after startup:
>
> 03/04/2013
> 15:39:35;0001;PBS_Server.29263;Svr;PBS_Server;LOG_ERROR::Error getting
> connection to socket (15096) in tcp_connect_sockaddr, Failed when trying
> to get privileged port - socket_get_tcp_priv() failed
>
> I never saw this in 4.1.4 or earlier. Any idea?
>
> I also get a crash shortly after startup finished (qstat works for like
> 5-10 secs)
>
> #1  0x00007fd4bf4c3c56 in *__GI___strdup (s=0x0) at strdup.c:42
> #2  0x0000000000441876 in stat_to_mom (job_id=0x6656110
> "31699[].glorim-1.cluster", cntl=0x7fd4b00811c0) at req_stat.c:901
> #3  0x0000000000442aad in stat_mom_job (job_id=0x6656110
> "31699[].glorim-1.cluster") at req_stat.c:1120
> #4  0x0000000000442c1d in poll_job_task (ptask=0x66622f0) at
> req_stat.c:1170
> #5  0x000000000045b1a2 in work_thread (a=0x7fd4be39b920) at
> u_threadpool.c:307
> #6  0x00007fd4bf9b88ca in start_thread (arg=<value optimized out>) at
> pthread_create.c:300
> #7  0x00007fd4bf517b6d in clone () at
> ../sysdeps/unix/sysv/linux/x86_64/clone.S:112
>
>
This doesn't appear to be related to any new code, but for some reason you
have a poll job task for a job that has no exec_host list. The routine
needs to protect itself against this. The attached patch fixes this issue.

patch -p1 < tmp.patch # from the directory that contains src/

David


> Regards,
> Jörg Blank
>
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
>



-- 
David Beer | Senior Software Engineer
Adaptive Computing
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20130304/675b7835/attachment.html 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: tmp.patch
Type: application/octet-stream
Size: 593 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20130304/675b7835/attachment.obj 


More information about the torqueusers mailing list