[torqueusers] Priv Port errors after upgrade to 4.1.5

David Beer dbeer at adaptivecomputing.com
Mon Mar 4 10:16:14 MST 2013


On Mon, Mar 4, 2013 at 10:15 AM, David Beer <dbeer at adaptivecomputing.com> wrote:

>
>
> On Mon, Mar 4, 2013 at 9:24 AM, Joerg Blank <j.blank at fz-juelich.de> wrote:
>
>> Hello everyone,
>>
>> I just tried to upgrade from 4.1.4 to 4.1.5 and got a lot of priv
>> port errors after startup:
>>
>> 03/04/2013 15:39:35;0001;PBS_Server.29263;Svr;PBS_Server;LOG_ERROR::Error getting connection to socket (15096) in tcp_connect_sockaddr, Failed when trying to get privileged port - socket_get_tcp_priv() failed
>>
>> I never saw this in 4.1.4 or earlier. Any idea?
>>
>> I also get a crash shortly after startup finishes (qstat works for about
>> 5-10 seconds):
>>
>> #1  0x00007fd4bf4c3c56 in *__GI___strdup (s=0x0) at strdup.c:42
>> #2  0x0000000000441876 in stat_to_mom (job_id=0x6656110 "31699[].glorim-1.cluster", cntl=0x7fd4b00811c0) at req_stat.c:901
>> #3  0x0000000000442aad in stat_mom_job (job_id=0x6656110 "31699[].glorim-1.cluster") at req_stat.c:1120
>> #4  0x0000000000442c1d in poll_job_task (ptask=0x66622f0) at req_stat.c:1170
>> #5  0x000000000045b1a2 in work_thread (a=0x7fd4be39b920) at u_threadpool.c:307
>> #6  0x00007fd4bf9b88ca in start_thread (arg=<value optimized out>) at pthread_create.c:300
>> #7  0x00007fd4bf517b6d in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:112
>>
>>
> This doesn't appear to be related to any new code, but for some reason you
> have a poll job task for a job that has no exec_host list. The routine
> needs to protect itself against this. The attached patch fixes this issue.
>
> patch -p1 < tmp.patch # from the directory that contains src/
>
>
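For readers without the attachment: the backtrace shows strdup() being called with s=0x0, and the quoted explanation says the job had no exec_host list. The following is only a minimal sketch of that kind of guard, not the contents of tmp.patch. The struct demo_job type and the copy_exec_host() helper are hypothetical names made up for illustration and are not TORQUE code.

```c
#define _POSIX_C_SOURCE 200809L  /* expose strdup() under strict -std modes */
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for a job record; NOT TORQUE's real job struct. */
struct demo_job {
    const char *job_id;
    const char *exec_host;  /* NULL when the job has no exec_host list */
};

/*
 * Return a heap-allocated copy of the job's exec_host string, or NULL if
 * the job has none (or if allocation fails). The caller frees the result.
 */
char *copy_exec_host(const struct demo_job *job)
{
    /* Guard first: passing NULL to strdup() is undefined behavior and is
     * what frame #1 of the backtrace shows (s=0x0). */
    if (job == NULL || job->exec_host == NULL)
        return NULL;

    return strdup(job->exec_host);
}
```

A caller would treat a NULL return as "nothing to poll for this job" and skip the request to the MOM instead of crashing.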
By the way, we really appreciate your working with us on this. We are
constantly beefing up our regression tests, but actual deployment is
invaluable. Thank you.

David


> David
>
>
>> Regards,
>> Jörg Blank
>>
>> _______________________________________________
>> torqueusers mailing list
>> torqueusers at supercluster.org
>> http://www.supercluster.org/mailman/listinfo/torqueusers
>>
>



-- 
David Beer | Senior Software Engineer
Adaptive Computing