[torqueusers] Trying to set up Torque and Maui for the 1st time

David Beer dbeer at adaptivecomputing.com
Mon Mar 10 10:25:02 MDT 2014


Do you have an active firewall between pbs_server and the nodes? Is
10.1.102.1 the address of your head node?
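
One rough way to check the firewall question is to test, from the head node, whether the MOM on the compute node accepts a TCP connection at all. The following is only a minimal sketch: it assumes pbs_mom is listening on its default port 15002 (adjust if your site changed it), and the name can_connect is illustrative, not part of Torque. It only shows basic reachability; it does not exercise Torque's own host authorization, so a successful connect does not by itself rule out a configuration problem.

```python
import socket

# Assumption: pbs_mom listens on its default service port, 15002.
MOM_PORT = 15002


def can_connect(host, port=MOM_PORT, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the name and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False
```

Run on the head node, something like can_connect("compute-0-2.local") returning False would point toward a firewall, a name-resolution issue, or a MOM that is not listening; True would suggest looking at the server and MOM configuration instead.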


On Fri, Mar 7, 2014 at 3:49 PM, brown wrap <gramos at yahoo.com> wrote:

> I am working on my first cluster. I have Rocks 6.1 running with RHEL 6 as
> the OS. I downloaded torque-4.2.6.1 and maui-3.3. When I submit a job, it
> gets rejected:
>
> [root at pro ~]# tracejob -a -l  -n 1 23
> /var/spool/torque/mom_logs/20140307: No matching job records located
>
> Job: 23.localhost.localdomain
>
> 03/07/2014 22:02:09  S    enqueuing into batch, state 1 hop 1
> 03/07/2014 22:02:10  S    Job Run at request of root at localhost
> 03/07/2014 22:02:10  S    send of job to compute-0-2.local failed error = 15010
> 03/07/2014 22:02:10  S    unable to run job, MOM rejected/rc=-1
> 03/07/2014 22:02:10  S    unable to run job, send to MOM '167903228' failed
>
> On the client side (pbs_mom log on the compute node):
> 03/07/2014 14:46:09;0001;   pbs_mom.11247;Svr;pbs_mom;LOG_ERROR::No child processes (10) in tcp_request, bad connect from 10.1.102.1:235
> 03/07/2014 14:46:54;0001;   pbs_mom.11247;Svr;pbs_mom;LOG_ERROR::No child processes (10) in tcp_request, bad connect from 10.1.102.1:966
> 03/07/2014 14:47:39;0001;   pbs_mom.11247;Svr;pbs_mom;LOG_ERROR::No child processes (10) in tcp_request, bad connect from 10.1.102.1:696
> 03/07/2014 14:48:24;0001;   pbs_mom.11247;Svr;pbs_mom;LOG_ERROR::No child processes (10) in tcp_request, bad connect from 10.1.102.1:656
>
> I am not sure where to look for this problem. Any help would be
> appreciated. Thanks.


-- 
David Beer | Senior Software Engineer
Adaptive Computing