[torqueusers] Trying to set up Torque and Maui for the 1st time

brown wrap gramos at yahoo.com
Fri Mar 7 15:49:50 MST 2014


I am working on my first cluster. I have Rocks 6.1 running with RHEL 6 as the os. I downloaded torque-4.2.6.1 and maui-3.3. Basically what is happening is I submit a job and it gets rejected:

[root at pro ~]# tracejob -a -l  -n 1 23
/var/spool/torque/mom_logs/20140307: No matching job records located

Job: 23.localhost.localdomain

03/07/2014 22:02:09  S    enqueuing into batch, state 1 hop 1
03/07/2014 22:02:10  S    Job Run at request of root at localhost
03/07/2014 22:02:10  S    send of job to compute-0-2.local failed error = 15010
03/07/2014 22:02:10  S    unable to run job, MOM rejected/rc=-1
03/07/2014 22:02:10  S    unable to run job, send to MOM '167903228' failed

On the client side:
03/07/2014 14:46:09;0001;   pbs_mom.11247;Svr;pbs_mom;LOG_ERROR::No child processes (10) in tcp_request, bad connect from 10.1.102.1:235
03/07/2014 14:46:54;0001;   pbs_mom.11247;Svr;pbs_mom;LOG_ERROR::No child processes (10) in tcp_request, bad connect from 10.1.102.1:966
03/07/2014 14:47:39;0001;   pbs_mom.11247;Svr;pbs_mom;LOG_ERROR::No child processes (10) in tcp_request, bad connect from 10.1.102.1:696
03/07/2014 14:48:24;0001;   pbs_mom.11247;Svr;pbs_mom;LOG_ERROR::No child processes (10) in tcp_request, bad connect from 10.1.102.1:656


I am not sure where to look for this problem. Any help would be appreciated. Thanks.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20140307/fb2c62fd/attachment-0001.html 


More information about the torqueusers mailing list