[torqueusers] qsub jobs in queue, but not execeuted

jupiter jupiter.hce at gmail.com
Fri Jun 14 00:35:19 MDT 2013


Hi,

I am new to the list, sorry for asking FAQ. I've just installed a
server and a node mom from source 4.2.3 on CentOS 6.4. I am testing it
using pbs_sched. I can submit jobs but all jobs remain in queue (all
jobs are simply bash scripts with one line of "echo test").

$ qstat
Job ID                    Name             User            Time Use S Queue
------------------------- ---------------- --------------- -------- - -----
1.login                    job1.sh          tester                0 Q
batch
2.login                    job2.sh          tester                0 Q
batch
3.login                    job3.sh          tester                0 Q batch

The communication between the server and the node seems fine, but the
jobs did not move to the node:

$ pbsnodes
desktop3
     state = free
     np = 1
     ntype = cluster
     status = rectime=1371190614,varattr=,jobs=,state=free,netload=12520819104,gres=,loadave=0.00,ncpus=1,physmem=3923096kb,availmem=3550292kb,totmem=3923096kb,idletime=690945,nusers=2,nsessions=10,sessions=23723
23914 23939 23957 23987 24817 28528 28553 28571 28604,uname=Linux
desktop3 2.6.32-358.2.1.el6.x86_64 #1 SMP Wed Mar 13 00:26:49 UTC 2013
x86_64,opsys=linux
     mom_service_port = 15002
     mom_manager_port = 15003

There is not error in sched_logs

# tail -f 20130613
06/13/2013 16:41:45;0002; pbs_sched.14843;Svr;Log;Log opened
06/13/2013 16:41:45;0002; pbs_sched.14843;Svr;TokenAct;Account file
/var/spool/torque/sched_priv/accounting/20130613 opened
06/13/2013 16:41:45;0002; pbs_sched.14844;Svr;main;pbs_sched startup pid 14844
06/13/2013 16:44:01;0080; pbs_sched.14844;Svr;main;brk point 10272768
06/13/2013 16:54:05;0080; pbs_sched.14844;Svr;main;brk point 10534912


I suspect that the pbs_sched is not configured properly, but there is
no detail how to set up pbs_sched in the torqueAdminGuide-4.0.2.pdf. I
know MAUI / MOAB should be used, but at this stage I just want to test
the server and node. Appreciate any advice.

By the way, on mom node, is it necessary to run trqauthd, I thought it
is only used for authentication, and I really don't care the
authentication at this stage.

Thank you.

Kind regards,

j


More information about the torqueusers mailing list