[torqueusers] Not enough of the right type of nodes are available
vipingjo
vipingjo at gmail.com
Tue Aug 14 02:02:59 MDT 2012
Hi all,
I'm running TORQUE on an Ubuntu 10.04 LTS virtual machine. Only one user is submitting jobs, but only one job runs at a time and the next one stays queued. Can anyone tell me how to configure it so that more jobs run concurrently? Thanks!
Here are some commands and output. If more information is needed, please let me know.
$cat 37.N3.1.573.pbs
#!/bin/bash
#PBS -d /var/N3.0_lane_opt/
#PBS -o /var/37.N3.0.573.pbs.o
#PBS -e /var/37.N3.0.573.pbs.e
#PBS -l nodes=1:ppn=1
/usr/bin/time -v perl lt.pl
$cat 37.N3.1.576.pbs
#!/bin/bash
#PBS -d /var/N3.1_lane_opt/
#PBS -o /var/37.N3.1.576.pbs.o
#PBS -e /var/37.N3.1.576.pbs.e
#PBS -l nodes=1:ppn=1
/usr/bin/time -v perl opt.pl
$qstat
Job id                    Name             User            Time Use S Queue
------------------------- ---------------- --------------- -------- - -----
313.ubuntu                37.N3.1.573.pbs  viping          00:28:32 R dque
314.ubuntu                37.N3.1.576.pbs  viping                 0 Q dque
$qstat -f 314
...
Priority = 0
Rerunable = True
Resource_List.ncpus = 1
Resource_List.nodect = 1
Resource_List.nodes = 1:ppn=1
Resource_List.walltime = 1000:00:00
...
comment = Not Running: Not enough of the right type of nodes are available
Qmgr: list queue dque
Queue dque
queue_type = Execution
total_jobs = 2
state_count = Transit:0 Queued:1 Held:0 Waiting:0 Running:1 Exiting:0
max_running = 4
resources_max.ncpus = 4
resources_max.nodes = 2
resources_min.ncpus = 1
resources_default.ncpus = 1
resources_default.neednodes = 1:ppn=1
resources_default.nodect = 1
resources_default.nodes = 1
resources_default.walltime = 1000:00:00
resources_assigned.ncpus = 1
resources_assigned.nodect = 1
max_user_run = 6
enabled = True
started = True
Qmgr: q
root@ubuntu:/usr/bin# qmgr
Max open servers: 4
Qmgr: list server
Server ubuntu
server_state = Active
scheduling = True
total_jobs = 2
state_count = Transit:0 Queued:1 Held:0 Waiting:0 Running:1 Exiting:0
acl_hosts = igenas
default_queue = dque
log_events = 511
mail_from = adm
resources_assigned.ncpus = 1
resources_assigned.nodect = 1
scheduler_iteration = 600
node_check_rate = 150
tcp_timeout = 6
pbs_version = 2.3.6
next_job_number = 315
net_counter = 6 2 1
#cat /var/lib/torque/server_name
igenas
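One more thing that may be worth checking is how many job slots the server believes the node has, since each job above asks for nodes=1:ppn=1 and a node declared with a single processor would accept only one such job at a time. The following is only a sketch under assumptions: the nodes file location /var/lib/torque/server_priv/nodes is inferred from the server_name path above and has not been confirmed on this machine, and count_slots is a hypothetical helper name, not part of TORQUE.

```shell
#!/bin/sh
# Sum the np= values in a TORQUE nodes file.
# A node line without np= is counted as 1 slot (the TORQUE default).
# count_slots is a hypothetical helper, not a TORQUE command.
count_slots() {
    awk '
        /^[ \t]*#/ || /^[ \t]*$/ { next }    # skip comments and blank lines
        {
            np = 1                             # default when np= is absent
            for (i = 2; i <= NF; i++)
                if ($i ~ /^np=[0-9]+$/) { split($i, kv, "="); np = kv[2] }
            total += np
        }
        END { print total + 0 }                # prints 0 for an empty file
    ' "$1"
}

# Assumed path; adjust if the spool directory differs:
# count_slots /var/lib/torque/server_priv/nodes
```

If the number printed is 1, that would be consistent with the scheduler comment above. Running pbsnodes -a should show the same per-node np value together with the node state.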
Regards,
Viping
2012-08-14
vipingjo