[torqueusers] specific nodes
Lloyd Brown
lloyd_brown at byu.edu
Wed Nov 30 12:52:12 MST 2011
Ricardo,
Have you seen section 4.1.4 ("Mapping a Queue to a Subset of Resources")
in the Torque documentation? It might give you some ideas. However,
the short answer to your question, as seen in that section is this:
> TORQUE does not currently provide a simple mechanism for mapping queues to nodes. However, schedulers such as Moab and Maui can provide this functionality.
Lloyd Brown
Systems Administrator
Fulton Supercomputing Lab
Brigham Young University
http://marylou.byu.edu
On 11/30/2011 12:37 PM, Ricardo Román Brenes wrote:
> Hello everyone thanks for the time of reading and the long post :P
>
>
> The question is about multiple queues with Torque:
>
>
> We have here different clusternodes with difrente architectures:
> 4 PS-3
> 3 CPU+GPU
> 2 CPU
>
> and i want to be able to send jobs to each of hte nodes independly
> (using torque). Im guessing that having several queues and that each
> node belonging to a queue in particular and then submittint jobs to that
> queue will do the trick:
>
> say i got 4 queues
> IBMCELL with the 4 PS-3
> TESLA with the 3 nodes that have GPUs
> XEON with te 5 nodes that have Xeons (which in turn 3 of them have
> teslas :P)
>
> and when i submit a job:
> qsub -q IBMCELL a.pbs
> should run on the PS-3 only, but im not being able to make it work like
> that.
>
> As a test i made 2 queues in the PS3 pbs_server ("uno" and "dos"):
>
> #
> # Create queues and set their attributes.
> #
> #
> # Create and define queue uno
> #
> *create queue uno
> **set queue uno queue_type = Execution
> **set queue uno acl_host_enable = False
> **set queue uno acl_hosts = zarate-0+zarate-1
> **set queue uno enabled = True
> **set queue uno started = True
> *#
> # Create and define queue dos
> #
> *create queue dos
> **set queue dos queue_type = Execution
> **set queue dos acl_host_enable = **False**
> **set queue dos acl_hosts = zarate-2+zarate-3
> **set queue dos enabled = True
> **set queue dos started = True
> *#
> # Set server attributes.
> #
> set server scheduling = True
> set server acl_hosts = zarate-0
> set server log_events = 511
> set server mail_from = adm
> set server scheduler_iteration = 600
> set server node_check_rate = 150
> set server tcp_timeout = 6
> set server next_job_number = 22
>
>
> and i changed the _nodes_ file in the server_priv directory so it is
> like this (zarate are just the hostname :P):
>
>
> zarate-0 np=2 uno
> zarate-1 np=2 uno
> zarate-2 np=2 dos
> zarate-3 np=2 dos
>
>
>
> but its not working... when i launch a job:
>
> #PBS -N mpi_hello
> /usr/local/bin/mpiexec -n 8 /home/rroman/a.out
>
>
> with teh command:
>
> #PBS -N mpi_hello
>
> /usr/local/bin/mpiexec -n 8 /home/rroman/a.out
>
>
> the output file is:
>
> zarate-1: hello world from process 2 of 8
> zarate-2: hello world from process 5 of 8
> zarate-2: hello world from process 6 of 8
> zarate-3: hello world from process 0 of 8
> zarate-3: hello world from process 7 of 8
> zarate-1: hello world from process 3 of 8
> zarate-0: hello world from process 4 of 8
> zarate-3: hello world from process 1 of 8
>
>
>
> And there it shows that the job is running in ALL the nodes instead of
> running only in zarate-0 and zarate-1 as the queue said (according to me :P)
>
>
>
>
> SO! the question is: is it possible to do waht i want like this? and if
> so, what am i doing wrong! :P
>
> Thank you Kay!
>
> -ricardo
>
>
>
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
More information about the torqueusers
mailing list