[torqueusers] qsub and mpiexec -f machinefile
Tiago Silva (Cefas)
tiago.silva at cefas.co.uk
Wed Feb 19 05:40:09 MST 2014
My MPI code is normally executed across a set of nodes with something like:
mpiexec -f machinefile -np 6 ./bin
where the machinefile has 6 entries with node names, for instance:
Now the issue here is that this list has been optimised to balance the load between nodes and to reduce internode communication. So for instance model domain tiles 0 and 1 will run on n01 while tiles 2 to 5 will run on n02.
Is there a way to integrate this into qsub since I don't know which nodes will be assigned before submission? Or in other words can I control grouping processes in one node?
In my example I used 6 processes for simplicity but normally I parallelise across 4-16 nodes and >100 processes.
This email and any attachments are intended for the named recipient only. Its unauthorised use, distribution, disclosure, storage or copying is not permitted.
If you have received it in error, please destroy all copies and notify the sender. In messages of a non-business nature, the views and opinions expressed are the author's own
and do not necessarily reflect those of Cefas.
Communications on Cefas’ computer systems may be monitored and/or recorded to secure the effective operation of the system and for other lawful purposes.
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the torqueusers