[torquedev] [Bug 95] Support for GPUs

bugzilla-daemon at supercluster.org bugzilla-daemon at supercluster.org
Thu Nov 4 09:27:38 MDT 2010


http://www.clusterresources.com/bugzilla/show_bug.cgi?id=95

dbeer at adaptivecomputing.com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |dbeer at adaptivecomputing.com

--- Comment #1 from dbeer at adaptivecomputing.com 2010-11-04 09:27:38 MDT ---
This is something that we have been thinking about at Adaptive as well. Our
thoughts for the first cycle through are a little bit differently than is
suggested.

In the first pass, we are planning to report GPUs just like pbs_server does
with processor assignment. We are going to allow the user to specify gpus=X in
the nodes file, and then TORQUE will track which GPU is assigned to which job
and report it. I'm thinking that $PBS_NODEFILE will add a line for each gpu:

hostname gpu<index>

And from there, each job should handle things (like grabbing the correct gpu),
just like TORQUE does by default with cpus. Eventually, we have a number of
features we would like to add, but first we want to release the feature as I
have described it, and once this is used and people have need for more, we will
add those features. I am convinced that this feature will be easy to add to
TORQUE (Ken and I are actually working on it right now and we will be done
soon) and will significantly improve GPU usage for our users. 

>From there, some of the features we want to add include the autodetection of
gpus and exclusive access to the assigned gpus.

-- 
Configure bugmail: http://www.clusterresources.com/bugzilla/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.


More information about the torquedev mailing list