[torqueusers] [torquedev] GPU Hardware
samuel at unimelb.edu.au
Tue Nov 2 16:52:46 MDT 2010
-----BEGIN PGP SIGNED MESSAGE-----
On 22/10/10 03:00, "Mgr. Šimon Tóth" wrote:
> Several cards in one machine. Users should be able to
> select the amount of cards (or even specific type of
> card), Torque needs to make sure that each job will
> get its own requested cards (dedicated).
I guess the simplest way to do this would be for Torque
to set all the GPU cards to root only access on startup
(if no jobs are running) and then set file permissions
appropriately per job.
The main issue there will be if a user starts two jobs
on the same box then there will be the possibility of
clashes over which GPUs it can access.
The real solution would be cgroups with device whitelists
but (a) that's not available in RHEL/CentOS (yet) and
(b) there were reports that cgroups imposes quite a heavy
performance overhead when in use (not sure whether that's
still the case).
Christopher Samuel - Senior Systems Administrator
VLSCI - Victorian Life Sciences Computational Initiative
Email: samuel at unimelb.edu.au Phone: +61 (0)3 903 55545
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
-----END PGP SIGNATURE-----
More information about the torqueusers