[torqueusers] [torquedev] GPU Hardware

Christopher Samuel samuel at unimelb.edu.au
Tue Nov 2 16:52:46 MDT 2010


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On 22/10/10 03:00, "Mgr. Šimon Tóth" wrote:

> Several cards in one machine. Users should be able to
> select the amount of cards (or even specific type of
> card), Torque needs to make sure that each job will
> get its own requested cards (dedicated).

I guess the simplest way to do this would be for Torque
to set all the GPU cards to root only access on startup
(if no jobs are running) and then set file permissions
appropriately per job.

The main issue there will be if a user starts two jobs
on the same box then there will be the possibility of
clashes over which GPUs it can access.

The real solution would be cgroups with device whitelists
but (a) that's not available in RHEL/CentOS (yet) and
(b) there were reports that cgroups imposes quite a heavy
performance overhead when in use (not sure whether that's
still the case).

cheers,
Chris
- -- 
 Christopher Samuel - Senior Systems Administrator
 VLSCI - Victorian Life Sciences Computational Initiative
 Email: samuel at unimelb.edu.au Phone: +61 (0)3 903 55545
         http://www.vlsci.unimelb.edu.au/

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.10 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iEYEARECAAYFAkzQlj4ACgkQO2KABBYQAh9pqwCfaaB9VWdllDUIrq1Qb0sybUFg
mVAAnA3IJDzoyMn0AQWNybYfMHD0yckb
=JDfl
-----END PGP SIGNATURE-----


More information about the torqueusers mailing list