[torquedev] cpuset support
Prakash Velayutham
prakash.velayutham at cchmc.org
Tue Nov 13 11:27:24 MST 2007
Hi Garrick,
Can I ask a stupid question like, what does the current cpuset code in
resmom do?
Thanks,
Prakash
On Nov 12, 2007, at 7:17 PM, Garrick Staples wrote:
> I just bumped into Chris Samuel at his (rather barren) booth here at
> SC07 and I think we just designed cpuset support.
>
> Here's what we came up with...
>
> On startup, pbs_mom will create /dev/cpuset/torque (with all cpus)
> if it
> doesn't already exist and move itself to it. This allows the admin
> to stuff
> pbs_mom inside a smaller cpuset if desired by creating it in the
> initscript.
> We will call this the "torqueset".
>
> When a job starts, pbs_mom will create a per-job cpuset under the
> torqueset
> with the correct cpus called the "jobset". It will do this after
> prologue,
> which allows the admin to pre-create it if desired. This happens on
> all nodes.
>
> Also, per-vnode cpusets will also be created under the jobset at job
> start.
>
> pbs_mom will run the batch script inside of the jobset and all TM
> spawn
> requests will run in the vnodeset.
>
> You end up with cpusets that look like:
> /dev/cpuset/torque/job-123.pbsserver.foo.edu/vnode-4
>
> Job exit will consist of ensuring the cpusets are empty (killing
> processes)
> before removing them.
>
> Exclusive cpusets can't be used because of suspended jobs.
>
> All mems will be added to all cpusets unless someone comes up with
> another idea.
>
> This seems pretty simple to implement, doesn't require any build
> deps, and
> makes sense to me. Any thoughts?
More information about the torquedev
mailing list