Bug 122 - pbs_mom needs config option to trigger job specific temporary directory creation/removal on the Mother Superior only
Status: NEW
Product: TORQUE
Component: pbs_mom
Version: 2.4.x
Platform: All Linux
Importance: P5 normal
Assigned To: Ken Nielson
Reported: 2011-04-19 19:56 MDT by Chris Samuel
Modified: 2011-07-12 21:26 MDT

Attachments
Only remove $tmpdir on the mother superior, otherwise drop through (2.03 KB, patch)
2011-04-19 19:56 MDT, Chris Samuel
New patch (against 2.4.15) to only remove $TMPDIR on the MS node (441 bytes, patch)
2011-07-12 21:26 MDT, Chris Samuel



Description Chris Samuel 2011-04-19 19:56:06 MDT
Created an attachment (id=76)
Only remove $tmpdir on the mother superior, otherwise drop through

/* Found on 2.4, but exists in 2.5, 3.0 and trunk too */

If you have a global scratch filesystem that is used for job-specific
temporary directories via the pbs_mom directive $tmpdir, then there is
no need for every pbs_mom in the job to create and remove the same
directory at the same time.

It would be great to have a directive in the pbs_mom.conf file that would
cause pbs_mom to do the creation/removal only on the Mother Superior
and not on the rest of the sisterhood.
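
To make the request concrete, here is a minimal sketch of the check such a
directive could drive. It is not taken from the attached patch or from the
TORQUE source: the structs, the tmpdir_ms_only flag and the function name
are all hypothetical, and the real pbs_mom job and config structures differ.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical view of a job as seen by one pbs_mom; not TORQUE's real struct. */
struct job_view {
    bool is_mother_superior;   /* true on the node coordinating the job */
};

/* Hypothetical config state; tmpdir_ms_only is NOT an existing directive. */
struct mom_config {
    bool tmpdir_ms_only;       /* true: only the Mother Superior touches $tmpdir */
};

/* Returns true when this pbs_mom should create or remove the job's
 * temporary directory. */
bool should_manage_tmpdir(const struct mom_config *cfg,
                          const struct job_view *job)
{
    assert(cfg && job);

    if (!cfg->tmpdir_ms_only)
        return true;                 /* option off: every mom acts, as today */

    return job->is_mother_superior;  /* option on: sisters skip the work */
}
```

With the option off, behaviour is unchanged, so sites with node-local
scratch would not be affected. With it on, only one pbs_mom per job touches
the shared directory tree.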

At VLSCI we've had to hack this in unconditionally to work around an
apparent bug in the version of Panasas we've just upgraded to, which
causes a large number of pbs_mom processes to hang if they all try
to remove the same directory tree at the same time.

I've attached the trivial patch we've used as an example (though of
course without the check for any configuration setting).

Comment 1 Chris Samuel 2011-07-12 21:26:12 MDT
Created an attachment (id=80)
New patch (against 2.4.15) to only remove $TMPDIR on the MS node