[torqueusers] torque-1.2.0p1-snap.1107893767 will not compile
on AIX 5.2 with XLC (and how I worked around it)
Bas van der Vlies
basv at sara.nl
Thu Feb 17 01:11:24 MST 2005
Garrick Staples wrote:
> On Wed, Feb 16, 2005 at 12:26:23PM +0100, Bas van der Vlies alleged:
releases. Please test!
>>
>>I have set various values for $jobstartblocktime, (0 --> 20), but i did
>>not see any slow down in qstat. The load on this test system is not huge.
>
>
> If your pro/epilogues aren't more than 3 or 4 seconds, you'll never notice a
> difference.
>
Thanks for the explanation.
>
>
>>>Another shaky area is with restarting pbs_mom daemons. It should now
>>>be possible to restart any daemon at any time without breaking jobs.
>>>pbsdsh has been enhanced to live in this world of restarting moms. I
>>>can already tell you that mpiexec won't deal with it properly. I'm
>>>worried about these changes effecting the recoverability of failing
>>>jobs. Please test!
>>
>>Must i specify an option to pbs_mom to enable restart the jobs. like
>>'-p' or must it work out of the box. I have tried it without options and
>>the jobs get restarted and an interactive job is killed.
>>
>>With the '-p' option:
>> - Interactive job will be killed
>> - an job is not restarted
>
>
> It should now work as advertised in the pbs_mom manpage. -p would be used to
> recover jobs after restarting mom.
>
> If you run 'pbs_mom -p' under PBSDEBUG you'll see messages about recovering and
> saving stderr, stdout, nodeid, and taskid numbers.
>
Ok i have to use the '-p' option for pbs_mom. Just an quick question
does it also works for multi node jobs
Regards
--
--
********************************************************************
* *
* Bas van der Vlies e-mail: basv at sara.nl *
* SARA - Academic Computing Services phone: +31 20 592 8012 *
* Kruislaan 415 fax: +31 20 6683167 *
* 1098 SJ Amsterdam *
* *
********************************************************************
More information about the torqueusers
mailing list