[torqueusers] Job stuck at E state forever
abhig at Princeton.EDU
Tue Mar 31 08:24:37 MDT 2009
Thanks for replying. The solution seems to be working. There were many
jobs stuck in the E state, and after creating the mom config file with a
$usecp statement, all of the jobs in the E state disappeared.
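
For anyone finding this thread later, here is a minimal sketch of the
config file, based on the $usecp line suggested in the quoted message
below. The file location is an assumption (it is the usual default and
may differ on your install), and the /home mapping only makes sense if
/home is the same shared filesystem on the submit host and the compute
nodes:

    # mom config on each compute node, assumed to be at
    # /var/spool/torque/mom_priv/config (adjust for your install)
    #
    # Deliver stdout/stderr with a local cp whenever the destination
    # is under /home on any host, instead of using rcp/scp.
    $usecp *:/home /home

pbs_mom most likely has to be restarted, or told to re-read its config,
before the new line takes effect.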
Glen Beane wrote:
> On Tue, Mar 31, 2009 at 2:08 AM, Abhishek Gupta <abhig at princeton.edu> wrote:
>> Hi Halvor,
>> Some time ago I had the same problem, and some people said it is caused
>> by the rcp implementation in PBS, which can be changed. But I don't know
>> how to change it to cp or scp, which would probably solve the issue. If
>> you have any idea about it, please let me know.
> If you have network-mounted home directories, you can specify cp for
> returning the stdout/stderr files by using the mom config parameter
> $usecp. On a lot of clusters you can get away with something like this:
> $usecp *:/home /home
> which means any time torque needs to copy the stdout/stderr files to a
> home directory on the job's submit host, it will do a local copy to
> /home instead.
> If you need to use ssh, check out this documentation: