[torqueusers] Jobs stay at status "E"

Clotho Tsang wytsang at clustertech.com
Mon Apr 15 20:48:05 MDT 2013


Sometimes I find that jobs stay at status "E".

After some investigation, it is because the computation nodes
unable to scp files back to the job submission node.

One possible cause is that password-less ssh is not set.
One can find the detail error message at /var/log/message
of the computation node.

-- 
Clotho Tsang
Senior Software Engineer
Cluster Technology Limited
Email: clotho at clustertech.com
Tel: (852) 2655-6129
Fax: (852) 2994-2101
Website: www.clustertech.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20130416/de6c499a/attachment.html 


More information about the torqueusers mailing list