[torqueusers] error messages
Coyle, James J [ITACD]
jjc at iastate.edu
Mon Jan 28 13:20:36 MST 2013
Assuming you're the cluster admin, the easiest way is to place
$spool_as_final_name true
in
/var/spool/torque/mom_priv/config
on the batch nodes and restart the moms, then you don't have any output going to the modes.
This also allows users to look at their output while the job runs.
I do find that the output gets appended, so you might want to delete the output files before the job starts.
This also avoids the problem of users filling up
/var/spool/torque with junk. On my old smaller system, I had users writing 2GB plus output files.
It didn't take many to fill up /var and then subsequent "good" jobs failed due to /var being full.
James Coyle, PhD
High Performance Computing Group
Iowa State Univ.
web: http://jjc.public.iastate.edu/<http://www.public.iastate.edu/~jjc>
From: torqueusers-bounces at supercluster.org [mailto:torqueusers-bounces at supercluster.org] On Behalf Of Luiz Carlos dos Santos
Sent: Monday, January 28, 2013 8:23 AM
To: torqueusers at supercluster.org
Cc: luiz at if.usp.br
Subject: [torqueusers] error messages
My system has put error messages of jobs in the "/var/spool/torque/undelivered", in the node where the job is running, instead of the local directory where the job is installed. Please, how can I solve this problem.
Thanks,
Luiz Carlos dos Santos
Analista de Sistemas - IFUSP/FMT
Instituto de Física da USP
Departamento de Física dos Materiais e Mecânica
Pça. do Oceanográfico - Trav E, s/nº
Edifício Alessandro Volta, Bloco C - sala 112
CEP 05508-120 - São Paulo SP
Fone: (11) 3091-6784 / Fax: (11) 3091-6831
E-mail: luiz at if.usp.br<mailto:luiz at if.usp.br>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20130128/fa068657/attachment.html
More information about the torqueusers
mailing list