[torqueusers] How to debug torque/maui

Jacques Foury Jacques.Foury at math.u-bordeaux1.fr
Thu Aug 30 08:05:51 MDT 2007


Albert Shih wrote:
> Hi all,
>
>
> Stupid question from a newbie.
>
> I'm supposed to manage a torque/maui cluster, but when there is a problem
> I don't know which tools to use (like grep/vi) or where to look for it.
>
> For example:
>
> 	Currently a user has submitted a job through qsub, and this job doesn't
> 	start. I think I know the reason. But how/where can I find some
> 	information about this job?
>
> Where can I find a document like «torque/maui for dummies-sys-admin»?
>
> Regards.
>   

Hi Albert.

The main Maui log file is:
/var/spool/maui/log/maui.log

You can control its size, verbosity and rotation in maui.cfg:

LOGFILE                 maui.log
LOGFILEMAXSIZE          50000000
LOGLEVEL                3
LOGFILEROLLDEPTH        3


For Torque it is no easier. The server log is in:

/var/lib/torque/server_log/20070830   (one file each day)
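
Since the file name looks like the date written as YYYYMMDD, a small shell
helper can build today's path. This is only a sketch: todays_server_log is a
made-up name, and the YYYYMMDD naming is inferred from the example above.

```shell
# Hypothetical helper: print the path of today's log file in a log directory,
# assuming one file per day named YYYYMMDD (as in 20070830 above).
todays_server_log() {
    logdir=$1
    printf '%s/%s\n' "$logdir" "$(date +%Y%m%d)"
}
```

For example, to follow today's server log as it is written:
tail -f "$(todays_server_log /var/lib/torque/server_log)"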


If you know which node the job went to (if it started), there is another
log file on that node, in /var/lib/torque/mom_log

Then you play detective: look through the events in each file to understand
what happened ;-)
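
As a starting point for that detective work, here is a minimal sketch that
searches every log under the given paths for one job ID. The function name
find_job_events is made up, and it assumes a grep that supports -r (GNU and
BSD grep both do).

```shell
# Hypothetical helper: print every log line that mentions a job ID.
# Usage: find_job_events JOBID PATH...
find_job_events() {
    jobid=$1
    shift
    # -r: descend into directories
    # -F: treat the job ID as a literal string, so the "." is not a wildcard
    # -H: always print the file name
    # -n: print the line number inside the file
    grep -rFHn -- "$jobid" "$@"
}
```

For example, with a made-up job ID 1234.myserver:
find_job_events 1234.myserver /var/spool/maui/log /var/lib/torque/server_log
Each match is printed as file:line:text, so you can see which daemon logged
what. On the compute node, point it at /var/lib/torque/mom_log instead.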


And about the tools, I use grep, vi, awk, cut and so on ;-)
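
To give an idea of the awk side, here is a rough sketch that pulls the
timeline of one job out of the Torque logs. It assumes the log records are
semicolon-separated (date and time; event code; daemon; object type; object
name; message), which you should check against your own files. The name
job_timeline is made up.

```shell
# Hypothetical helper: print "timestamp  message" for one job.
# Assumed record layout (check your own logs):
#   date time;event code;daemon;object type;object name;message
# Usage: job_timeline JOBID FILE...
job_timeline() {
    jobid=$1
    shift
    # Keep only records whose 5th field is exactly the job ID.
    # A message that itself contains ";" is cut at the first ";".
    awk -F';' -v id="$jobid" '$5 == id { print $1 "  " $6 }' "$@"
}
```

For example, with a made-up job ID:
job_timeline 1234.myserver /var/lib/torque/server_log/20070830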

-- 

Jacques Foury
Institut de Mathématiques de Bordeaux
Université Bordeaux 1 / CNRS




