[torqueusers] No Email notification

Jakob Blomqvist Jakob.Blomqvist at mah.se
Mon Mar 7 05:23:18 MST 2011


I realized I could not send you the txt-attachements. Here are some of the stuff:


************************************
qstat.log:
Job Id: 101.super1
    Job_Name = scr.sh
    Job_Owner = jakob at super1
    resources_used.cput = 00:00:00
    resources_used.mem = 0kb
    resources_used.vmem = 0kb
    resources_used.walltime = 00:00:01
    job_state = C
    queue = batch
    server = super1
    Checkpoint = u
    ctime = Mon Mar  7 12:03:57 2011
    Error_Path = super1:/home/jakob/TEST/scr.sh.e101
    exec_host = super1/2+super1/1+super1/0
    Hold_Types = n
    Join_Path = n
    Keep_Files = n
    Mail_Points = abe
    Mail_Users = jakob.blomqvist at mah.se
    mtime = Mon Mar  7 12:03:58 2011
    Output_Path = super1:/home/jakob/TEST/scr.sh.o101
    Priority = 0
    qtime = Mon Mar  7 12:03:57 2011
    Rerunable = True
    Resource_List.ncpus = 1
    Resource_List.neednodes = 1:ppn=3
    Resource_List.nodect = 1
    Resource_List.nodes = 1:ppn=3
    Resource_List.walltime = 00:05:00
    session_id = 7686
    substate = 59
    Variable_List = PBS_O_HOME=/home/jakob,PBS_O_LANG=en_US.UTF-8,
        PBS_O_LOGNAME=jakob,
        PBS_O_PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/b
        in:/usr/games:/home/jakob/scripts:/usr/local/bin/FullProf_Suite:,
        PBS_O_MAIL=/var/mail/jakob,PBS_O_SHELL=/bin/bash,PBS_O_HOST=super1,
        PBS_SERVER=super1,PBS_O_WORKDIR=/home/jakob/TEST,PBS_O_QUEUE=batch
    euser = jakob
    egroup = jakob
    hashname = 101.super1
    queue_rank = 13
    queue_type = E
    comment = Job started on Mon Mar 07 at 12:03
    etime = Mon Mar  7 12:03:57 2011
    exit_status = 0
    submit_args = scr.sh
    start_time = Mon Mar  7 12:03:57 2011
    start_count = 1
    fault_tolerant = False
    comp_time = Mon Mar  7 12:03:58 2011
**************************************

**************************************
server_log.file:

03/07/2011 12:16:25;0100;PBS_Server;Job;102.super1;enqueuing into batch, state 1 hop 1
03/07/2011 12:16:25;0008;PBS_Server;Job;102.super1;Job Queued at request of jakob at super1, owner = jakob at super1, job name = scr.sh, queue = batch
03/07/2011 12:16:25;0040;PBS_Server;Svr;super1;Scheduler was sent the command new
03/07/2011 12:16:25;0008;PBS_Server;Job;102.super1;Job Modified at request of Scheduler at super1
03/07/2011 12:16:25;0008;PBS_Server;Job;102.super1;Job Run at request of Scheduler at super1
03/07/2011 12:16:25;0040;PBS_Server;Svr;super1;Scheduler was sent the command recyc
03/07/2011 12:16:25;0010;PBS_Server;Job;102.super1;Exit_status=0 resources_used.cput=00:00:00 resources_used.mem=0kb resources_used.vmem=0kb resources_used.walltime=00:00:00
03/07/2011 12:16:25;0040;PBS_Server;Svr;super1;Scheduler was sent the command term
03/07/2011 12:17:05;0002;PBS_Server;Svr;PBS_Server;Torque Server Version = 2.4.3, loglevel = 0
03/07/2011 12:17:25;0100;PBS_Server;Job;102.super1;dequeuing from batch, state COMPLETE
03/07/2011 12:17:25;0040;PBS_Server;Svr;super1;Scheduler was sent the command term
03/07/2011 12:22:05;0002;PBS_Server;Svr;PBS_Server;Torque Server Version = 2.4.3, loglevel = 0
03/07/2011 12:27:05;0002;PBS_Server;Svr;PBS_Server;Torque Server Version = 2.4.3, loglevel = 0
03/07/2011 12:27:25;0040;PBS_Server;Svr;super1;Scheduler was sent the command time
****************************************

****************************************
scr.sh:


!/bin/sh
#PBS -l nodes=1:ppn=3
#PBS -l walltime=0:5:00
#PBS -m abe
#PBS -M xxx.xxx at xxx.se

cd $PBS_O_WORKDIR
echo "This is workdir"
echo `pwd`
echo mail location is `which mail`
***************************************

***************************************
scr.sh.o102:

This is workdir
/home/jakob/TEST
mail location is /usr/bin/mail
***************************************




Dr. Jakob Blomquist
IMP, School of Technology
Malmo University
SWEDEN
+46(0)40 6657626
jakob.blomqvist at mah.se
>>> David Beer  03/03/11 7:05 PM >>>

----- Original Message -----
> Hi,
> I have set up Torque-2.4.3 (Server, MOM and Sched) on a single node,
> 16 core AMD machine for scientific calculations, with Ubuntu 10.10
> installed. The machine has a permanent ip address within a university
> network. It is accessed to by ssh. I finally managed to get the qsub
> to work, however email notification will not work. My ./configure was
> following:
> $ ./configure --disable-gcc-warnings
> --with-server-home=/var/spool/pbs/ --with-sendmail=mail
> followed by
> $ make -j 16
> and
> $ sudo make install
> 
> I have mailx installed configured to use university outgoing
> smpt-domain and following command make me receive a testmail in my
> mailbox:
> $ mail -s 'test message' XXXX.XXXX at XXX.se < /dev/null
> 
> I don't even know where to see if torque tried to send mail at all.
> Obviously I use the -m flag and set a mail using -M. But no luck.
> 
> Dr. Jakob Blomquist
> IMP, School of Technology
> Malmo University
> SWEDEN
> +46(0)40 6657626
> jakob.blomqvist at mah.se
> 

Jakob,

I would check for errors in the pbs_server's log file. You could serach the file for a job id that didn't send the job or look for svr_mailowner.

Cheers,

-- 
David Beer 
Direct Line: 801-717-3386 | Fax: 801-717-3738
     Adaptive Computing
     1656 S. East Bay Blvd. Suite #300
     Provo, UT 84606

_______________________________________________
torqueusers mailing list
torqueusers at supercluster.org
http://www.supercluster.org/mailman/listinfo/torqueusers

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20110307/597e1cab/attachment.html 


More information about the torqueusers mailing list