[torqueusers] Re: [Mauiusers] no file STDIN.e# and STDIN.o#

Danny Sternkopf dsternkopf at hpce.nec.com
Tue Aug 5 07:45:19 MDT 2008


Hi,

Your MOM log file, $PBS_SPOOL/mom_logs/<logfile> on the execution node, will
tell you why the file is not there.
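
A minimal sketch of how to pull the relevant lines out of that log, assuming
the log files sit directly under mom_logs/ in the spool directory. The
function name mom_log_lines is made up for illustration, and /var/spool/torque
is only a common default that may not match your installation:

```shell
#!/bin/sh
# Hypothetical helper: print every MOM log line that mentions a given job ID.
# Usage: mom_log_lines <spool-dir> <job-id>
mom_log_lines() {
    spool_dir=$1
    job_id=$2
    # -h drops file names from the output;
    # -F matches the job ID literally (it contains dots).
    grep -hF -- "$job_id" "$spool_dir"/mom_logs/* 2>/dev/null
}

# Example, to be run on the execution node (node001 in this thread).
# The fallback path is an assumption; use your actual spool directory.
# mom_log_lines "${PBS_SPOOL:-/var/spool/torque}" 129.master.perceus.centos
```

The search goes over all files in mom_logs/ rather than a single one, so it
does not depend on knowing which log file covers the time the job ran.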

Best regards,

Danny

Yanan Sun wrote:
> -bash-3.1$ echo "perl helloworld.pl node001" |qsub -q short -l nodes=node001
> 129.master.perceus.centos
> -bash-3.1$ qstat -f
> Job Id: 129.master.perceus.centos
>     Job_Name = STDIN
>     Job_Owner = ys at master.perceus.centos
>     resources_used.cput = 00:00:00
>     resources_used.mem = 0kb
>     resources_used.vmem = 0kb
>     resources_used.walltime = 00:00:00
>     job_state = E
>     queue = short
>     server = master.perceus.centos
>     Checkpoint = u
>     ctime = Tue Aug  5 09:37:01 2008
>     Error_Path = master.perceus.centos:/home/ys/STDIN.e129
>     exec_host = node001/0
>     Hold_Types = n
>     Join_Path = n
>     Keep_Files = n
>     Mail_Points = a
>     mtime = Tue Aug  5 09:37:02 2008
>     Output_Path = master.perceus.centos:/home/ys/STDIN.o129
>     Priority = 0
>     qtime = Tue Aug  5 09:37:01 2008
>     Rerunable = True
>     Resource_List.nodect = 1
>     Resource_List.nodes = node001
>     Resource_List.walltime = 00:05:00
>     session_id = 6452
>     Variable_List = PBS_O_HOME=/home/ys,PBS_O_LANG=en_US.UTF-8,
> 	PBS_O_LOGNAME=ys,
> 	PBS_O_PATH=/usr/kerberos/bin:/usr/local/bin:/bin:/usr/bin:/home/ys/Desktop/cmake-2.6.0-Linux-i386/bin,
> 	PBS_O_MAIL=/var/spool/mail/ys,PBS_O_SHELL=/bin/bash,
> 	PBS_O_HOST=master.perceus.centos,PBS_O_WORKDIR=/home/ys,
> 	PBS_O_QUEUE=short
>     etime = Tue Aug  5 09:37:01 2008
>     exit_status = 126
>     submit_args = -q short -l nodes=node001
> 
> So the error file should be in /home/ys/,
> but it is not there.
> 
> thanks.
> 
> Yanan
> 
> 
> On Fri, Aug 1, 2008 at 3:28 AM, Danny Sternkopf <dsternkopf at hpce.nec.com> wrote:
>> Hi,
>>
>> Check your MOM log file in $PBS_SPOOL/mom_logs/ on the node which was set
>> offline.
>>
>> So there is a difference between node001 and node002.
>>
>> If you run 'qstat -f' on the job, you see where the *.e and *.o files will be
>> stored. Check whether the target host named there can be accessed from these
>> two nodes.
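
The check described above can be sketched as follows. The helper name
spool_target_hosts is hypothetical; it only assumes the "Output_Path = host:/path"
and "Error_Path = host:/path" layout visible in the 'qstat -f' output quoted in
this thread:

```shell
#!/bin/sh
# Hypothetical helper: read `qstat -f` output on stdin and print the host part
# of Output_Path and Error_Path, one unique host per line.
spool_target_hosts() {
    awk '$1 == "Output_Path" || $1 == "Error_Path" {
             split($3, parts, ":")   # the value looks like host:/path
             print parts[1]
         }' | sort -u
}

# Example:
#   qstat -f 129.master.perceus.centos | spool_target_hosts
# Then, on each compute node, check that the printed host can be reached,
# for instance with:
#   ping -c 1 <host>
```

If the host name printed here cannot be resolved or reached from one of the
nodes, that node is the first place to look.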
>>
>> Best regards,
>>
>> Danny
>>
>> Yanan Sun wrote:
>>> I added two nodes to the cluster, node001 and node002.
>>> If I keep both free, I don't get any STDIN.e# and STDIN.o# files.
>>> If I put one offline, I get both files.
>>> Does anyone know why?
>>>
>>>
>>> Yanan

-- 
Danny Sternkopf http://www.nec.de/hpc       dsternkopf at hpce.nec.com
HPCE Division  Germany phone: +49-711-68770-35 fax: +49-711-6877145
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
NEC Deutschland GmbH, Hansaallee 101, 40549 Düsseldorf
Geschäftsführer Yuya Momose
Handelsregister Düsseldorf HRB 57941; VAT ID DE129424743


