[Mauiusers] Problem with Torque/Maui

S Ranjan sranjan at ipr.res.in
Wed Jan 24 14:22:17 MST 2007


Hi

We have only pbs_server and maui running on the head node.  pbs_mom is 
not running on the head node.

Thanks & regards

Sutapa Ranjan


Donald Tripp wrote:

> What does
>
> ps -ef | grep pbs
>
> return? You should see something like this:
>
> ...  /usr/local/pbs/i686/sbin/pbs_server
>
> if you see 
>
> ... /usr/local/pbs/i686/sbin/pbs_mom -r 
>
>  then the head node is also configured to run jobs.
>
> also, check
>
> # chkconfig --list | grep pbs
> pbs_server      0:off   1:off   2:off   3:on    4:on    5:on    6:off
>
> you should only see pbs_server on the head node. If you see pbs_mom, 
> then you need to turn it off and remove it
>
> # chkconfig pbs_mom off
> # chkconfig --del pbs_mom
>
>
>
> - Donald Tripp
>  dtripp at hawaii.edu <mailto:dtripp at hawaii.edu>
> ----------------------------------------------
> HPC Systems Administrator
> High Performance Computing Center
> University of Hawai'i at Hilo
> 200 W. Kawili Street
> Hilo,   Hawaii   96720
> http://www.hpc.uhh.hawaii.edu
>
>
> On Jan 23, 2007, at 5:21 PM, S Ranjan wrote:
>
>> Hi
>>
>> I have torque pbs_server running on the headnode, which is also the 
>> submit host.  There are 32 other compute nodes, mentioned in 
>> /var/spool/torque/server_priv/nodes file.  There is a single queue at 
>> present.  Sometimes, mpi jobs requesting for 28/30 nodes, land up 
>> running on the head node, though the head node is not a compute node 
>> at all.  netstat -anp shows several sockets being openend for the 
>> job, and eventually the head node hangs up. 
>> Appreciate any help/suggestion on this.
>>
>> Sutapa
>> _______________________________________________
>> mauiusers mailing list
>> mauiusers at supercluster.org <mailto:mauiusers at supercluster.org>
>> http://www.supercluster.org/mailman/listinfo/mauiusers
>
>



More information about the mauiusers mailing list