[Mauiusers] Problem with Torque/Maui
S Ranjan
sranjan at ipr.res.in
Wed Jan 24 14:22:17 MST 2007
Hi
We have only pbs_server and maui running on the head node. pbs_mom is
not running on the head node.
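
One way to double-check this from a process listing is a small helper like the one below. It is only a rough sketch: the function name has_pbs_mom is made up for illustration, and it assumes the daemon appears in the listing under the name pbs_mom (with or without a leading path), as in the ps output quoted further down.

```shell
#!/bin/sh
# has_pbs_mom (hypothetical helper name): reads a process listing on
# stdin and succeeds (exit status 0) only if some whitespace-separated
# field has the basename "pbs_mom". Comparing whole basenames avoids
# matching the helper's own command line.
has_pbs_mom() {
    awk '{
        for (i = 1; i <= NF; i++) {
            n = split($i, p, "/")        # p[n] is the basename of field i
            if (p[n] == "pbs_mom") found = 1
        }
    }
    END { exit !found }'
}
```

On the head node it could be used as: ps -ef | has_pbs_mom && echo "pbs_mom is running here"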
Thanks & regards
Sutapa Ranjan
Donald Tripp wrote:
> What does
>
> ps -ef | grep pbs
>
> return? You should see something like this:
>
> ... /usr/local/pbs/i686/sbin/pbs_server
>
> If you see
>
> ... /usr/local/pbs/i686/sbin/pbs_mom -r
>
> then the head node is also configured to run jobs.
>
> Also, check
>
> # chkconfig --list | grep pbs
> pbs_server 0:off 1:off 2:off 3:on 4:on 5:on 6:off
>
> You should only see pbs_server on the head node. If you see pbs_mom,
> then you need to turn it off and remove it:
>
> # chkconfig pbs_mom off
> # chkconfig --del pbs_mom
>
>
>
> - Donald Tripp
> dtripp at hawaii.edu
> ----------------------------------------------
> HPC Systems Administrator
> High Performance Computing Center
> University of Hawai'i at Hilo
> 200 W. Kawili Street
> Hilo, Hawaii 96720
> http://www.hpc.uhh.hawaii.edu
>
>
> On Jan 23, 2007, at 5:21 PM, S Ranjan wrote:
>
>> Hi
>>
>> I have torque pbs_server running on the head node, which is also the
>> submit host. There are 32 other compute nodes, listed in the
>> /var/spool/torque/server_priv/nodes file. There is a single queue at
>> present. Sometimes MPI jobs requesting 28/30 nodes end up
>> running on the head node, though the head node is not a compute node
>> at all. netstat -anp shows several sockets being opened for the
>> job, and eventually the head node hangs.
>> I would appreciate any help or suggestions on this.
>>
>> Sutapa
>> _______________________________________________
>> mauiusers mailing list
>> mauiusers at supercluster.org
>> http://www.supercluster.org/mailman/listinfo/mauiusers
>
>