[torqueusers] Maui goes into sbwait state
csamuel at vpac.org
Sat Feb 23 20:56:45 MST 2008
----- "vaibhav agrawal" <agrvaibhav at gmail.com> wrote:
> Hi People,
> I have torque resource manager(ver. 2.1.6) with 100 FreeBSD boxes and
> using maui(3.2.6p16) as scheduler.
> maui often goes into sbwait state for a while(20-30 mins) and it
> doesn't schedule anything.
Given your top snippet shows pbs_server busy whilst Maui
is doing nothing it might be worth checking your Maui logs
at the time to see if it is complaining about not getting
any responses from pbs_server.
Now, as to why that might be happening, could it be a
dodgy node or flakey network ?
Worth looking at the pbs_server logs at the same time..
Christopher Samuel - (03) 9925 4751 - Systems Manager
The Victorian Partnership for Advanced Computing
P.O. Box 201, Carlton South, VIC 3053, Australia
VPAC is a not-for-profit Registered Research Agency
More information about the torqueusers