[torqueusers] Maui probs

Kevin Van Workum vanw at sabalcore.com
Mon Dec 28 10:28:09 MST 2009


On Thu, Dec 24, 2009 at 1:37 PM, <skip at pobox.com> wrote:

>
> Sorry to bother this list with what I think is a Maui question, but
> I've so far been unable to subscribe to the mauiusers mailing list, and
> my message to help at supercluster.org has gone unanswered.  I hope someone
> here can help me out.
>
> I fired up a few thousand batch jobs via qsub last night.  This morning,
> after standing reservations should have excluded new jobs from running
> (in my mind at least), I marked all nodes offline:
>
>    sudo pbsnodes -o $(pbsnodes -a | egrep '^[a-z]')
>
> That worked fine.
>
> However, now I want to start things up again.  I cleared the offline
> flag for a bunch of nodes:
>
>    sudo pbsnodes -c huron tuba ruth ...
>
> and sure enough, pbsnodes -a shows them as free, for example:
>
>    tuba.wacker
>         state = free
>         np = 1
>         ntype = cluster
>         status = opsys=solaris7,...
>
> Unfortunately, I don't see any running jobs:
>
>    % qstat | wc -l ; qstat | egrep ' R ' | wc -l
>        2601
>           0
>
> I would have thought free nodes could start hosting jobs again.  I must
> be misunderstanding something.  Any suggestions appreciated.
>

Try restarting Maui.
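
To check whether jobs begin running afterwards, the qstat check quoted
above can be made a little stricter by matching on the state column
rather than grepping for ' R ' anywhere in the line. This is only a
minimal sketch: the helper name count_state is hypothetical, and it
assumes the default qstat layout (Job id, Name, User, Time Use, S, Queue),
in which the state is the fifth whitespace-separated field.

```shell
# Hypothetical helper: count jobs in a given state from qstat-style
# output read on stdin.
# Assumption: default qstat columns, so the state letter is field 5.
# The header and separator lines never have a single state letter in
# field 5, so they are not counted.
count_state() {
    awk -v want="$1" '$5 == want { n++ } END { print n + 0 }'
}

# Usage, on a host where qstat is available:
#   qstat | count_state R    # running jobs
#   qstat | count_state Q    # queued jobs
```

If the running count stays at zero while the queued count stays high,
the nodes being free in pbsnodes has not yet translated into the
scheduler starting jobs.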


>
> --
> Skip Montanaro - skip at pobox.com - http://www.smontanaro.net/
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
>



-- 
Kevin Van Workum, PhD
Sabalcore Computing Inc.
Run your code on 500 processors.
Sign up for a free trial account.
www.sabalcore.com
877-492-8027 ext. 11