[torqueusers] Torque not honoring max_user_queuable

Ti Leggett leggett at mcs.anl.gov
Thu Feb 9 14:41:12 MST 2012


Is there more configuration I need to do to make this effective with a routing queue?

On Feb 4, 2012, at 11:44 AM, Ti Leggett wrote:

> All jobs go through a routing queue.
> 
> On Feb 3, 2012, at 11:44 AM, David Beer wrote:
> 
>> I'm also curious - is this done through a routing queue or routing queues? Is it class remapping in Moab? It looks like it isn't qsub -q <queue>
>> 
>> David
>> 
>> On Fri, Feb 3, 2012 at 10:15 AM, Ti Leggett <leggett at mcs.anl.gov> wrote:
>>   submit_args = -A CI-MCB000083 -l walltime=48:00:00,
>>       mppwidth=48 /lustre/beagle/linpyl/project.qsub
>> 
>> On Feb 3, 2012, at 11:03 AM, David Beer wrote:
>> 
>>> If you qstat -f a few of the jobs you can see the submit arguments. At higher log levels the entire job submission is there, but I don't known if your log levels would be that high.
>>> 
>>> David
>>> 
>>> On Fri, Feb 3, 2012 at 9:21 AM, Ti Leggett <leggett at mcs.anl.gov> wrote:
>>> I'm assuming using qsub, but it's other users doing this so I'm not 100% sure. Is there a way to find out from logs or other tools?
>>> 
>>> On Feb 3, 2012, at 10:06 AM, David Beer wrote:
>>> 
>>>> Ti,
>>>> 
>>>> How are you submitting the jobs? I assume this is TORQUE 2.5.9?
>>>> 
>>>> David
>>>> 
>>>> On Fri, Feb 3, 2012 at 8:27 AM, Ti Leggett <leggett at mcs.anl.gov> wrote:
>>>> We've set queue limits that don't seem to be honored:
>>>> 
>>>> sdb:~ # qstat | grep linpyl | grep batch | wc
>>>>   945    5670   82215
>>>> 
>>>> sdb:~ # qmgr -c "print queue batch"
>>>> #
>>>> # Create queues and set their attributes.
>>>> #
>>>> #
>>>> # Create and define queue batch
>>>> #
>>>> create queue batch
>>>> set queue batch queue_type = Execution
>>>> set queue batch max_user_queuable = 500
>>>> set queue batch resources_min.mppwidth = 1
>>>> set queue batch resources_default.mppwidth = 24
>>>> set queue batch resources_default.walltime = 00:10:00
>>>> set queue batch acl_group_enable = False
>>>> set queue batch resources_available.nodes = 726
>>>> set queue batch enabled = True
>>>> set queue batch started = True
>>>> 
>>>> How would it be possible for a user to have 945 jobs in the queue when the limit should be 500?
>>>> _______________________________________________
>>>> torqueusers mailing list
>>>> torqueusers at supercluster.org
>>>> http://www.supercluster.org/mailman/listinfo/torqueusers
>>>> 
>>>> 
>>>> 
>>>> 
>>>> --
>>>> David Beer | Software Engineer
>>>> Adaptive Computing
>>>> 
>>>> _______________________________________________
>>>> torqueusers mailing list
>>>> torqueusers at supercluster.org
>>>> http://www.supercluster.org/mailman/listinfo/torqueusers
>>> 
>>> 
>>> _______________________________________________
>>> torqueusers mailing list
>>> torqueusers at supercluster.org
>>> http://www.supercluster.org/mailman/listinfo/torqueusers
>>> 
>>> 
>>> 
>>> 
>>> --
>>> David Beer | Software Engineer
>>> Adaptive Computing
>>> 
>>> _______________________________________________
>>> torqueusers mailing list
>>> torqueusers at supercluster.org
>>> http://www.supercluster.org/mailman/listinfo/torqueusers
>> 
>> 
>> _______________________________________________
>> torqueusers mailing list
>> torqueusers at supercluster.org
>> http://www.supercluster.org/mailman/listinfo/torqueusers
>> 
>> 
>> 
>> 
>> -- 
>> David Beer | Software Engineer
>> Adaptive Computing
>> 
>> _______________________________________________
>> torqueusers mailing list
>> torqueusers at supercluster.org
>> http://www.supercluster.org/mailman/listinfo/torqueusers
> 

-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 203 bytes
Desc: Message signed with OpenPGP using GPGMail
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20120209/466c7202/attachment-0001.bin 


More information about the torqueusers mailing list