[torqueusers] Using TORQUE in a supercomputer with lots of CPU's - one node - gets job-exclusive

David Beer dbeer at adaptivecomputing.com
Thu Feb 6 09:54:34 MST 2014


I'm not sure how the Maui side of the configuration needs to be updated.


On Thu, Feb 6, 2014 at 9:54 AM, David Beer <dbeer at adaptivecomputing.com>wrote:

> Yes, that'd be the recommended way to do it.
>
>
> On Thu, Feb 6, 2014 at 9:57 AM, Silas Silva <silasdb at gmail.com> wrote:
>
>> Yes.  This is SGI Altix.
>>
>> I see, so I have to compile it with --enable-numa-suport?  TORQUE admin
>> guide is explanatory about that...
>>
>> Thank you!
>>
>> On Thu, Feb 06, 2014 at 09:38:45AM -0700, David Beer wrote:
>> > Silas,
>> >
>> > Does this system use NUMA architecture?
>> >
>> >
>> > On Thu, Feb 6, 2014 at 7:34 AM, Silas Silva <silasdb at gmail.com> wrote:
>> >
>> > > Hi there!
>> > >
>> > > I installed TORQUE and Maui successfully in a supercomputer, with 136
>> > > processors.  So, as you can see, this is not a beowulf cluster.
>>  TORQUE
>> > > and Maui runs fine, but after the first job is running, the node (the
>> > > computer itself) state turns to be job-exclusive, so I can't run any
>> > > more job even though there are dozens of processors free.
>> > >
>> > > How to configure it (server_priv/nodes only?) to recognize other CPU's
>> > > as a different partition that jobs can be allocated to?
>> > >
>> > > Is multi-mom configuration the answer?
>> > >
>> > > Thank you!
>> > >
>> > > --
>> > > Silas Silva
>> > > _______________________________________________
>> > > torqueusers mailing list
>> > > torqueusers at supercluster.org
>> > > http://www.supercluster.org/mailman/listinfo/torqueusers
>> > >
>> >
>> >
>> >
>> > --
>> > David Beer | Senior Software Engineer
>> > Adaptive Computing
>>
>> > _______________________________________________
>> > torqueusers mailing list
>> > torqueusers at supercluster.org
>> > http://www.supercluster.org/mailman/listinfo/torqueusers
>>
>>
>> --
>> Silas Silva
>> _______________________________________________
>> torqueusers mailing list
>> torqueusers at supercluster.org
>> http://www.supercluster.org/mailman/listinfo/torqueusers
>>
>
>
>
> --
> David Beer | Senior Software Engineer
> Adaptive Computing
>



-- 
David Beer | Senior Software Engineer
Adaptive Computing
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20140206/8b51677d/attachment.html 


More information about the torqueusers mailing list