[torqueusers] Problems with maui scalability
Lennart Karlsson
Lennart.Karlsson at nsc.liu.se
Wed Sep 12 01:14:53 MDT 2007
meo at intrinsity.com said:
> Peter Wyckoff said...
>
> |I'm wondering how big you've gotten maui and torque to scale, mostly
> |interested in number of nodes?
> |
> |The docs say something like 1,000 but I think it scales well beyond that,
> |no?
>
> That's what I've heard. Right now we're at about 300 nodes.
Are you able to start a parallel job spanning all 300 of these nodes,
or is the mom-to-mom communication setup breaking down?
We have problems starting jobs wider than about 100 nodes, because
that many moms have difficulty synchronizing among themselves
at startup.
-- Lennart Karlsson <Lennart.Karlsson at nsc.liu.se>
National Supercomputer Centre in Linkoping, Sweden
http://www.nsc.liu.se