[torqueusers] Torque queue stalls, strange server log messages

Chris Samuel csamuel at vpac.org
Sat Mar 3 23:38:02 MST 2007


On Sun, 4 Mar 2007, Motin wrote:

> I think I have a good overview of the problem now. The scheduler seems to
> die when I add hundreds of jobs at once. I had to restart the scheduler and
> then the server to get things running again.

Sounds like a bug in pbs_sched. It's probably worth trying Maui instead, unless
you'd like to do a post mortem of pbs_sched to see where it's dying?

> I guess Torque isn't meant to receive hundreds of job requests at a time.

It can handle that, and has been able to for some years, with the proviso that
the system I helped set up was running Maui.

cheers!
Chris
-- 
 Christopher Samuel - (03)9925 4751 - VPAC Deputy Systems Manager
 Victorian Partnership for Advanced Computing http://www.vpac.org/
 Bldg 91, 110 Victoria Street, Carlton South, VIC 3053, Australia


