[torqueusers] all jobs in Q state

Krause, Roland Roland.Krause at amtc-dresden.com
Thu Feb 22 10:49:30 MST 2007


Thanks a lot to all of you!
torque is running jobs again and I'm happy :-) 

-> -----Original Message-----
-> From: James J Coyle [mailto:jjc at iastate.edu] 
-> Sent: Thursday, February 22, 2007 6:41 PM
-> To: Krause, Roland
-> Cc: torqueusers at supercluster.org
-> Subject: Re: [torqueusers] all jobs in Q state 
-> 
-> Roland,
-> 
-> 
-> 
->    You want a fix right now I'm sure.
-> I suggest the following:
-> 
-> (Caveat: I am a Torque user not a developer. Here is what I have done
->   in a similar situation.  I run 4 clusters containing about 200
->   multiprocessor nodes.)
-> 
-> 
-> Edit the file
-> 
->  /var/spool/torque/sched_priv/sched_config
-> 
->   Change the line 
-> help_starving_jobs     true    ALL
-> 
-> to
-> 
-> help_starving_jobs      false   ALL
-> 
-> 
->   Exit the editor, and issue
-> 
-> killall -9 pbs_sched;  /usr/local/sbin/pbs_sched
-> 
-> 
->   The currently running jobs should stay running, and 
-> new jobs should start scheduling again.
-> 
->   Worst case would be losing the currently running jobs,
-> but I have started and stopped both the scheduler and 
-> server in this fashion with Torque and have not lost any jobs.
-> 
-> 
-> 
->  - Jim Coyle 
-> 
-> 


More information about the torqueusers mailing list