[torquedev] Patch: spread job polling more uniformly

Ken Nielson knielson at adaptivecomputing.com
Thu Jan 26 17:10:57 MST 2012



----- Original Message -----
> From: "Eygene Ryabinkin" <rea+maui at grid.kiae.ru>
> To: torquedev at supercluster.org
> Sent: Saturday, January 21, 2012 1:07:31 PM
> Subject: [torquedev] Patch: spread job polling more uniformly
> 
> Good day.
> 
> It was a long time since I had posted to this list, but now I have
> a patch that should help a busy scheduling systems to be more
> responsive.  During the optimization of our Torque/Maui server,
> I had found that the spread of the running job polls is governed
> by the queue_rank value: the remainder from its division by
> JobStatRate is used as the spreading factor.
> 
> That's not a very good thing, because we can do better by just
> spreading the polls really uniformly:
>   t_n = now + n * JobStatRate / N_running.
> 
> The attached patch does this.  It is currently being tested on
> our cluster and shows no regressions for some hours.  I'll test
> it more thorougly, but the code review will also be good.
> --
> Eygene Ryabinkin, Russian Research Centre "Kurchatov Institute"
> 

Eygene,

Thanks for the patch. I will merge it and test it. What version of TORQUE was this made against?

Ken


More information about the torquedev mailing list