[torqueusers] SC'10 TORQUE Birds of a Feather

Joshua Bernstein jbernstein at penguincomputing.com
Mon Jun 28 11:51:16 MDT 2010


Well,

I can talk about Penguin's implementation of HA and pbs_server/Maui/Moab 
failover. I do know the pbs_server HA implementation was fixed a while 
back, but our method is in production with users all over the word, and 
seems to be very reliable.

-Josh

Lloyd Brown wrote:
> On 6/25/10 4:46 PM, Ken Nielson wrote:
>> Adaptive Computing is looking to organize a TORQUE Birds of a Feather for Supercomputing 2010.
>>
>> What needs do you want to see addressed in the meeting if you were to attend.
>>
>> Thanks
>>
>> Ken Nielson
>> Adaptive Computing
>> _______________________________________________
>> torqueusers mailing list
>> torqueusers at supercluster.org
>> http://www.supercluster.org/mailman/listinfo/torqueusers
>>   
> 
> I would love to hear about:
> 
> - Examples of job suspend/restart, especially using BLCR
> - Job Arrays
> - High-availability features of pbs_server
> 
> Of course, I'm not entirely certain that I'll be making it to SC10, but
> that's beside the point.
> 
> Lloyd
> 
> 


More information about the torqueusers mailing list