[torqueusers] Unknown Job Id Behavior

Joshua Bernstein jbernstein at penguincomputing.com
Fri Jun 6 12:05:52 MDT 2008



Chris Samuel wrote:
> ----- "Joshua Bernstein" <jbernstein at penguincomputing.com> wrote:
> 
>> Doesn't help.
> 
> :-)
> 
>> I still think there is a problem with some area of the communication 
>> between pbs_mom and pbs_server.
> 
> Quite possibly.

If so, why haven't the TORQUE guys commented on this issue?

>> If pbs_mom responds to pbs_server with a message saying that it
>> doesn't know anything about the job, shouldn't pbs_server just
>> consider the job dead, and either re-queue it or just notify
>> the user?
> 
> Yes, and I believe that this is the same problem I reported
> back at the start of May with 2.3.0 here:
> 
> http://www.clusterresources.com/pipermail/torqueusers/2008-May/007275.html
> 
> It appears that it might be fixed in the recent snapshots
> (or at least I've not seen it giving me problems recently).

Maybe. I'll have to give one of those a try. Which snapshot are you
running? Does anybody know of a CHANGELOG that mentions this issue?
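
To make the behavior I'm asking for above concrete, here is a rough
sketch in Python. It is purely illustrative and not taken from the
TORQUE source: the names (MomReply, Action, Job, handle_mom_reply) are
made up, and choosing between re-queue and notify based on a
"rerunnable" flag is my own assumption about how the decision might be
made.

```python
from dataclasses import dataclass
from enum import Enum


class MomReply(Enum):
    """Hypothetical replies pbs_mom might give when asked about a job."""
    RUNNING = "running"
    UNKNOWN_JOB = "unknown job id"


class Action(Enum):
    """Hypothetical actions the server could take in response."""
    KEEP = "keep"
    REQUEUE = "requeue"
    NOTIFY_USER = "notify user"


@dataclass
class Job:
    job_id: str
    rerunnable: bool  # assumption: decides re-queue vs. notify


def handle_mom_reply(job: Job, reply: MomReply) -> Action:
    """Decide what the server does with a job after hearing from the mom."""
    if reply is not MomReply.UNKNOWN_JOB:
        # The mom still knows the job, so leave it alone.
        return Action.KEEP
    # The mom has no record of the job: treat it as dead instead of
    # leaving it stuck in the queue.
    return Action.REQUEUE if job.rerunnable else Action.NOTIFY_USER


if __name__ == "__main__":
    lost = Job(job_id="1234.example", rerunnable=True)
    print(handle_mom_reply(lost, MomReply.UNKNOWN_JOB).value)
```

The point is only that an "unknown job" reply leads to a definite
outcome, rather than the job lingering in its old state.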

-Josh

