[Mauiusers] checkpointing node 'job name' is correct behavior?

Heiga ZEN (Byung Ha CHUN) zen at sp.nitech.ac.jp
Mon Jul 2 20:56:46 MDT 2007


Hi,

Garrick Staples wrote (2007/07/03 4:51):

>> 07/02 12:54:26 INFO:     checkpointing node 'p4-6'
>> 07/02 12:54:26 INFO:     checkpointing node 'p4-7'
>> ...
>> 07/02 12:54:26 INFO:     checkpointing node 'pd4-13'
>> 07/02 12:54:26 INFO:     checkpointing node '5958.jasmine'
>> 07/02 12:54:26 INFO:     checkpointing node '5959.jasmine'
>> ...
>> 07/02 12:54:26 INFO:     checkpointing node '6044.jasmine'
> 
> This looks like an old bug in the pbs client libraries that was fixed
> years ago.  Maui would issue a pbs_statnode() call, the data read had a
> particular timeout, and the data would still be on the wire for the next
> call to pbs_statjob().

OK, I see.

> You didn't say the version, but I assume an old version of TORQUE.
> Update your TORQUE and rebuild Maui after installing the updated TORQUE
> (updating Maui is not required for this particular bug).
 
Hmm, I'm using TORQUE 2.1.7 (not so old, isn't it?).
Anyway, I'll update TORQUE and check this phenomenon.

Thank you very much.

Heiga ZEN (Byung Ha CHUN)

-- 
------------------------------------------------
 Heiga ZEN     (in Japanese pronunciation)
 Byung Ha CHUN (in Korean pronunciation)

 Department of Computer Science and Engineering
 Nagoya Institute of Technology
 Gokiso-cho, Showa-ku, Nagoya 466-8555 Japan

 http://www.sp.nitech.ac.jp/~zen
------------------------------------------------


More information about the mauiusers mailing list