[torqueusers] nodes not receiving jobs
Adrian.Sevcenco at cern.ch
Sun Sep 14 04:13:12 MDT 2008
Tom Rudwick wrote:
> You probably need to up your log level. The errors you show only
> mean that someone tried to do a qstat on an invalid/expired job id.
How can i increase the debug level? Is it an option at the start of
torque server? i use torque-2.1.8-1 and i don't see and option for
increasing log verbosity in man page.
> Adrian Sevcenco wrote:
>> Hi! i have a situation in which some nodes with seemingly identical
>> configuration to all other nodes, don't accept jobs .. the errors that i
>> see in mom_logs are :
>> 09/13/2008 12:08:34;0100; pbs_mom;Req;;Type StatusJob request received
>> from PBS_Server at alien.local, sock=10
>> 09/13/2008 12:08:34;0080; pbs_mom;Req;req_reject;Reject reply
>> code=15001(Unknown Job Id), aux=0, type=StatusJob, from
>> PBS_Server at alien.local
>> Any idea how can i debug this further?
>> Thank you,
>> Best regards,
-------------- next part --------------
A non-text attachment was scrubbed...
Size: 3092 bytes
Desc: S/MIME Cryptographic Signature
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20080914/4fe6539c/smime.bin
More information about the torqueusers