[torqueusers] nodes not receiving jobs

Tom Rudwick tomr at intrinsity.com
Sat Sep 13 19:46:20 MDT 2008


You probably need to up your log level. The errors you show only
mean that someone tried to do a qstat on an invalid/expired job id.

Tom


Adrian Sevcenco wrote:
> Hi! i have a situation in which some nodes with seemingly identical
> configuration to all other nodes, don't accept jobs .. the errors that i
> see in mom_logs are :
> 09/13/2008 12:08:34;0100;   pbs_mom;Req;;Type StatusJob request received
> from PBS_Server at alien.local, sock=10
> 09/13/2008 12:08:34;0080;   pbs_mom;Req;req_reject;Reject reply
> code=15001(Unknown Job Id), aux=0, type=StatusJob, from
> PBS_Server at alien.local
> 
> Any idea how can i debug this further?
> Thank you,
> Best regards,
> Adrian
> 
> 
> 
> ------------------------------------------------------------------------
> 
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers



More information about the torqueusers mailing list