[torqueusers] torque/maui problem - defered jobs "Invalid request (15004)"

Garrick Staples garrick at usc.edu
Thu Dec 22 13:52:59 MST 2005


On Thu, Dec 22, 2005 at 01:13:46PM -0700, Austin Godber alleged:
> And I get errors in torque/server_logs/20051222 like this:
> 
> Invalid request (15004) in send_job, child failed in previous commit
> request for job

pbs_server is failing to create its TCP connection to MOM to send the
job.  The likely failure is a failure to bind to a priviledged port.

Which linux distro are you using?  On RHEL3 x86_64 I've observed very
slow binds when many are done quickly.  It was easy to reproduce with
something like this:
  seq 1 1000 | xargs -n 1 -i /usr/sbin/pbs_iff -t $pbsservername 15001

Try it on a 32bit and 64bit host.  It should finish in about 5 seconds.
Does it fail on 64bit?

-- 
Garrick Staples, Linux/HPCC Administrator
University of Southern California
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20051222/ef9025bf/attachment.bin


More information about the torqueusers mailing list