[torqueusers] qsub ... queue hang

Garrick Staples garrick at usc.edu
Tue Dec 4 22:37:53 MST 2007


On Mon, Dec 03, 2007 at 07:47:01PM -0600, Zhiliang Hu alleged:
> Somehow all jobs submitted via "qsub" hangs on queue on my linux cluster, and I can't see why.  Here are some details:
> 
> I have a "run" file containing one line to run a "hello" program:
> "/opt/openmpi.gcc/bin/mpirun -np 6 -machinefile machines ./hello"
> 
> It runs fine on command line:
> 
> > sh run 
> Comm_size is 6 with return value 0
> Received Hello from process 1 from process 1
> Received Hello from process 2 from process 2
> Received Hello from process 3 from process 3
> Received Hello from process 4 from process 4
> Received Hello from process 5 from process 5
> 
> However when submitted to "qsub":
> 
> > sh run  | qsub
> 49.cluster2.xxxx.xxxxxxx.xxx
> 
> -- it hangs there forever:

'sh run' is executing and qsub is waiting for it to exit so it can submit the
output as a job.

I think you want 'echo sh run | qsub'.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20071204/06a96039/attachment.bin


More information about the torqueusers mailing list