[torqueusers] Weird problem with 2.0.0p4

Garrick Staples garrick at usc.edu
Thu Dec 22 10:52:58 MST 2005


On Thu, Dec 22, 2005 at 11:10:09AM +0100, ?ke Sandgren alleged:
> Hi!
> 
> I just found a weird problem with our 2.0.0p4 install.
> jobs with nodes=2 or larger doesn't work...
> 
> The only thing that starts on the MS are the following processes
> UID        PID  PPID  C STIME TTY          TIME CMD
> ake       4189  4074 99 11:08 ?        00:00:01 -bash
> ake       4211  4189  0 11:08 ?        00:00:00 pbs_demux
> 
> i.e. the submit script itself never starts.
> There is nothing in the logs that i can find.
> 2.0.0p2 works ok (although on another machine)

So what's that bash process doing?  Bash doesn't sit around doing
nothing.  It exits when it runs out of input.  Is it stuck reading on
stdin?  Where does lsof show it's stdio fds?

Configure options?  Using --enable-shell-pipe or
--enable-shell-use-argv?

-- 
Garrick Staples, Linux/HPCC Administrator
University of Southern California
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20051222/d42f6a1c/attachment.bin


More information about the torqueusers mailing list