[torqueusers] Weird problem with 2.0.0p4
Garrick Staples
garrick at usc.edu
Thu Dec 22 10:52:58 MST 2005
On Thu, Dec 22, 2005 at 11:10:09AM +0100, ?ke Sandgren alleged:
> Hi!
>
> I just found a weird problem with our 2.0.0p4 install.
> jobs with nodes=2 or larger doesn't work...
>
> The only thing that starts on the MS are the following processes
> UID PID PPID C STIME TTY TIME CMD
> ake 4189 4074 99 11:08 ? 00:00:01 -bash
> ake 4211 4189 0 11:08 ? 00:00:00 pbs_demux
>
> i.e. the submit script itself never starts.
> There is nothing in the logs that i can find.
> 2.0.0p2 works ok (although on another machine)
So what's that bash process doing? Bash doesn't sit around doing
nothing. It exits when it runs out of input. Is it stuck reading on
stdin? Where does lsof show it's stdio fds?
Configure options? Using --enable-shell-pipe or
--enable-shell-use-argv?
--
Garrick Staples, Linux/HPCC Administrator
University of Southern California
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20051222/d42f6a1c/attachment.bin
More information about the torqueusers
mailing list