[torqueusers] Problems upgrading from 2.4 to 2.5
Glen Beane
glen.beane at gmail.com
Mon Nov 29 19:31:23 MST 2010
On Mon, Nov 29, 2010 at 10:07 AM, J.A. Magallón <jamagallon at ono.com> wrote:
> Hi all...
>
> First of all, hi to everyone, I'm new to the list.
> I usually have solved my problems with torque with some googling, but this
> is driving me nuts.
>
> I have benn using torque 2.4 for sometime, and everything works fine, But
> now my distro has updraded torque from 2.4.8 to 2.5.3, and I face a curious
> problem.
>
> I have reduced the problem to a simple test, with just one only node and
> a simple and stupid queue:
>
> Queue Memory CPU Time Walltime Node Run Que Lm State
> ---------------- ------ -------- -------- ---- --- --- -- -----
> std -- -- -- -- 0 0 10 E R
>
> No limits, no nothing. Box is a quad core cpu.
>
> With a simple job:
>
> werewolf:~/dev/mpi/tst> cat k
> #!/bin/bash
> #PBS -N x
> #PBS -S /bin/bash
> #PBS -j oe
>
> echo "server:" $PBS_SERVER
> echo "queue: " $PBS_QUEUE
> echo "client:" $PBS_O_HOST
> echo "cwd: " $PBS_O_WORKDIR
>
> echo "nodefile<"$PBS_NODEFILE">:"
> cat $PBS_NODEFILE
>
> sleep 30
>
> with torque 2.4, I could do this:
>
> werewolf:~/dev/mpi/tst> qsub -l nodes=1:ppn=2 k
> 0.werewolf.home
>
> (what I really do is running MPI with mpirun -pernode...)
>
> But with torque 2.5, this does not work anymore:
>
> erewolf:~/dev/mpi/tst> qsub -l nodes=1:ppn=2 k
> qsub: Job exceeds queue resource limits MSG=cannot locate feasible nodes
>
> Uh ? What has changed ? It looks like 2.5 ignores that box has 4 cores...
>
> Any idea ? Some behavior has changed, is it a bug, or should it work
> and perhaps its a packaging/compiler issue ?
ster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
>
>
what does the pbs_server nodes file look like? What do you see when
you run "pbs_nodes -a"?
More information about the torqueusers
mailing list