[torqueusers] Re: Unexpected qsub error
David McGiven
david.mcgiven at fusemail.com
Thu Nov 16 05:28:04 MST 2006
I forgot to say that now (without any apparent reason) it allows me to
send jobs with 2 cpu's but not with 4. What's going on ?
David
On Thu, 16 Nov 2006 12:52:07 +0100
David McGiven <david.mcgiven at fusemail.com> wrote:
>
> Dear Torque users,
>
> I had to powerdown my two servers for electrical maiteinance. When I
> turned them on again, I found the following error when trying to
> submit
> jobs.
>
> "qsub: Job exceeds queue resource limits"
>
> But i'm asking for two processors where the machine has more than 10
> processors free.
>
> This never happened before, and I'm sure I didn't change anything from
> the config files. Could someone please give me advice ?
>
> The only thing I did was to qdel with "-p" option some jobs before
> shutting the server down. Can this have affected ?
>
> Thanks in advance.
>
> David
>
> The script I'm trying to submit :
>
> ---------CUT---------
> #!/bin/tcsh
> #PBS -S /bin/tcsh
> #PBS -m n
> #PBS -j oe
> #PBS -k oe
> #PBS -l nodes=2
> #PBS -M david
> #PBS -q incal
> #PBS -r n
> # name
> #PBS -N gli_g03-2
> if ($?LD_LIBRARY_PATH) setenv LD_LIBRARY_PATH /opt/soft/gaussian03
> setenv g03root "/opt/soft/gaussian03"
> setenv GAUSS_SCRDIR "/scratch"
> source $g03root/g03/bsd/g03.login
> setenv GAUSS_LFLAGS "-nodefile $PBS_NODEFILE"
>
> g03 glico01.com > glico01.log
> ---------CUT---------
>
> The machine where pbs_server runs is :
>
> - Dell Poweredge, Intel Xeon 3.4 Ghz, Debian, 2.6 i686 kernel
> - Troque version : torque-2.0.0p8
> - Maui version : maui-3.2.6p14
>
> The machine where pbs_mom runs is :
>
> - SGI Altix 330, Intel Itanium2 1.5 Ghz SLES9 (10 CPU's)
> - Torque version : 2.0.0p7
>
> The logfile from the pbs_server :
>
> 11/13/2006 15:39:35;0100;PBS_Server;Req;;Type AuthenticateUser request
> received from david at machine, sock=12
>
> 11/13/2006 15:39:35;0100;PBS_Server;Req;;Type QueueJob request
> received
> from david at machine, sock=11
>
> 11/13/2006 15:39:35;0080;PBS_Server;Req;req_reject;Reject reply
> code=15036(Job exceeds queue resource limits), aux=0, type=QueueJob,
> from david at machine
>
> The logfile from the pbs_mom :
>
> - Nothing new is shown on the log when qsub issued.
>
More information about the torqueusers
mailing list