[torqueusers] Pbs_server error

scoggins jscoggins at lbl.gov
Mon Apr 2 10:12:25 MDT 2007


Check whether there is a lock file in the $PBS_HOME/server_priv
directory.
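
A rough sketch of that check follows. It assumes the lock file is named server.lock and that $PBS_HOME falls back to /var/spool/torque when unset; neither detail is stated in this thread, so adjust both for your installation. check_server_lock is a hypothetical helper name.

```shell
# Sketch: report whether a pbs_server lock file exists.
# Assumptions (not from the thread): the lock file is called server.lock,
# and PBS_HOME defaults to /var/spool/torque when it is not set.
check_server_lock() {
    # Optional first argument overrides the server_priv directory.
    priv_dir="${1:-${PBS_HOME:-/var/spool/torque}/server_priv}"
    lock_file="$priv_dir/server.lock"
    if [ -e "$lock_file" ]; then
        echo "lock file present: $lock_file"
        return 0
    fi
    echo "no lock file in $priv_dir"
    return 1
}

# Usage: check_server_lock            (uses $PBS_HOME/server_priv)
#        check_server_lock /some/dir  (checks an explicit directory)
```

It may also be worth checking with `ps -e | grep pbs_server` whether an earlier pbs_server instance is still running and holding the lock.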


On Apr 1, 2007, at 10:15 PM, SCIPIONI Roberto wrote:

> Hi there,
>
>
>
> I installed Torque and Maui.
>
> I managed to get Torque to read the libraries, but now when I try
> to start the server with
>
>
> pbs_server -t create
>
>
> it says
>
> pbs_server -t create
> PBS_Server: Resource temporarily unavailable (11) in PBS_Server,  
> pbs_server: another server running
>
> pbs_server: another server running
>
>
> ----------------------------------------------
>
> I am confused because there is only one server.
> Could the problem be in the specifications for the head node and the nodes?
>
>
> Thanks for the help
>
>
> Roberto
>
>
>
>
>
> Dear All,
>
> I'm having problems finishing the configuration of the Torque
> Batch System.
> I'm using the following software packages:
>
> torque-scheduler-2.1.6
> torque-client-2.1.6
> torque-scheduler-2.1.6
>
> The queues have been created without any problems, and the server
> can reach all the clients over the network. I checked this last
> point by submitting a simple shell script (echo 'date') 10 times from
> the server, and I can see 10 shell sessions opened on the client to
> run the jobs.
>
> Job submission script:
>
> # queue selected for that job
> #PBS -q long
> cat $PBS_NODEFILE
> #PBS -o /home5/userxxx/pbs.output
> #PBS -l nodes=1
> #PBS -I
> #PBS -r n
> #PBS -l walltime=12:00:00
> #PBS -M userxxx
> #PBS -N teste
> #########################################
> #       JOB DEFINITION                                       #
> #########################################
> #!/bin/bash
> #!change the working directory (default is home directory)
> cd /home5/userxxx/
> echo Running on host `hostname` > /home5/userxxx/pbs.output
> echo Time is `date` > /home5/userxxx/pbs.output
> echo Directory is `pwd` > /home5/userxxx/pbs.output
>
>
>
> The problem is that I cannot see any output written to any file.
>
> Here is the relevant line from the server log:
>
>
> 03/29/2007 16:27:28;000d;PBS_Server;Job; 
> 129.pc061.dq.ua.pt.dq.ua.pt;Post job file processing error; job  
> 129.molecular-modeling.dq.ua.pt on host planck.dq.ua.pt
>
>
> In our cluster all the home directories are shared over NFS with
> all nodes, so I think that scp will not be used in that case, only a
> simple cp command.
> In my opinion the problem may lie in file transfers between the
> submission server and the execution (mom) clients.
>
> Here is the client configuration file for pbs_mom (config):
>
> # MOM server configuration file
> # if more than one value, separate it by comma.
> #
> # specifies the PBS server that can submit jobs
> $pbsserver pc061.dq.ua.pt
> # specifies the clients that pbs_mom can contact through privileged ports
> $pbsclient molecular-modeling.dq.ua.pt
> $pbsclient planck.dq.ua.pt
> $loglevel 7
>
>
> I have also checked the undelivered directory on the client
> (planck.dq.ua.pt) and it is empty.
>
> Can anyone give me a clue on how to resolve this problem?
> Also, if I cannot resolve this issue, I'm planning to migrate the Batch
> System to Sun Grid Engine. What is your opinion of SGE?
>
>
> Thanks in advance,
>
>
> Best regards,
>
>
> Nelson Fonseca
> Beowulf Cluster System Administrator
> University of Aveiro
> Portugal
>
>
>
>
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
