[torqueusers] node names are defined but pbs_server doesn't recognize

Mahmood Naderan nt_mahmood at yahoo.com
Thu Apr 25 01:26:46 MDT 2013


Please see the attachment. The part of the output that I think relates to the nodes file is:

open("/var/spool/pbs/server_priv/nodes", O_RDONLY) = 8
fstat(8, {st_mode=S_IFREG|0644, st_size=361, ...}) = 0
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7f0051426000
read(8, "## This is the TORQUE server \"no"..., 4096) = 361
open("/etc/hosts", O_RDONLY|O_CLOEXEC)  = 9
fstat(9, {st_mode=S_IFREG|0644, st_size=125, ...}) = 0
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7f0051425000
read(9, "127.0.0.1\tlocalhost.localdomain "..., 4096) = 125
read(9, "", 4096)                       = 0
close(9)                                = 0
munmap(0x7f0051425000, 4096)            = 0
ioctl(2, SNDCTL_TMR_TIMEBASE or TCGETS, {B38400 opost isig icanon echo ...}) = 0
write(2, "PBS_Server: LOG_ERROR::process_h"..., 108PBS_Server: LOG_ERROR::process_host_name_part, no valid IP addresses found for 'tiger' - check name service
) = 108
write(5, "04/25/2013 11:52:35;0001;PBS_Ser"..., 147) = 147
socket(PF_FILE, SOCK_DGRAM|SOCK_CLOEXEC, 0) = 9
connect(9, {sa_family=AF_FILE, path="/dev/log"}, 110) = 0
sendto(9, "<27>Apr 25 11:52:35 PBS_Server: "..., 127, MSG_NOSIGNAL, NULL, 0) = 127
write(5, "04/25/2013 11:52:35;0040;PBS_Ser"..., 97) = 97
read(8, "", 4096)                       = 0
close(8)                                = 0
munmap(0x7f0051426000, 4096)            = 0
open("/var/spool/pbs/server_priv/node_status", O_RDONLY) = -1 ENOENT (No such file or directory)
open("/var/spool/pbs/server_priv/node_note", O_RDONLY) = -1 ENOENT (No such file or directory)
open("/var/spool/pbs/server_name", O_RDONLY) = 8
fstat(8, {st_mode=S_IFREG|0644, st_size=6, ...}) = 0
mmap(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x7f0051426000
read(8, "tiger\n", 4096)                = 6
read(8, "", 4096)                       = 0
close(8)                                = 0
munmap(0x7f0051426000, 4096)            = 0
chdir("/var/spool/pbs/server_priv/queues/") = 0
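
The trace shows pbs_server reading server_priv/nodes, consulting /etc/hosts, and then reporting that 'tiger' has no valid IP address. A minimal Python sketch of a similar check, run outside pbs_server, is given here. It is not TORQUE's code: the function names (node_names, resolve_ipv4, unresolved) are made up for illustration, the IPv4-only lookup is an arbitrary choice, and the sketch assumes that the first whitespace-separated token on each non-comment line of the nodes file is the hostname, as in the nodes file quoted later in this thread.

```python
import socket


def node_names(nodes_text):
    """Return hostnames from nodes-file text.

    Assumption: the hostname is the first token of each line that is
    neither blank nor a '#' comment.
    """
    names = []
    for line in nodes_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        names.append(line.split()[0])
    return names


def resolve_ipv4(name):
    """Ask the system resolver for IPv4 addresses; return [] if the lookup fails."""
    try:
        infos = socket.getaddrinfo(name, None, socket.AF_INET)
    except socket.gaierror:
        return []
    # Each entry is (family, type, proto, canonname, sockaddr); sockaddr[0] is the address.
    return sorted({info[4][0] for info in infos})


def unresolved(nodes_text, resolver=resolve_ipv4):
    """Return the node names for which the resolver yields no address."""
    return [name for name in node_names(nodes_text) if not resolver(name)]
```

Passing the contents of /var/spool/pbs/server_priv/nodes (the path from the trace) to unresolved() as a user who can read that file would list any node name the system resolver cannot map to an address. The resolver argument is only there so the function can be exercised without touching the real name service.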


 
Regards,
Mahmood



________________________________
 From: "Pestiaux, Florent" <Florent.Pestiaux at bioclinica.com>
To: Mahmood Naderan <nt_mahmood at yahoo.com> 
Cc: Torque Users Mailing List <torqueusers at supercluster.org> 
Sent: Thursday, April 25, 2013 11:48 AM
Subject: Re: [torqueusers] node names are defined but pbs_server doesn't recognize
 


Please run strace pbs_server
and send the output.

--- 
Florent

On 25 Apr 2013, at 08:56, "Mahmood Naderan" <nt_mahmood at yahoo.com> wrote:



>>First analysis is easy: wrong path. You are using /var/spool/pbs instead of /var/spool/torque.
>That is the installation path and is correct. 
>
>
>[mahmood@tiger ~]$ ls /var/spool/pbs/
>aux         job_logs  mom_priv         sched_logs  server_logs  server_name.new  spool
>checkpoint  mom_logs  pbs_environment  sched_priv  server_name  server_priv      undelivered
>
>
>
>As I said, we didn't have this problem before the reboot.
>
> 
>Regards,
>Mahmood
>
>
>
>
>________________________________
> From: "Pestiaux, Florent" <Florent.Pestiaux at bioclinica.com>
>To: Mahmood Naderan <nt_mahmood at yahoo.com>; Torque Users Mailing List <torqueusers at supercluster.org> 
>Cc: torque cluster <torqueusers at supercluster.org> 
>Sent: Thursday, April 25, 2013 11:22 AM
>Subject: Re: [torqueusers] node names are defined but pbs_server doesn't recognize
>
>
>
>First analysis is easy: wrong path. You are using /var/spool/pbs instead of /var/spool/torque.
>
>
>
>--- 
>Florent
>
>On 25 Apr 2013, at 07:55, "Mahmood Naderan" <nt_mahmood at yahoo.com> wrote:
>
>
>We are stuck at this point, so any idea for solving this problem is appreciated.
>>
>> 
>>Regards,
>>Mahmood
>>
>>
>>
>>
>>________________________________
>> From: Mahmood Naderan <nt_mahmood at yahoo.com>
>>To: torque cluster <torqueusers at supercluster.org> 
>>Sent: Wednesday, April 24, 2013 8:34 PM
>>Subject: [torqueusers] node names are defined but pbs_server doesn't recognize
>>
>>
>>
>>Hi
>>After a reboot (power outage), pbs_server no longer seems to run correctly.
>>
>>[root@tiger mahmood]# pbsnodes -l all
>>pbsnodes: Server has no node list MSG=node list is empty - check 'server_priv/nodes' file
>>
>>[root@tiger mahmood]# pbs_server
>>PBS_Server: LOG_ERROR::process_host_name_part, no valid IP addresses found for 'tiger' - check name service
>>pbs_server: network: Address already in use
>>PBS_Server: LOG_ERROR::PBS_Server, init_network failed dis
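
The message "Address already in use" is the text of the EADDRINUSE error that a program receives when it tries to bind a port that is already bound, for example by an earlier pbs_server instance that is still running. A small Python sketch that tests this condition for a given TCP port follows. The function name port_in_use is hypothetical, and the port to pass is an assumption: 15001 is commonly cited as pbs_server's default, but this thread does not say which port this installation uses.

```python
import errno
import socket


def port_in_use(port, host="0.0.0.0"):
    """Return True if binding a TCP socket to (host, port) fails with EADDRINUSE."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    try:
        sock.bind((host, port))
    except OSError as exc:
        if exc.errno == errno.EADDRINUSE:
            return True
        raise  # some other failure (e.g. permission denied), not the case tested here
    finally:
        sock.close()
    return False
```

If port_in_use(15001) returns True before pbs_server is started, some process already holds that port. The sketch does not say which process; that has to be found by other means.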
>>
>>
>>
>>However, everything seems to be correct in the config files:
>>[root@tiger mahmood]# cat /var/spool/pbs/server_priv/nodes
>>## This is the TORQUE server "nodes" file.
>>##
>>## To add a node, enter its hostname, optional processor count (np=),
>>## and optional feature names.
>>##
>>## Example:
>>##    host01 np=8 featureA featureB
>>##    host02 np=8 featureA featureB
>>##
>>## for more information, please visit:
>>##
>>## http://www.clusterresources.com/torquedocs/nodeconfig.shtml
>>tiger np=31
>>
>>[root@tiger mahmood]# cat /etc/hosts
>>127.0.0.1               localhost.localdomain localhost
>>#::1            localhost6.localdomain6 localhost6
>>192.168.1.5    tiger
>>194.225.69.104 tiger
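
The hosts file above lists the name tiger on two lines with different addresses. A short Python sketch that groups addresses by name makes such duplicates easy to spot. The function name addresses_by_name is hypothetical, and whether the duplicate entry has anything to do with the pbs_server error is not established in this thread.

```python
from collections import defaultdict


def addresses_by_name(hosts_text):
    """Map each hostname or alias in hosts-file text to the addresses listed for it."""
    table = defaultdict(list)
    for line in hosts_text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments, then skip blank lines
        if not line:
            continue
        address, *names = line.split()
        for name in names:
            table[name].append(address)
    return dict(table)


# The /etc/hosts content quoted above.
HOSTS = """\
127.0.0.1               localhost.localdomain localhost
#::1            localhost6.localdomain6 localhost6
192.168.1.5    tiger
194.225.69.104 tiger
"""

# Report every name that appears with more than one address.
for name, addrs in addresses_by_name(HOSTS).items():
    if len(addrs) > 1:
        print(name, "->", ", ".join(addrs))
```

For the file quoted above this prints one line, `tiger -> 192.168.1.5, 194.225.69.104`.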
>>
>> 
>>Any feedback is welcome.
>>
>>
>>Regards,
>>Mahmood
>>
>>_______________________________________________
>>torqueusers mailing list
>>torqueusers at supercluster.org
>>http://www.supercluster.org/mailman/listinfo/torqueusers
>>
>>
>>
> 
>-- 
>Confidentiality Notice: This e-mail transmission may contain confidential or legally privileged information that is intended only for the individual or entity named in the e-mail address. If you are not the intended recipient, you are hereby notified that any disclosure, copying, distribution, or reliance upon the contents of this e-mail is strictly prohibited. If you have received this e-mail transmission in error, please reply to the sender and then delete the message from your computer. Thank you.
>--
>
>
>
 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: strace.pbs_server.rar
Type: application/x-rar
Size: 6990 bytes
Desc: not available
Url : http://www.supercluster.org/pipermail/torqueusers/attachments/20130425/c4624b4c/attachment-0001.bin 

