[torqueusers] no error, output fille / log file error message : bad attempt to connect from 127.0.0.1
정지욱
librasonatine at gmail.com
Tue Jul 13 09:32:45 MDT 2010
I am a beginner to Linux and Clustering
I'm making a Linux cluster system.
I tested clustering by 2computers(head.NTVT and node4.NTVT)
I installed torque-2.4.8 and mpich 1.2.7p1.
pbsnodes -a is ok.....
I shared passwd of ssh. So, I can access any computer without ssh passwd.
I executed qsub with simple hello script file.
#!/bin/sh
echo Hello World
And then, When I type qstat -f or qstat -ns
Everything looks like ok..
Job Id: 158.head.ntvt
Job_Name = hello
Job_Owner = root at head.ntvt
resources_used.cput = 00:00:00
resources_used.mem = 0kb
resources_used.vmem = 0kb
resources_used.walltime = 00:00:00
job_state = E
queue = pbsq_day
server = head.ntvt
Checkpoint = u
ctime = Tue Jul 13 16:32:37 2010
Error_Path = head.ntvt:/NPB/hello.e158
exec_host = node4.NTVT/0
Hold_Types = n
Join_Path = n
Keep_Files = n
Mail_Points = a
mtime = Tue Jul 13 16:32:37 2010
Output_Path = head.ntvt:/NPB/hello.o158
Priority = 0
qtime = Tue Jul 13 16:32:37 2010
Rerunable = True
session_id = 4553
substate = 51
Variable_List =
PBS_O_HOME=/root,PBS_O_LANG=ko_KR.UTF-8,PBS_O_LOGNAME=root,
PBS_O_PATH=usr/mpich/bin:usr/mpich/bin:/usr/lib64/qt-3.3/bin:/usr/ker
beros/sbin:/usr/kerberos/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin
:/usr/sbin:/usr/bin:/usr/X11R6/bin:/root/bin:/usr/mpich/bin:usr/local/
bin:/usr/local/bin,PBS_O_MAIL=/var/spool/mail/root,
PBS_O_SHELL=/bin/bash,PBS_SERVER=head.NTVT,PBS_O_WORKDIR=/NPB,
PBS_O_QUEUE=defaultq,PBS_O_HOST=head.ntvt
euser = root
egroup = root
hashname = 158.head.ntvt
queue_rank = 8
queue_type = E
comment = Job started on Tue Jul 13 at 16:32
etime = Tue Jul 13 16:32:37 2010
exit_status = 0
submit_args = hello
start_time = Tue Jul 13 16:32:37 2010
start_count = 1
fault_tolerant = False
However, there are not any output, error file in designated folders...Other
script programs are the same. So, I checked my server_log files..And I found
these messages.
this is a part of messages.
/server_logs/20100713
07/13/2010 16:32:30;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:32;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:34;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:36;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:37;0100;PBS_Server;Job;157.head.ntvt;enqueuing into
defaultq, state 1 hop 1
07/13/2010 16:32:37;0100;PBS_Server;Job;157.head.ntvt;dequeuing from
defaultq, state QUEUED
07/13/2010 16:32:37;0100;PBS_Server;Job;157.head.ntvt;enqueuing into
pbsq_day, state 1 hop 1
07/13/2010 16:32:37;0008;PBS_Server;Job;157.head.ntvt;Job Queued at request
of root at head.ntvt, owner = root at head.ntvt, job name = hello, queue =
pbsq_day
07/13/2010 16:32:37;0040;PBS_Server;Svr;head.ntvt;Scheduler was sent the
command new
07/13/2010 16:32:37;0008;PBS_Server;Job;157.head.ntvt;Job Modified at
request of Scheduler at head.NTVT
07/13/2010 16:32:37;0008;PBS_Server;Job;157.head.ntvt;Job Run at request of
Scheduler at head.NTVT
07/13/2010 16:32:37;0040;PBS_Server;Svr;head.ntvt;Scheduler was sent the
command recyc
07/13/2010 16:32:37;000d;PBS_Server;Job;157.head.ntvt;Not sending email:
User does not want mail of this type.
07/13/2010 16:32:37;000d;PBS_Server;Job;157.head.ntvt;Not sending email:
User does not want mail of this type.
07/13/2010 16:32:37;0010;PBS_Server;Job;157.head.ntvt;Exit_status=0
resources_used.cput=00:00:00 resources_used.mem=0kb resources_used.vmem=0kb
resources_used.walltime=00:00:00
07/13/2010 16:32:37;0100;PBS_Server;Job;158.head.ntvt;enqueuing into
defaultq, state 1 hop 1
07/13/2010 16:32:37;0100;PBS_Server;Job;158.head.ntvt;dequeuing from
defaultq, state QUEUED
07/13/2010 16:32:37;0100;PBS_Server;Job;158.head.ntvt;enqueuing into
pbsq_day, state 1 hop 1
07/13/2010 16:32:37;0008;PBS_Server;Job;158.head.ntvt;Job Queued at request
of root at head.ntvt, owner = root at head.ntvt, job name = hello, queue =
pbsq_day
07/13/2010 16:32:37;0040;PBS_Server;Svr;head.ntvt;Scheduler was sent the
command new
07/13/2010 16:32:37;0008;PBS_Server;Job;158.head.ntvt;Job Modified at
request of Scheduler at head.NTVT
07/13/2010 16:32:37;0008;PBS_Server;Job;158.head.ntvt;Job Run at request of
Scheduler at head.NTVT
07/13/2010 16:32:37;0040;PBS_Server;Svr;head.ntvt;Scheduler was sent the
command recyc
07/13/2010 16:32:37;000d;PBS_Server;Job;158.head.ntvt;Not sending email:
User does not want mail of this type.
07/13/2010 16:32:37;000d;PBS_Server;Job;158.head.ntvt;Not sending email:
User does not want mail of this type.
07/13/2010 16:32:37;0010;PBS_Server;Job;158.head.ntvt;Exit_status=0
resources_used.cput=00:00:00 resources_used.mem=0kb resources_used.vmem=0kb
resources_used.walltime=00:00:00
07/13/2010 16:32:38;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:40;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:42;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:44;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:45;000d;PBS_Server;Job;157.head.ntvt;Post job file
processing error; job 157.head.ntvt on host node4.NTVT/0
07/13/2010 16:32:45;0100;PBS_Server;Job;157.head.ntvt;dequeuing from
pbsq_day, state COMPLETE
07/13/2010 16:32:45;0040;PBS_Server;Svr;head.ntvt;Scheduler was sent the
command term
07/13/2010 16:32:45;000d;PBS_Server;Job;158.head.ntvt;Post job file
processing error; job 158.head.ntvt on host node4.NTVT/0
07/13/2010 16:32:45;0100;PBS_Server;Job;158.head.ntvt;dequeuing from
pbsq_day, state COMPLETE
07/13/2010 16:32:45;0040;PBS_Server;Svr;head.ntvt;Scheduler was sent the
command term
07/13/2010 16:32:46;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:48;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:50;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
server_priv/nodes looks like this.
/server_priv/nodes
node4.NTVT np=4
I'm using a root account and NFS.
I think network may have some errors.. How can I find this error? Please
help me..Thank you.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20100714/db6fbbb8/attachment-0001.html
More information about the torqueusers
mailing list