[torqueusers] no error or output file / log file error message: bad attempt to connect from 127.0.0.1

정지욱 librasonatine at gmail.com
Tue Jul 13 09:32:45 MDT 2010


I am a beginner with Linux and clustering.

I'm building a Linux cluster.

I am testing it with two computers (head.NTVT and node4.NTVT).

I installed torque-2.4.8 and mpich 1.2.7p1.

pbsnodes -a looks OK.

I shared the SSH credentials, so I can access any computer without an SSH password.

I ran qsub with a simple hello script file:

#!/bin/sh
echo Hello World

Then, when I run qstat -f or qstat -ns, everything looks OK:

Job Id: 158.head.ntvt
    Job_Name = hello
    Job_Owner = root at head.ntvt
    resources_used.cput = 00:00:00
    resources_used.mem = 0kb
    resources_used.vmem = 0kb
    resources_used.walltime = 00:00:00
    job_state = E
    queue = pbsq_day
    server = head.ntvt
    Checkpoint = u
    ctime = Tue Jul 13 16:32:37 2010
    Error_Path = head.ntvt:/NPB/hello.e158
    exec_host = node4.NTVT/0
    Hold_Types = n
    Join_Path = n
    Keep_Files = n
    Mail_Points = a
    mtime = Tue Jul 13 16:32:37 2010
    Output_Path = head.ntvt:/NPB/hello.o158
    Priority = 0
    qtime = Tue Jul 13 16:32:37 2010
    Rerunable = True
    session_id = 4553
    substate = 51
    Variable_List = PBS_O_HOME=/root,PBS_O_LANG=ko_KR.UTF-8,PBS_O_LOGNAME=root,
    PBS_O_PATH=usr/mpich/bin:usr/mpich/bin:/usr/lib64/qt-3.3/bin:/usr/ker
    beros/sbin:/usr/kerberos/bin:/usr/local/sbin:/usr/local/bin:/sbin:/bin
    :/usr/sbin:/usr/bin:/usr/X11R6/bin:/root/bin:/usr/mpich/bin:usr/local/
    bin:/usr/local/bin,PBS_O_MAIL=/var/spool/mail/root,
    PBS_O_SHELL=/bin/bash,PBS_SERVER=head.NTVT,PBS_O_WORKDIR=/NPB,
    PBS_O_QUEUE=defaultq,PBS_O_HOST=head.ntvt
    euser = root
    egroup = root
    hashname = 158.head.ntvt
    queue_rank = 8
    queue_type = E
    comment = Job started on Tue Jul 13 at 16:32
    etime = Tue Jul 13 16:32:37 2010
    exit_status = 0
    submit_args = hello
    start_time = Tue Jul 13 16:32:37 2010
    start_count = 1
    fault_tolerant = False
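
For a quick look at just the attributes that matter here (job_state and exit_status), the listing above can be filtered. This is a minimal sketch, not part of TORQUE: the helper name job_field is made up for illustration, and it assumes the "name = value" layout shown in the qstat -f output above.

```shell
#!/bin/sh
# job_field NAME: read qstat -f style text on stdin and print the value
# of the first attribute whose name is exactly NAME.
# (job_field is a hypothetical helper, not a TORQUE command.)
job_field() {
    awk -v key="$1" '
        {
            pos = index($0, " = ")          # attribute lines look like "name = value"
            if (pos == 0) next              # skip lines without that separator
            name = substr($0, 1, pos - 1)
            gsub(/^[ \t]+/, "", name)       # drop the leading indentation
            if (name == key) {
                print substr($0, pos + 3)   # everything after " = "
                exit
            }
        }
    '
}
```

It could be used as `qstat -f 158 | job_field job_state`. In the listing above that field is E, which is the state TORQUE reports while a job is exiting.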


However, no output or error files appear in the designated folders. Other
scripts behave the same way. So I checked my server_logs files and found
the messages below.
This is only part of the log.

/server_logs/20100713
07/13/2010 16:32:30;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:32;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:34;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:36;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:37;0100;PBS_Server;Job;157.head.ntvt;enqueuing into
defaultq, state 1 hop 1
07/13/2010 16:32:37;0100;PBS_Server;Job;157.head.ntvt;dequeuing from
defaultq, state QUEUED
07/13/2010 16:32:37;0100;PBS_Server;Job;157.head.ntvt;enqueuing into
pbsq_day, state 1 hop 1
07/13/2010 16:32:37;0008;PBS_Server;Job;157.head.ntvt;Job Queued at request
of root at head.ntvt, owner = root at head.ntvt, job name = hello, queue =
pbsq_day
07/13/2010 16:32:37;0040;PBS_Server;Svr;head.ntvt;Scheduler was sent the
command new
07/13/2010 16:32:37;0008;PBS_Server;Job;157.head.ntvt;Job Modified at
request of Scheduler at head.NTVT
07/13/2010 16:32:37;0008;PBS_Server;Job;157.head.ntvt;Job Run at request of
Scheduler at head.NTVT
07/13/2010 16:32:37;0040;PBS_Server;Svr;head.ntvt;Scheduler was sent the
command recyc
07/13/2010 16:32:37;000d;PBS_Server;Job;157.head.ntvt;Not sending email:
User does not want mail of this type.
07/13/2010 16:32:37;000d;PBS_Server;Job;157.head.ntvt;Not sending email:
User does not want mail of this type.
07/13/2010 16:32:37;0010;PBS_Server;Job;157.head.ntvt;Exit_status=0
resources_used.cput=00:00:00 resources_used.mem=0kb resources_used.vmem=0kb
resources_used.walltime=00:00:00
07/13/2010 16:32:37;0100;PBS_Server;Job;158.head.ntvt;enqueuing into
defaultq, state 1 hop 1
07/13/2010 16:32:37;0100;PBS_Server;Job;158.head.ntvt;dequeuing from
defaultq, state QUEUED
07/13/2010 16:32:37;0100;PBS_Server;Job;158.head.ntvt;enqueuing into
pbsq_day, state 1 hop 1
07/13/2010 16:32:37;0008;PBS_Server;Job;158.head.ntvt;Job Queued at request
of root at head.ntvt, owner = root at head.ntvt, job name = hello, queue =
pbsq_day
07/13/2010 16:32:37;0040;PBS_Server;Svr;head.ntvt;Scheduler was sent the
command new
07/13/2010 16:32:37;0008;PBS_Server;Job;158.head.ntvt;Job Modified at
request of Scheduler at head.NTVT
07/13/2010 16:32:37;0008;PBS_Server;Job;158.head.ntvt;Job Run at request of
Scheduler at head.NTVT
07/13/2010 16:32:37;0040;PBS_Server;Svr;head.ntvt;Scheduler was sent the
command recyc
07/13/2010 16:32:37;000d;PBS_Server;Job;158.head.ntvt;Not sending email:
User does not want mail of this type.
07/13/2010 16:32:37;000d;PBS_Server;Job;158.head.ntvt;Not sending email:
User does not want mail of this type.
07/13/2010 16:32:37;0010;PBS_Server;Job;158.head.ntvt;Exit_status=0
resources_used.cput=00:00:00 resources_used.mem=0kb resources_used.vmem=0kb
resources_used.walltime=00:00:00
07/13/2010 16:32:38;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:40;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:42;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:44;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:45;000d;PBS_Server;Job;157.head.ntvt;Post job file
processing error; job 157.head.ntvt on host node4.NTVT/0
07/13/2010 16:32:45;0100;PBS_Server;Job;157.head.ntvt;dequeuing from
pbsq_day, state COMPLETE
07/13/2010 16:32:45;0040;PBS_Server;Svr;head.ntvt;Scheduler was sent the
command term
07/13/2010 16:32:45;000d;PBS_Server;Job;158.head.ntvt;Post job file
processing error; job 158.head.ntvt on host node4.NTVT/0
07/13/2010 16:32:45;0100;PBS_Server;Job;158.head.ntvt;dequeuing from
pbsq_day, state COMPLETE
07/13/2010 16:32:45;0040;PBS_Server;Svr;head.ntvt;Scheduler was sent the
command term
07/13/2010 16:32:46;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:48;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
07/13/2010 16:32:50;0001;PBS_Server;Svr;PBS_Server;LOG_ERROR::is_request,
bad attempt to connect from 127.0.0.1:1023 (address not trusted - check
entry in server_priv/nodes)
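
To see how many of these rejected connections there are and which addresses they come from, the log can be summarized. The following is only a sketch: the function name count_bad_connects is an illustrative assumption, and it relies on the message text "bad attempt to connect from ADDRESS:PORT" exactly as it appears in the excerpt above.

```shell
#!/bin/sh
# count_bad_connects FILE: print "address count" for every source address
# that the server log FILE reports as a rejected connection.
# (count_bad_connects is a hypothetical helper, not a TORQUE command.)
count_bad_connects() {
    # Keep only the IPv4 address that follows the message text,
    # dropping the ":port" suffix and the rest of the line.
    sed -n 's/.*bad attempt to connect from \([0-9.]*\):[0-9]*.*/\1/p' "$1" |
        sort |
        uniq -c |
        awk '{ print $2, $1 }'   # uniq -c prints "count address"; swap the order
}
```

It would be called with the path to one log file, for example the 20100713 file under server_logs. If every rejected connection comes from 127.0.0.1, as in the excerpt, the sender is a process on the server machine itself rather than on node4.NTVT.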

My server_priv/nodes file looks like this:

/server_priv/nodes
node4.NTVT np=4
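
Since the log complains about 127.0.0.1 while the nodes file lists only node4.NTVT, one thing that may be worth checking is whether any cluster host name is mapped to a loopback address in a hosts file. This is an assumption about a possible cause, not a diagnosis. The sketch below uses a made-up helper name, loopback_names, and assumes the usual hosts-file layout of an address followed by one or more names.

```shell
#!/bin/sh
# loopback_names FILE: print every host name that a hosts-format FILE
# maps to a 127.x.x.x address, one name per line.
# (loopback_names is a hypothetical helper written for this example.)
loopback_names() {
    awk '
        { sub(/#.*/, "") }                      # strip trailing comments
        $1 ~ /^127\./ {                         # IPv4 loopback entries only
            for (i = 2; i <= NF; i++) print $i  # every name on that line
        }
    ' "$1"
}
```

Running `loopback_names /etc/hosts` on both machines lists the names that resolve to loopback. If head.NTVT or node4.NTVT shows up there, that mapping is a candidate explanation for the untrusted 127.0.0.1 connections, though other causes are possible.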

I'm using the root account and NFS.

I think the network may have some errors. How can I find the problem? Please
help me. Thank you.