[torqueusers] .OU and .ER files not being sent to user

Adil Mughal adil.m.mughal at gmail.com
Wed Feb 27 09:48:14 MST 2008


Dear James, Garrick and other Torque Experts

After making the changes to the mom_priv/config files I then restated
both pbs_server (on master) and pbs_mom (on slaves)... and still it
does not work...

Any further suggestions would be greatly appreciated

adil

On Wed, Feb 27, 2008 at 4:02 PM, James J Coyle <jjc at iastate.edu> wrote:
>
>   You should at least restart the pbs_mom on the node after changing
>  the config file, otherwise, I don't think it will re-read the config
>  file.
>
>   This is similar for any daemon, as the daemon will load configuration
>  tables into memory when it starts and not re-read config files from
>  disk before every action it takes.
>
>   Possibly a kill -HUP on the pbs_mom pid would work force a re-read
>  of the dconfig files, but that depends on the programming of pbs_mom.
>  Safest it to restart a daemon when config files are changed.  IN the case
>  of the pbs_mom, this should be OK as long as nothing is running on the ndoe.
>
>   - Jim Coyle
>
>
>
>  > Dear Garrick (and any other Torque users)
>  >
>  > I did as you suggested and my mom_priv/config file is now:
>  >
>  > $pbsserver dphpc1011.dph.aber.ac.uk
>  > $usecp *:/data01 /data01
>  > $logevent       255
>  >
>  > but this still does not solve the problem.
>  >
>  > The strange thing is that I have managed to get this facility to work
>  > in the past, I used the following line
>  >
>  > $usecp dphpc1011.dph.aber.ac.uk:/users/guest1 /users/guest1
>  >
>  > but then due to a technical problem (not my fault) I was forced to
>  > reboot one of the slaves and after that neither the line you suggest
>  > or the one I was using has worked.
>  >
>  > Can I interrogate the successfully copied files in some way to find
>  > out what the system settings were which allowed them to be copied?
>  >
>  > Is there any way to gain more diagnostic information from the system
>  > as to why the files are not being copied?
>  >
>  > Is there something I need to do after making changes to my
>  > mom_priv/config file - eg reboot the slave? I have tried restarting
>  > pbserver on the master each time after altering the mom_priv/config
>  > file but this does not help.
>  >
>  > Please don't hesitate to let me know if there is any more detail I can give you.
>  >
>  > Many thanks for your patience!!
>  >
>  > Adil
>  >
>  > On Tue, Feb 26, 2008 at 6:46 PM, Garrick Staples <garrick at usc.edu> wrote:
>  > > On Tue, Feb 26, 2008 at 06:07:19PM +0000, Adil Mughal alleged:
>  > >
>  > > > $usecp dphpc1011.dph.aber.ac.uk:/users/guest1 /users/guest1
>  > >  >
>  > >  > Unable to copy file /var/spool/torque/spool/168.dphpc10.OU to
>  > >  > guest1 at dphpc1011.dph.aber.ac.uk:/data01/guest1/STDIN.o168
>  > >
>  > >  So you'd want a $usecp line for /data01.
>  > >   $usecp *:/data01 /data01
>  > >
>  > >  --
>  > >
>  > >
>  > > Garrick Staples, GNU/Linux HPCC SysAdmin
>  > >  University of Southern California
>  > >
>  > >  Please avoid sending me Word or PowerPoint attachments.
>  > >  See http://www.gnu.org/philosophy/no-word-attachments.html
>  > >
>  > > _______________________________________________
>  > >  torqueusers mailing list
>  > >  torqueusers at supercluster.org
>  > >  http://www.supercluster.org/mailman/listinfo/torqueusers
>  > >
>  > >
>  > _______________________________________________
>  > torqueusers mailing list
>  > torqueusers at supercluster.org
>  > http://www.supercluster.org/mailman/listinfo/torqueusers
>  >
>
>
>
>
>


More information about the torqueusers mailing list