[torqueusers] .OU and .ER files not being sent to user

Jerry Smith jdsmit at sandia.gov
Wed Feb 27 09:50:17 MST 2008


You can use :

momctl -r <path to config>

To re-read config changes, on running moms.  I would suggest doing this 
on moms not currently running jobs as well.

Jerry

James J Coyle wrote:
>   You should at least restart the pbs_mom on the node after changing
> the config file, otherwise, I don't think it will re-read the config
> file.
>
>   This is similar for any daemon, as the daemon will load configuration
> tables into memory when it starts and not re-read config files from
> disk before every action it takes.
>
>   Possibly a kill -HUP on the pbs_mom pid would work force a re-read
> of the dconfig files, but that depends on the programming of pbs_mom.
> Safest it to restart a daemon when config files are changed.  IN the case
> of the pbs_mom, this should be OK as long as nothing is running on the ndoe.
>
>  - Jim Coyle
>
>   
>> Dear Garrick (and any other Torque users)
>>
>> I did as you suggested and my mom_priv/config file is now:
>>
>> $pbsserver dphpc1011.dph.aber.ac.uk
>> $usecp *:/data01 /data01
>> $logevent       255
>>
>> but this still does not solve the problem.
>>
>> The strange thing is that I have managed to get this facility to work
>> in the past, I used the following line
>>
>> $usecp dphpc1011.dph.aber.ac.uk:/users/guest1 /users/guest1
>>
>> but then due to a technical problem (not my fault) I was forced to
>> reboot one of the slaves and after that neither the line you suggest
>> or the one I was using has worked.
>>
>> Can I interrogate the successfully copied files in some way to find
>> out what the system settings were which allowed them to be copied?
>>
>> Is there any way to gain more diagnostic information from the system
>> as to why the files are not being copied?
>>
>> Is there something I need to do after making changes to my
>> mom_priv/config file - eg reboot the slave? I have tried restarting
>> pbserver on the master each time after altering the mom_priv/config
>> file but this does not help.
>>
>> Please don't hesitate to let me know if there is any more detail I can give you.
>>
>> Many thanks for your patience!!
>>
>> Adil
>>
>> On Tue, Feb 26, 2008 at 6:46 PM, Garrick Staples <garrick at usc.edu> wrote:
>>     
>>> On Tue, Feb 26, 2008 at 06:07:19PM +0000, Adil Mughal alleged:
>>>
>>>       
>>>> $usecp dphpc1011.dph.aber.ac.uk:/users/guest1 /users/guest1
>>>>         
>>>  >
>>>  > Unable to copy file /var/spool/torque/spool/168.dphpc10.OU to
>>>  > guest1 at dphpc1011.dph.aber.ac.uk:/data01/guest1/STDIN.o168
>>>
>>>  So you'd want a $usecp line for /data01.
>>>   $usecp *:/data01 /data01
>>>
>>>  --
>>>
>>>
>>> Garrick Staples, GNU/Linux HPCC SysAdmin
>>>  University of Southern California
>>>
>>>  Please avoid sending me Word or PowerPoint attachments.
>>>  See http://www.gnu.org/philosophy/no-word-attachments.html
>>>
>>> _______________________________________________
>>>  torqueusers mailing list
>>>  torqueusers at supercluster.org
>>>  http://www.supercluster.org/mailman/listinfo/torqueusers
>>>
>>>
>>>       
>> _______________________________________________
>> torqueusers mailing list
>> torqueusers at supercluster.org
>> http://www.supercluster.org/mailman/listinfo/torqueusers
>>
>>     
>
>
>
>
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
>
>
>   
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://www.supercluster.org/pipermail/torqueusers/attachments/20080227/6b4135f0/attachment.html


More information about the torqueusers mailing list