[torqueusers] how to browse the stdout and stderr files of a running batch job
nick at adaptivecomputing.com
Fri Nov 2 19:32:55 MDT 2012
You can check out $spool_as_final_name and $nospool_dir_list in the mom config file.
You can read more about that in Appendix C in the torque docs.
-- Nick Ihli
On Nov 2, 2012, at 9:44 AM, Nick Lindberg <nlindberg at mkei.org> wrote:
> It is possible to view these files, but Torque/PBS stores the file on the local "mother superior" node while the job is running and only transfers them to their final resting place once the job is finished. Depending on what your credentials are, you can figure out what node your job is running and check in the torque/pbs spool directory for the files as they are being updated. It is nice to be able to view the files as they run just as you say to monitor the process of a simulation. I find myself doing this quite often for our users.
> I'm guessing this is what the scripts you mention are doing in an automated way. It would be nice, however, if this functionality was built in to PBS/Torque.
> Let me know if this is of interest and I can help you figure it out.
> Nick Lindberg
> Sr. Consulting Engineer, HPC
> Milwaukee Institute
> 414.727.6413 (W)
> From: "mauede at alice.it" <mauede at alice.it>
> Reply-To: "mauede at alice.it" <mauede at alice.it>, Torque Users Mailing List <torqueusers at supercluster.org>
> Date: Wednesday, October 31, 2012 10:44 PM
> To: "torqueusers at supercluster.org" <torqueusers at supercluster.org>
> Subject: [torqueusers] how to browse the stdout and stderr files of a running batch job
> I submit long Monte Carlo simulations to the PBS scheduler. I redirect each simulation stdout and stderr to a uniquely named file. Unluckily, as far as I know, PBS does not allow me to peep at such a file before the simulation is finished, whether it has completed OK or has aborted.
> Some super-computer centers have developed commands, like qcat, qpeep, that allow for monitoring number-cruncher programs and browse their stdout and stderr. This feature helps saving a lot of CPU time when the submitted jobs are not doing what is expected.
> Is there a free-ware implementation of qcat or qpeep from some websites ?
> Thank you in advance.
> torqueusers mailing list
> torqueusers at supercluster.org
-------------- next part --------------
An HTML attachment was scrubbed...
More information about the torqueusers