[torqueusers] how to browse the stdout and stderr files of a running batch job

David Beer dbeer at adaptivecomputing.com
Tue Nov 6 16:29:53 MST 2012


Gareth makes an excellent point. If you choose these options, you may also
want to adjust how often your jobs flush stdout and stderr.
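
To illustrate the flushing trade-off, here is a minimal Python sketch of
application-side flushing. It is not a Torque setting, and the names
report_progress and flush_every are made up for the example. It assumes the
job writes one progress line per step and that flushing every N lines is an
acceptable compromise between seeing output promptly and limiting small
writes to the shared file system.

```python
import sys


def report_progress(steps, stream=sys.stdout, flush_every=100):
    """Write one line per step, flushing only every `flush_every` lines.

    Hypothetical helper for illustration: batching flushes keeps the number
    of small writes to a shared file system down, at the cost of the output
    file lagging behind the job by up to `flush_every` lines.
    Returns the number of explicit flushes performed.
    """
    flushes = 0
    for i, message in enumerate(steps, start=1):
        stream.write(f"{message}\n")
        if i % flush_every == 0:
            stream.flush()
            flushes += 1
    # Final flush so the last lines are visible when the loop ends.
    stream.flush()
    return flushes + 1
```

A smaller flush_every makes the output file follow the job more closely but
produces more I/O transactions; a larger value does the opposite.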

David

On Tue, Nov 6, 2012 at 4:18 PM, <Gareth.Williams at csiro.au> wrote:

> Spooling directly to a shared parallel file system can have a significant
> performance impact (lots of small I/O transactions). Use these options with
> caution!
>
> Gareth
> ________________________________________
> From: Nick Ihli [nick at adaptivecomputing.com]
> Sent: Saturday, 3 November 2012 12:32 PM
> To: Torque Users Mailing List
> Cc: Torque Users Mailing List
> Subject: Re: [torqueusers] how to browse the stdout and stderr files of a
>       running batch job
>
> You can check out $spool_as_final_name and $nospool_dir_list in the mom
> config file.
>
> You can read more about that in Appendix C in the torque docs.
>
> -- Nick Ihli
>
> On Nov 2, 2012, at 9:44 AM, Nick Lindberg <nlindberg at mkei.org> wrote:
>
> Hello,
>
> It is possible to view these files, but Torque/PBS stores them on the
> local "mother superior" node while the job is running and only transfers
> them to their final resting place once the job is finished.  Depending on
> your credentials, you can find out which node your job is running on
> and check the Torque/PBS spool directory there for the files as they are
> being updated.  It is nice to be able to view the files while the job runs,
> just as you say, to monitor the progress of a simulation.  I find myself
> doing this quite often for our users.
>
> I'm guessing this is what the scripts you mention are doing in an
> automated way.  It would be nice, however, if this functionality were
> built into PBS/Torque.
>
> Let me know if this is of interest and I can help you figure it out.
>
> Nick Lindberg
> Sr. Consulting Engineer, HPC
> Milwaukee Institute
> 414.727.6413 (W)
> http://www.mkei.org
>
> From: "mauede at alice.it" <mauede at alice.it>
> Reply-To: "mauede at alice.it" <mauede at alice.it>, Torque Users Mailing
> List <torqueusers at supercluster.org>
> Date: Wednesday, October 31, 2012 10:44 PM
> To: "torqueusers at supercluster.org" <torqueusers at supercluster.org>
> Subject: [torqueusers] how to browse the stdout and stderr files of a
> running batch job
>
> I submit long Monte Carlo simulations to the PBS scheduler. I redirect
> each simulation's stdout and stderr to a uniquely named file.
> Unfortunately, as far as I know, PBS does not let me peek at such a file
> before the simulation has finished, whether it completed OK or aborted.
> Some supercomputer centers have developed commands, like qcat and qpeep,
> that allow users to monitor number-crunching programs and browse their
> stdout and stderr. This feature helps save a lot of CPU time when the
> submitted jobs are not doing what is expected.
> Is there a freeware implementation of qcat or qpeep available anywhere?
> Thank you in advance.
> Regards,
> maura
>
>
> _______________________________________________
> torqueusers mailing list
> torqueusers at supercluster.org
> http://www.supercluster.org/mailman/listinfo/torqueusers
>



-- 
David Beer | Senior Software Engineer
Adaptive Computing