[torqueusers] Question about file spooling, spool directory
Craig Tierney
Craig.Tierney at noaa.gov
Mon Nov 10 13:54:19 MST 2008
I used to use OpenPBS ages ago, and I am looking to install Torque
on my cluster. I see that Torque still has the behavior that the
standard output files (stdout and stderr) are still spooled and
then copied only when the job is completed. I understand that when
PBS was created, that a shared filesystem was not necessary a guarantee,
and the copy was needed.
Now, I don't see why this function would be needed. NFS can scale quite
well, and there are many distributed filesystems (Lustre, Panasas, etc.)
that can scale so that all nodes have a uniform shared filesystem.
So, why does Torque still do the spool and copy method? With Torque,
you can spool the files into the users home directory (with --disable-spool),
although the filenames are still the spooled names, not the final name.
Is there a fundamental technological reason (besides no one has
written the code) for not just putting the stderr and stdout into the "final"
location of that data?
Craig
--
Craig Tierney (craig.tierney at noaa.gov)
More information about the torqueusers
mailing list