[torqueusers] PBS_NODEFILE incomplete

Christopher Samuel samuel at unimelb.edu.au
Tue Dec 21 15:53:18 MST 2010


Hi Scott,

On 21/12/10 16:44, Scott Hazelhurst wrote:


> 2. checkjob -v --------------------------------------------------
> 
> 
> checking job 5766 (RM job '5766.cream-ce.core.wits.ac.za')
> 
[..]
> Allocated Nodes:
> [n05.core.wits.ac.za:8][n04.core.wits.ac.za:8]

So Maui has got everything right by the look of it.

> 3. Result of qstat -f

[...]
>     exec_host = n04.core.wits.ac.za/7+n04.core.wits.ac.za/6+
>         n04.core.wits.ac.za/5+n04.core.wits.ac.za/4+
>         n04.core.wits.ac.za/3+n04.core.wits.ac.za/2+
>         n04.core.wits.ac.za/1+n04.core.wits.ac.za/0

Awooga - exec_host hasn't been set to match what Maui
thinks it should be!  It lists only the 8 slots on n04,
and n05 is missing entirely.  Do you have any errors in
your Maui logs or pbs_server logs corresponding to this job?
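
If it helps to check other jobs for the same mismatch, here is a
rough sketch in Python that compares the two strings. The function
names are made up for illustration, and it only understands the
"[host:count]" form printed by checkjob and the per-slot "host/N+..."
form of exec_host shown in this thread; other Torque versions may
format exec_host differently.

```python
import re
from collections import Counter


def parse_maui_allocation(text):
    """Turn '[host:count][host:count]' (checkjob output) into {host: count}."""
    return {host: int(n) for host, n in re.findall(r"\[([^\[\]:]+):(\d+)\]", text)}


def parse_exec_host(text):
    """Turn 'host/slot+host/slot+...' (qstat -f output) into {host: slot count}."""
    # qstat -f wraps long values across lines, so drop all whitespace first.
    joined = "".join(text.split())
    hosts = (entry.split("/")[0] for entry in joined.split("+") if entry)
    return dict(Counter(hosts))


# Values taken from the job discussed in this thread.
maui = parse_maui_allocation("[n05.core.wits.ac.za:8][n04.core.wits.ac.za:8]")
torque = parse_exec_host(
    "+".join(f"n04.core.wits.ac.za/{slot}" for slot in range(7, -1, -1))
)

# Slots Maui allocated that do not show up in exec_host.
missing = {
    host: count - torque.get(host, 0)
    for host, count in maui.items()
    if torque.get(host, 0) != count
}
print(missing)
```

An empty result would mean the two views agree; for this job it
reports the slots on n05 as missing.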

> 4. Output of PBS_NODES
> 
[...]

That's not actually wrong in the sense that it agrees with
where the pbs_server thinks the job should be running, but
that's not where Maui thinks it should be.
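
To see what a job was actually handed, one option is to count the
entries in $PBS_NODEFILE from inside the job. The file normally has
one hostname per line for each allocated slot. This is only a sketch:
count_slots is a hypothetical helper name, and the script does nothing
when PBS_NODEFILE is not set.

```python
import os
from collections import Counter


def count_slots(lines):
    """Count how many times each hostname appears, ignoring blank lines."""
    return dict(Counter(line.strip() for line in lines if line.strip()))


# Inside a running job Torque sets PBS_NODEFILE to the path of the node list.
nodefile = os.environ.get("PBS_NODEFILE")
if nodefile:
    with open(nodefile) as handle:
        for host, slots in sorted(count_slots(handle).items()):
            print(f"{host}: {slots} slot(s)")
```

The per-host totals can then be compared with the "Allocated Nodes"
line from checkjob.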

I'd guess a communication error between Maui and Torque,
though your Torque version is pretty old...

I notice that you're also running a very old version of Maui:
there have been 4 more snapshot releases since that one,
plus an entire new version (3.3). Any chance you could
try a newer one in case it's been fixed since then?

cheers,
Chris
-- 
 Christopher Samuel - Senior Systems Administrator
 VLSCI - Victorian Life Sciences Computational Initiative
 Email: samuel at unimelb.edu.au Phone: +61 (0)3 903 55545
         http://www.vlsci.unimelb.edu.au/


