[torqueusers] pbs_mom getsize() failed errors.
James A. Peltier
jpeltier at cs.sfu.ca
Thu Mar 5 22:19:09 MST 2009
Hi All,
Things are starting to stabilize on my cluster again. However, a couple
of nodes are still seeing errors, and the i386 nodes now seem to be the
ones affected.
Mar 5 20:58:29 a08-nll pbs_mom: TMomFinalizeChild, about to create cpuset for job 10204.queen.
Mar 5 20:58:29 a08-nll pbs_mom: create_jobset, CPUSET: 0 job 10204.queen path /dev/cpuset/torque/10204.queen/cpus
Mar 5 20:58:29 a08-nll pbs_mom: create_jobset, TASKSET: /dev/cpuset/torque/10204.queen/0/cpus cpus 0
Mar 5 20:58:29 a08-nll pbs_mom: move_to_jobset, CPUSET MOVE: /dev/cpuset/torque/10204.queen/tasks 9486
Mar 5 20:58:29 a08-nll pbs_mom: Bad file descriptor (9) in TMomFinalizeChild, getsize() failed for mem/pmem in mom_set_limits
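For anyone who wants to pull just the error lines out of syslog on the affected nodes, here is a rough sketch in Python. It assumes the message layout seen in the excerpt above ("<text> (<errno>) in <function>, <message>"); the regular expression and the function name find_mom_errors are my own illustration and are not part of TORQUE.

```python
import re

# Assumed layout, taken from the excerpt above:
#   <timestamp> <host> pbs_mom: <error text> (<errno>) in <function>, <message>
MOM_ERROR = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+[\d:]+)\s+(?P<host>\S+)\s+pbs_mom:\s+"
    r"(?P<errtext>.+?)\s+\((?P<errno>\d+)\)\s+in\s+(?P<func>\w+),\s+(?P<msg>.*)$"
)


def find_mom_errors(lines):
    """Return one dict per pbs_mom line that carries an errno in parentheses."""
    errors = []
    for line in lines:
        match = MOM_ERROR.match(line.strip())
        if match:
            record = match.groupdict()
            record["errno"] = int(record["errno"])
            errors.append(record)
    return errors


sample = [
    "Mar 5 20:58:29 a08-nll pbs_mom: move_to_jobset, CPUSET MOVE: "
    "/dev/cpuset/torque/10204.queen/tasks 9486",
    "Mar 5 20:58:29 a08-nll pbs_mom: Bad file descriptor (9) in "
    "TMomFinalizeChild, getsize() failed for mem/pmem in mom_set_limits",
]

for err in find_mom_errors(sample):
    print(err["host"], err["errno"], err["func"], err["msg"])
```

Informational lines such as the cpuset messages do not match the pattern, so only the failure line is reported.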
a08-nll
     state = free
     np = 4
     properties = matlab,freesurfer_v4.1.0
     ntype = cluster
     status = arch=i386,opsys=linux,uname=Linux a08-nll 2.6.18-92.1.22.el5PAE #1 SMP Tue Dec 16 12:36:25 EST 2008 i686,sessions=2527 2612 3548 9247 21042 24153 27555 28510,nsessions=8,nusers=7,idletime=1730,totmem=6715704kb,availmem=6366156kb,physmem=4675460kb,ncpus=4,loadave=0.29,netload=632908548,size=25348656kb:25614624kb,state=free,jobs=,varattr=,rectime=1236316680
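To compare the memory figures across nodes, the status string can be split into key/value pairs. The following is only a minimal sketch: it assumes the status value is a comma-separated list of key=value items, as in the output above, and the helper names parse_status and kb_to_bytes are hypothetical.

```python
def parse_status(status):
    """Split a comma-separated 'key=value' status string into a dict."""
    fields = {}
    for item in status.split(","):
        key, sep, value = item.partition("=")
        if sep:  # skip anything that is not a key=value pair
            fields[key.strip()] = value.strip()
    return fields


def kb_to_bytes(value):
    """Convert a value such as '4675460kb' to a byte count."""
    if not value.endswith("kb"):
        raise ValueError("expected a value ending in 'kb': %r" % value)
    return int(value[:-2]) * 1024


# Values copied from the status line of a08-nll above.
status = (
    "arch=i386,opsys=linux,nsessions=8,nusers=7,idletime=1730,"
    "totmem=6715704kb,availmem=6366156kb,physmem=4675460kb,ncpus=4"
)

fields = parse_status(status)
for name in ("totmem", "availmem", "physmem"):
    print(name, kb_to_bytes(fields[name]), "bytes")
```

Running this against the output for each node makes it easier to see whether the nodes reporting the error share the same architecture and memory figures.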
--
James A. Peltier
Systems Analyst (FASNet), VIVARIUM Technical Director
Simon Fraser University - Burnaby Campus
Phone : 778-782-6573
Fax : 778-782-3045
E-Mail : jpeltier at sfu.ca
Website : http://www.fas.sfu.ca | http://vivarium.cs.sfu.ca
http://blogs.sfu.ca/people/jpeltier
MSN : subatomic_spam at hotmail.com
Your mouse has moved. Windows has detected hardware
changes that require a reboot. Click OK to reboot.