[torqueusers] Weird bug in torque 4.1.3 and 4.1.4 (blcr module not loaded)

LAHAYE Olivier olivier.lahaye at cea.fr
Wed Dec 12 08:03:45 MST 2012


Hi,

I think I've triggered a really weird bug in torque.
I've built torque with blcr support.

If the blcr module is not loaded on a node where a job is scheduled to run, the job hangs, with symptoms ranging from no error message at all to "unable to set up IO" or the like.

What helped me was a warning that pbsdsh wrote to the job's error log. Once I fixed the blcr issue (by starting the blcr service, which is responsible for modprobing the module), the whole pbs system ran fine.
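
For anyone who wants to check a node before jobs land on it, here is a minimal sketch that looks for the module in /proc/modules-style text. It assumes the kernel module shows up under the name "blcr"; the function names (loaded_modules, blcr_is_loaded) are hypothetical and not part of torque or blcr.

```python
def loaded_modules(proc_modules_text):
    """Return the set of module names found in /proc/modules-style text.

    Each line of /proc/modules starts with the module name, followed by
    whitespace-separated fields (size, use count, ...).
    """
    names = set()
    for line in proc_modules_text.splitlines():
        fields = line.split()
        if fields:
            names.add(fields[0])
    return names


def blcr_is_loaded(proc_modules_text, module_name="blcr"):
    """True if module_name is listed (exact match on the first field).

    Assumption: the module is registered as "blcr"; adjust module_name
    if your build names it differently.
    """
    return module_name in loaded_modules(proc_modules_text)
```

On a compute node this could be called as blcr_is_loaded(open("/proc/modules").read()), for example from a node health-check script, so that a missing module is reported before a job hangs on it.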

I'm not skilled enough to track down the exact cause, but I hope this report helps; at least it's better than nothing.

Below is the output of pbs_server --about (build configuration), followed by the pbs_mom log from the executing node:


/opt/pbs/sbin/pbs_server --about
package:     torque 4.1.4
sourcedir:   /root/rpmbuild/BUILD/torque-4.1.4
configure:    '--prefix=/opt/pbs' '--mandir=/opt/pbs/man' '--libdir=/opt/pbs/lib64' '--includedir=/opt/pbs/include' '--with-server-home=/var/lib/torque' '--with-pam=/lib64/security' '--with-sendmail=/usr/sbin/sendmail' '--with-default-server=pbs_oscar' '--with-server-name-file=server_name' '--enable-gui' '--enable-syslog' '--with-tcl' '--enable-rpp' '--with-rcp=scp' '--enable-drmaa' '--enable-blcr' '--enable-nvidia-gpus' '--enable-munge-auth' 'CC=' 'CFLAGS=' 'LDFLAGS=' 'PKG_CONFIG_PATH=/usr/lib64/pkgconfig:/usr/share/pkgconfig'
buildcflags:  -D_LARGEFILE64_SOURCE -DMUNGE_AUTH
buildhost:   is005045.intra.cea.fr
builddate:   Tue Dec 11 14:06:01 CET 2012
builddir:    /root/rpmbuild/BUILD/torque-4.1.4
builduser:   root
installdir:  /opt/pbs
serverhome:  /var/lib/torque
version:     4.1.4-snap.201211201307

[...] (mom_log on oscarnode49)
12/11/2012 15:06:11;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;req_commit:starting job execution
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;0: oscarnode49/11
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;1: oscarnode49/10
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;2: oscarnode49/9
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;3: oscarnode49/8
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;4: oscarnode49/7
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;5: oscarnode49/6
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;6: oscarnode49/5
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;7: oscarnode49/4
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;8: oscarnode49/3
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;9: oscarnode49/2
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;10: oscarnode49/1
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;11: oscarnode49/0
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;12: oscarnode48/11
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;13: oscarnode48/10
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;14: oscarnode48/9
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;15: oscarnode48/8
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;16: oscarnode48/7
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;17: oscarnode48/6
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;18: oscarnode48/5
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;19: oscarnode48/4
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;20: oscarnode48/3
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;21: oscarnode48/2
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;22: oscarnode48/1
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;23: oscarnode48/0
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;24: oscarnode47/5
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;25: oscarnode47/4
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;26: oscarnode47/3
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;27: oscarnode47/2
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;28: oscarnode47/1
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;29: oscarnode47/0
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;job_nodes;job: 15.is003274.intra.cea.fr numnodes=3 numvnod=30
12/11/2012 15:06:11;0001;   pbs_mom.3661;Svr;pbs_mom;LOG_DEBUG::init_groups, pre-sigprocmask
12/11/2012 15:06:11;0001;   pbs_mom.3661;Svr;pbs_mom;LOG_DEBUG::init_groups, post-initgroups
12/11/2012 15:06:11;0002;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;allocate_demux_sockets: stdout: 10:56644  stderr: 11:43813
12/11/2012 15:06:11;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;start_exec: total wire-up time for job 0.2247
12/11/2012 15:06:11;0001;   pbs_mom.3661;Svr;pbs_mom;LOG_DEBUG::mom_checkpoint_job_has_checkpoint, FALSE
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;about to fork child which will become job
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;phase 2 of job launch successfully completed
12/11/2012 15:06:11;0002;   pbs_mom.3977;n/a;mom_close_poll;entered
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;task/session info loaded
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;TMomFinalizeJob3;Job 15.is003274.intra.cea.fr read start return code=0 session=3977
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;saving task (TMomFinalizeJob3)
12/11/2012 15:06:11;0008;   pbs_mom.3661;Svr;task_save;saving task in /var/lib/torque/mom_priv/jobs/15.is003274.intra.cea.fr.TK
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;TMomFinalizeJob3;job 15.is003274.intra.cea.fr started, pid = 3977
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;exec_job_on_ms:job successfully started
12/11/2012 15:06:11;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;req_commit:job execution started
12/11/2012 15:06:11;0008;   pbs_mom.3661;Job;tcp_request;tcp_request: fd 8 addr 127.0.0.1:43387
12/11/2012 15:06:11;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;tm_request: job 15.is003274.intra.cea.fr cookie CAAFC3D6302C31FCF9BD92DE9205655D task 1 com 100 event 1
12/11/2012 15:06:11;0002;   pbs_mom.3661;node;close_conn;Connection 8 - func 414387
12/11/2012 15:06:11;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;matching task located, marking interface closed
12/11/2012 15:06:11;0008;   pbs_mom.3661;Job;tcp_request;tcp_request: fd 8 addr 127.0.0.1:43388
12/11/2012 15:06:11;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;tm_request: job 15.is003274.intra.cea.fr cookie CAAFC3D6302C31FCF9BD92DE9205655D task 1 com 102 event 2
12/11/2012 15:06:11;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;tm_spawn_request: SPAWN 15.is003274.intra.cea.fr on node 0
12/11/2012 15:06:11;0001;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;saving task (TM_SPAWN)
12/11/2012 15:06:11;0008;   pbs_mom.3661;Svr;task_save;saving task in /var/lib/torque/mom_priv/jobs/15.is003274.intra.cea.fr.TK
12/11/2012 15:06:11;0002;   pbs_mom.4000;n/a;mom_close_poll;entered
12/11/2012 15:06:31;0001;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;task not started, 'hostname', stdio setup failed (see syslog)
12/11/2012 15:06:31;0008;   pbs_mom.3661;Job;scan_for_terminated;entered
12/11/2012 15:06:31;0080;   pbs_mom.3661;Svr;mom_get_sample;proc_array load started
12/11/2012 15:06:31;0080;   pbs_mom.3661;n/a;mom_get_sample;proc_array loaded - nproc=285
12/11/2012 15:06:31;0080;   pbs_mom.3661;n/a;cput_sum;proc_array loop start - jobid = 15.is003274.intra.cea.fr
12/11/2012 15:06:31;0002;   pbs_mom.3661;n/a;cput_sum;cput_sum: session=3977 pid=3977 cputime=0 (cputfactor=1.000000)
12/11/2012 15:06:31;0002;   pbs_mom.3661;n/a;cput_sum;cput_sum: session=3977 pid=3998 cputime=0 (cputfactor=1.000000)
12/11/2012 15:06:31;0002;   pbs_mom.3661;n/a;cput_sum;cput_sum: session=3977 pid=3999 cputime=0 (cputfactor=1.000000)
12/11/2012 15:06:31;0080;   pbs_mom.3661;n/a;mem_sum;proc_array loop start - jobid = 15.is003274.intra.cea.fr
12/11/2012 15:06:31;0002;   pbs_mom.3661;n/a;mem_sum;mem_sum: session=3977 pid=3977 vsize=16019456 sum=16019456
12/11/2012 15:06:31;0002;   pbs_mom.3661;n/a;mem_sum;mem_sum: session=3977 pid=3998 vsize=9412608 sum=25432064
12/11/2012 15:06:31;0002;   pbs_mom.3661;n/a;mem_sum;mem_sum: session=3977 pid=3999 vsize=55603200 sum=81035264
12/11/2012 15:06:31;0080;   pbs_mom.3661;n/a;resi_sum;proc_array loop start - jobid = 15.is003274.intra.cea.fr
12/11/2012 15:06:31;0002;   pbs_mom.3661;n/a;resi_sum;resi_sum: session=3977 pid=3977 rss=1708032 sum=1708032
12/11/2012 15:06:31;0002;   pbs_mom.3661;n/a;resi_sum;resi_sum: session=3977 pid=3998 rss=1302528 sum=3010560
12/11/2012 15:06:31;0002;   pbs_mom.3661;n/a;resi_sum;resi_sum: session=3977 pid=3999 rss=2523136 sum=5533696
12/11/2012 15:06:31;0008;   pbs_mom.3661;Job;scan_for_terminated;pid 4000 not tracked, statloc=65024, exitval=254
12/11/2012 15:06:31;0002;   pbs_mom.3661;node;close_conn;Connection 8 - func 414387
12/11/2012 15:06:31;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;matching task located, marking interface closed
12/11/2012 15:06:31;0008;   pbs_mom.3661;Job;tcp_request;tcp_request: fd 8 addr 127.0.0.1:43399
12/11/2012 15:06:31;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;tm_request: job 15.is003274.intra.cea.fr cookie CAAFC3D6302C31FCF9BD92DE9205655D task 1 com 102 event 3
12/11/2012 15:06:31;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;tm_spawn_request: SPAWN 15.is003274.intra.cea.fr on node 1
12/11/2012 15:06:31;0008;   pbs_mom.3661;Job;tcp_request;tcp_request: fd 10 addr 10.0.238.149:606
12/11/2012 15:06:31;0002;   pbs_mom.3661;Svr;im_request;connect from 10.0.238.149:606
12/11/2012 15:06:31;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;im_request:rec req 'SPAWN_TASK' (3) for job 15.is003274.intra.cea.fr from 10.0.238.149:606 ev 3 task 1 cookie CAAFC3D6302C31FCF9BD92DE9205655D
12/11/2012 15:06:31;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;INFO:     received request 'SPAWN_TASK' from 10.0.238.149:606 for job '15.is003274.intra.cea.fr' (spawning task on node '0' with taskid=3, globid='none'
12/11/2012 15:06:31;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;saving task (IM_SPAWN_TASK)
12/11/2012 15:06:31;0008;   pbs_mom.3661;Svr;task_save;saving task in /var/lib/torque/mom_priv/jobs/15.is003274.intra.cea.fr.TK
12/11/2012 15:06:31;0002;   pbs_mom.4001;n/a;mom_close_poll;entered
12/11/2012 15:06:51;0001;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;task not started, 'hostname', stdio setup failed (see syslog)
12/11/2012 15:06:51;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;ERROR:    received request 'SPAWN_TASK' from 10.0.238.149:606 for job '15.is003274.intra.cea.fr' (cannot start task)
12/11/2012 15:06:51;0008;   pbs_mom.3661;Job;scan_for_terminated;entered
12/11/2012 15:06:51;0080;   pbs_mom.3661;Svr;mom_get_sample;proc_array load started
12/11/2012 15:06:51;0080;   pbs_mom.3661;n/a;mom_get_sample;proc_array loaded - nproc=285
12/11/2012 15:06:51;0080;   pbs_mom.3661;n/a;cput_sum;proc_array loop start - jobid = 15.is003274.intra.cea.fr
12/11/2012 15:06:51;0002;   pbs_mom.3661;n/a;cput_sum;cput_sum: session=3977 pid=3977 cputime=0 (cputfactor=1.000000)
12/11/2012 15:06:51;0002;   pbs_mom.3661;n/a;cput_sum;cput_sum: session=3977 pid=3998 cputime=0 (cputfactor=1.000000)
12/11/2012 15:06:51;0002;   pbs_mom.3661;n/a;cput_sum;cput_sum: session=3977 pid=3999 cputime=0 (cputfactor=1.000000)
12/11/2012 15:06:51;0080;   pbs_mom.3661;n/a;mem_sum;proc_array loop start - jobid = 15.is003274.intra.cea.fr
12/11/2012 15:06:51;0002;   pbs_mom.3661;n/a;mem_sum;mem_sum: session=3977 pid=3977 vsize=16019456 sum=16019456
12/11/2012 15:06:51;0002;   pbs_mom.3661;n/a;mem_sum;mem_sum: session=3977 pid=3998 vsize=9412608 sum=25432064
12/11/2012 15:06:51;0002;   pbs_mom.3661;n/a;mem_sum;mem_sum: session=3977 pid=3999 vsize=55603200 sum=81035264
12/11/2012 15:06:51;0080;   pbs_mom.3661;n/a;resi_sum;proc_array loop start - jobid = 15.is003274.intra.cea.fr
12/11/2012 15:06:51;0002;   pbs_mom.3661;n/a;resi_sum;resi_sum: session=3977 pid=3977 rss=1708032 sum=1708032
12/11/2012 15:06:51;0002;   pbs_mom.3661;n/a;resi_sum;resi_sum: session=3977 pid=3998 rss=1302528 sum=3010560
12/11/2012 15:06:51;0002;   pbs_mom.3661;n/a;resi_sum;resi_sum: session=3977 pid=3999 rss=2543616 sum=5554176
12/11/2012 15:06:51;0008;   pbs_mom.3661;Job;scan_for_terminated;pid 4001 not tracked, statloc=65024, exitval=254
12/11/2012 15:06:51;0008;   pbs_mom.3661;Job;tcp_request;tcp_request: fd 10 addr 10.0.238.149:692
12/11/2012 15:06:51;0002;   pbs_mom.3661;Svr;im_request;connect from 10.0.238.149:692
12/11/2012 15:06:51;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;im_request:rec req 'ERROR' (99) for job 15.is003274.intra.cea.fr from 10.0.238.149:692 ev 3 task 1 cookie CAAFC3D6302C31FCF9BD92DE9205655D
12/11/2012 15:06:51;0001;   pbs_mom.3661;Svr;pbs_mom;LOG_ERROR::im_request, Response recieved from client 10.0.238.149:692 (15003) jobid 15.is003274.intra.cea.fr
12/11/2012 15:06:51;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;im_request: REQUEST 3 15.is003274.intra.cea.fr returned ERROR 17000
12/11/2012 15:06:51;0002;   pbs_mom.3661;node;close_conn;Connection 8 - func 414387
12/11/2012 15:06:51;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;matching task located, marking interface closed
12/11/2012 15:06:51;0008;   pbs_mom.3661;Job;tcp_request;tcp_request: fd 8 addr 127.0.0.1:43410
12/11/2012 15:06:51;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;tm_request: job 15.is003274.intra.cea.fr cookie CAAFC3D6302C31FCF9BD92DE9205655D task 1 com 102 event 4
12/11/2012 15:06:51;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;tm_spawn_request: SPAWN 15.is003274.intra.cea.fr on node 2
12/11/2012 15:06:51;0008;   pbs_mom.3661;Job;tcp_request;tcp_request: fd 10 addr 10.0.238.149:310
12/11/2012 15:06:51;0002;   pbs_mom.3661;Svr;im_request;connect from 10.0.238.149:310
12/11/2012 15:06:51;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;im_request:rec req 'SPAWN_TASK' (3) for job 15.is003274.intra.cea.fr from 10.0.238.149:310 ev 4 task 1 cookie CAAFC3D6302C31FCF9BD92DE9205655D
12/11/2012 15:06:51;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;INFO:     received request 'SPAWN_TASK' from 10.0.238.149:310 for job '15.is003274.intra.cea.fr' (spawning task on node '0' with taskid=4, globid='none'
12/11/2012 15:06:51;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;saving task (IM_SPAWN_TASK)
12/11/2012 15:06:51;0008;   pbs_mom.3661;Svr;task_save;saving task in /var/lib/torque/mom_priv/jobs/15.is003274.intra.cea.fr.TK
12/11/2012 15:06:51;0002;   pbs_mom.4002;n/a;mom_close_poll;entered
12/11/2012 15:07:11;0001;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;task not started, 'hostname', stdio setup failed (see syslog)
12/11/2012 15:07:11;0008;   pbs_mom.3661;Job;15.is003274.intra.cea.fr;ERROR:    received request 'SPAWN_TASK' from 10.0.238.149:310 for job '15.is003274.intra.cea.fr' (cannot start task)
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;mom_server_all_update_stat;composing status update for server
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;sessions;sessions[0]: pid 2530 sid 2529
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;sessions;sessions[1]: pid 3977 sid 3977
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;sessions;sessions[2]: pid 3998 sid 3977
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;sessions;sessions[2]: pid 3999 sid 3977
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;sessions;nsessions=2
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;sessions;sessions[0]: pid 2530 sid 2529
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;sessions;sessions[1]: pid 3977 sid 3977
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;sessions;sessions[2]: pid 3998 sid 3977
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;sessions;sessions[2]: pid 3999 sid 3977
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;sessions;nsessions=2
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;sessions;sessions[0]: pid 2530 sid 2529
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;sessions;sessions[1]: pid 3977 sid 3977
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;sessions;sessions[2]: pid 3998 sid 3977
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;sessions;sessions[2]: pid 3999 sid 3977
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;sessions;nsessions=2
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;nusers;nusers[0]: pid 2530 uid 496
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;nusers;nusers[1]: pid 3977 uid 1116
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;nusers;nusers[2]: pid 3998 uid 1116
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;nusers;nusers[2]: pid 3999 uid 1116
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;nusers;nusers=2
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;totmem;totmem: total mem=51249725440
12/11/2012 15:07:11;0002;   pbs_mom.3661;n/a;availmem;availmem: free mem=50474262528
12/11/2012 15:07:11;0002;   pbs_mom.3661;node;ncpus;ncpus=12
12/11/2012 15:07:11;0001;   pbs_mom.3661;Svr;pbs_mom;LOG_DEBUG::gpus, gpus: GPU cmd issued: nvidia-smi -q -x 2>&1
12/11/2012 15:07:30;0002;   pbs_mom.3661;n/a;mom_server_update_stat;mom_server_update_stat: sending to server "opsys=linux"
12/11/2012 15:07:30;0002;   pbs_mom.3661;n/a;mom_server_update_stat;mom_server_update_stat: sending to server "uname=Linux oscarnode49 2.6.32-279.14.1.el6.x86_64 #1 SMP Tue Nov 6 23:43:09 UTC 2012 x86_64"
12/11/2012 15:07:30;0002;   pbs_mom.3661;n/a;mom_server_update_stat;mom_server_update_stat: sending to server "sessions=2529 3977"
12/11/2012 15:07:30;0002;   pbs_mom.3661;n/a;mom_server_update_stat;mom_server_update_stat: sending to server "nsessions=2"
12/11/2012 15:07:30;0002;   pbs_mom.3661;n/a;mom_server_update_stat;mom_server_update_stat: sending to server "nusers=2"
[...]
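
The telling lines in the log above are the "task not started, 'hostname', stdio setup failed (see syslog)" entries. As a rough sketch, they can be pulled out of a mom_log like this. It assumes every entry has the six semicolon-separated fields seen above; the field names (timestamp, code, daemon, kind, name, message) are labels chosen for this example, not torque's own terminology.

```python
from collections import namedtuple

# Field names are illustrative labels for the six ';'-separated columns.
MomLogEntry = namedtuple(
    "MomLogEntry", "timestamp code daemon kind name message"
)


def parse_mom_log_line(line):
    """Split one mom_log line into its six fields, or return None.

    The message is the last field and may itself contain ';', so the
    split is limited to the first five separators.
    """
    parts = line.rstrip("\n").split(";", 5)
    if len(parts) != 6:
        return None
    timestamp, code, daemon, kind, name, message = parts
    return MomLogEntry(timestamp, code, daemon.strip(), kind, name, message)


def failed_task_starts(lines):
    """Return the entries whose message reports a stdio setup failure."""
    hits = []
    for line in lines:
        entry = parse_mom_log_line(line)
        if entry is not None and "stdio setup failed" in entry.message:
            hits.append(entry)
    return hits
```

Passing an open mom_log file object to failed_task_starts() returns one entry per failed spawn, each with its timestamp and job id, which makes it easier to see which nodes are affected.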

Olivier.

--
   Olivier LAHAYE
   CEA DRT/LIST/DCSI/DIR