[Mauiusers] Maui killing job for resource violation, can't find violation
Jim Kusznir
jkusznir at gmail.com
Wed May 28 13:16:16 MDT 2008
Hi all:
I have a user whose jobs keep getting killed after submission.
Here's the Maui log (grepped for his job ID):
http://pastebin.ca/1032376
Here's the qsub script in use:
##### declare a name for this job to be sample_job
#PBS -N MM5_mpi_job
##### request # of nodes
#PBS -l nodes=8
##### request hours and minutes of cpu time
#PBS -l cput=08:00:00
##### request hours and minutes of Wall time
#PBS -l walltime=08:00:00
##### mail is sent to you when the job starts and when it terminates or aborts
#PBS -m bea
##### specify your email address
#PBS -M aeoluswsueecs at yahoo.com
# By default, PBS scripts execute in your home directory, not the
# directory from which they were submitted. The following line
# places you in the directory from which the job was submitted.
cd $PBS_O_WORKDIR
# run the program
#/path_to_executable/program_name arg1 arg2 ...
mpirun /mnt/pvfs2/ramos/MM5/Run/mm5.exe
exit(0)
---------------
My interpretation of what's happening: Maui sees the request for 8
CPUs (nodes=8) and schedules accordingly. It then notices that the job
is using more than 1 CPU and kills it.
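One thing I can't rule out is the cput request itself. If cput is counted across all processes in the job (an assumption on my part; I have not confirmed that this is what triggers the kill), and if mpirun really starts 8 processes that each keep a CPU busy (also an assumption), then the 8-hour cput request would be used up long before the 8-hour walltime. A rough sketch of that arithmetic, with variable names made up for illustration:

```shell
#!/bin/sh
# Hypothetical sanity check, not taken from the Maui log.
# Assumes cput is summed over all processes in the job and that
# each MPI process keeps one CPU fully busy.
procs=8              # from "#PBS -l nodes=8" (assumes one process per node)
cput_limit_h=8       # from "#PBS -l cput=08:00:00"
walltime_limit_h=8   # from "#PBS -l walltime=08:00:00"

# Wall-clock minutes until the summed CPU time reaches the cput request.
minutes_to_cput_limit=$(( cput_limit_h * 60 / procs ))
walltime_limit_min=$(( walltime_limit_h * 60 ))

echo "cput request reached after about ${minutes_to_cput_limit} minutes of wall time"
echo "walltime request is ${walltime_limit_min} minutes"
```

If the job is being killed at roughly the wall-clock time this prints, the cput line would be the first thing to look at; if not, this guess is probably wrong.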
I've tried a simpler script, but it made no difference. Any
suggestions?
Thanks!
--Jim