[CSC 435] Watch those job submissions
Andrew J. Pounds
pounds_aj at mercer.edu
Wed Apr 1 03:11:56 EDT 2026
I had to kill a few jobs this everning because they were hung or were
running on henry. Remember -- don't run on henry -- just submit jobs
from there. As far as hung jobs there is a safety net built in to
PBS/Torque for this -- the amount of runtime.
In your PBS script you can set the maximum amount of walltime.
#PBS -l nodes=1:lab218:ppn=10
#PBS -l walltime=2:00:00
of
#PBS -l nodes=1:lab218:ppn=10,walltime=2:00:00
Will set the amount of time your job is allowed to run to 2 hours. On
these problems I can't imagine your benchmarking times taking more than,
say, 6 hours per test -- but some of you have set your times to multiple
days. Protect yourself and keep those times short so if you do have a
problem you are not waiting for multiple days to discover the problem.
To give context, I ran 200 benchmarking OpenBLAS jobs in under 10 minutes.
--
*/Andrew J. Pounds, Ph.D./*
/Professor of Chemistry and Computer Science/
/Director of the Computational Science Program/
/Mercer University/
/1501 Mercer University Drive, Macon, GA 31207 /
/(478) 301-5627/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://theochem.mercer.edu/pipermail/csc435/attachments/20260401/28be02b2/attachment.htm>
More information about the csc435
mailing list