[CSC 435] Watch those job submissions

Andrew J. Pounds pounds_aj at mercer.edu
Wed Apr 1 03:11:56 EDT 2026


I had to kill a few jobs this everning because they were hung or were 
running on henry.  Remember -- don't run on henry -- just submit jobs 
from there.  As far as hung jobs there is a safety net built in to 
PBS/Torque for this -- the amount of runtime.

In your PBS script you can set the maximum amount of walltime.

#PBS -l nodes=1:lab218:ppn=10
#PBS -l walltime=2:00:00

of

#PBS -l nodes=1:lab218:ppn=10,walltime=2:00:00

Will set the amount of time your job is allowed to run to 2 hours.  On 
these problems I can't imagine your benchmarking times taking more than, 
say, 6 hours per test -- but some of you have set your times to multiple 
days.  Protect yourself and keep those times short so if you do have a 
problem you are not waiting for multiple days to discover the problem.

To give context, I ran 200 benchmarking OpenBLAS jobs in under 10 minutes.


-- 
*/Andrew J. Pounds, Ph.D./*
/Professor of Chemistry and Computer Science/
/Director of the Computational Science Program/
/Mercer University/
/1501 Mercer University Drive, Macon, GA 31207 /
/(478) 301-5627/
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://theochem.mercer.edu/pipermail/csc435/attachments/20260401/28be02b2/attachment.htm>


More information about the csc435 mailing list