[CSC 435] OpenBLAS timing.
Andrew J. Pounds
pounds_aj at mercer.edu
Mon Apr 23 15:48:48 EDT 2018
I think some of you are confused about the timing project. Everything
is done with 9000x9000 matrices.
You need to use *ONE* system that is in the ithaca cluster for...
1. GPU timing with the CudaBLAS (run 10 times in CUDA)
2. Use your own OpenMP Matrix Multiply code run on 1-8 threads (10 times
each)
Then on HAMMER
3. Use your own OpenMP Matrix Multiply code run on 1-40 cores (10 times
each)
4. Using the OpenBLAS Matrix Multiply code run on 1-40 cores (10 times each)
You are welcome to use the batch job code I gave you. I class tomorrow
maybe we will have time to complete some of these items.
--
Andrew J. Pounds, Ph.D. (pounds_aj at mercer.edu)
Professor of Chemistry and Computer Science
Mercer University, Macon, GA 31207 (478) 301-5627
http://faculty.mercer.edu/pounds_aj
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://theochem.mercer.edu/pipermail/csc435/attachments/20180423/d04440d0/attachment.html>
More information about the csc435
mailing list