TY - GEN
T1 - Dwarfs on accelerators
T2 - 47th International Conference on Parallel Processing, ICPP 2018
AU - Johnston, Beau
AU - Milthorpe, Josh
N1 - Publisher Copyright:
© 2018 Association for Computing Machinery.
PY - 2018/8/13
Y1 - 2018/8/13
N2 - For reasons of both performance and energy efficiency, high performance computing (HPC) hardware is becoming increasingly hetero-geneous. The OpenCL framework supports portable programming across a wide range of computing devices and is gaining influence in programming next-generation accelerators. To characterize the performance of these devices across a range of applications requires a diverse, portable and configurable benchmark suite, and OpenCL is an attractive programming model for this purpose. We present an extended and enhanced version of the Open-Dwarfs OpenCL benchmark suite, with a strong focus placed on the robustness of applications, curation of additional benchmarks with an increased emphasis on correctness of results and choice of problem size. Preliminary results and analysis are reported for eight benchmark codes on a diverse set of architectures - three Intel CPUs, five NVIDIA CPUs, six AMD CPUs and a Xeon Phi.
AB - For reasons of both performance and energy efficiency, high performance computing (HPC) hardware is becoming increasingly hetero-geneous. The OpenCL framework supports portable programming across a wide range of computing devices and is gaining influence in programming next-generation accelerators. To characterize the performance of these devices across a range of applications requires a diverse, portable and configurable benchmark suite, and OpenCL is an attractive programming model for this purpose. We present an extended and enhanced version of the Open-Dwarfs OpenCL benchmark suite, with a strong focus placed on the robustness of applications, curation of additional benchmarks with an increased emphasis on correctness of results and choice of problem size. Preliminary results and analysis are reported for eight benchmark codes on a diverse set of architectures - three Intel CPUs, five NVIDIA CPUs, six AMD CPUs and a Xeon Phi.
UR - http://www.scopus.com/inward/record.url?scp=85054873813&partnerID=8YFLogxK
U2 - 10.1145/3229710.3229729
DO - 10.1145/3229710.3229729
M3 - Conference contribution
SN - 9781450365239
T3 - ACM International Conference Proceeding Series
BT - 47th International Conference on Parallel Processing, ICPP 2018
PB - Association for Computing Machinery
Y2 - 13 August 2018 through 16 August 2018
ER -