TY - JOUR
T1 - Honing and proofing Astrophysical codes on the road to Exascale. Experiences from code modernization on many-core systems
AU - Cielo, Salvatore
AU - Iapichino, Luigi
AU - Baruffa, Fabio
AU - Bugli, Matteo
AU - Federrath, Christoph
N1 - Publisher Copyright:
© 2020 Elsevier B.V.
PY - 2020/11
Y1 - 2020/11
N2 - The complexity of modern and upcoming computing architectures poses severe challenges for code developers and application specialists, and forces them to expose the highest possible degree of parallelism, in order to make the best use of the available hardware. The Intel® Xeon Phi™ of second generation (code-named Knights Landing, henceforth KNL) is the latest many-core system, which implements several interesting hardware features like for example a large number of cores per node (up to 72), the 512 bits-wide vector registers and the high-bandwidth memory. The unique features of KNL make this platform a powerful testbed for modern HPC applications. The performance of codes on KNL is therefore a useful proxy of their readiness for future architectures. In this work we describe the lessons learnt during the optimization of the widely used codes for computational astrophysics P-GADGET3, FLASH and ECHO. Moreover, we present results for the visualization and analysis tools VISIT and yt. These examples show that modern architectures benefit from code optimization at different levels, even more than traditional multi-core systems. However, the level of modernization of typical community codes still needs improvements, for them to fully utilize resources of novel architectures.
AB - The complexity of modern and upcoming computing architectures poses severe challenges for code developers and application specialists, and forces them to expose the highest possible degree of parallelism, in order to make the best use of the available hardware. The Intel® Xeon Phi™ of second generation (code-named Knights Landing, henceforth KNL) is the latest many-core system, which implements several interesting hardware features like for example a large number of cores per node (up to 72), the 512 bits-wide vector registers and the high-bandwidth memory. The unique features of KNL make this platform a powerful testbed for modern HPC applications. The performance of codes on KNL is therefore a useful proxy of their readiness for future architectures. In this work we describe the lessons learnt during the optimization of the widely used codes for computational astrophysics P-GADGET3, FLASH and ECHO. Moreover, we present results for the visualization and analysis tools VISIT and yt. These examples show that modern architectures benefit from code optimization at different levels, even more than traditional multi-core systems. However, the level of modernization of typical community codes still needs improvements, for them to fully utilize resources of novel architectures.
KW - Astrophysics
KW - Intel Xeon Phi
KW - KNL
KW - Performance optimization
KW - Visualization
UR - http://www.scopus.com/inward/record.url?scp=85084941923&partnerID=8YFLogxK
U2 - 10.1016/j.future.2020.05.003
DO - 10.1016/j.future.2020.05.003
M3 - Article
SN - 0167-739X
VL - 112
SP - 93
EP - 107
JO - Future Generation Computer Systems
JF - Future Generation Computer Systems
ER -