TY - GEN
T1 - High-Performance, Accurate Large-Scale Quantum Chemistry Calculations on GPU Supercomputers using Coulomb-Perturbed Fragmentation
AU - Kazemian, Fazeleh S.
AU - Galvez Vallejo, Jorge L.
AU - Barca, Giuseppe M.J.
N1 - Publisher Copyright:
© 2024 Owner/Author.
PY - 2024/8/12
Y1 - 2024/8/12
N2 - Predicting the chemico-physical properties of large molecular systems is a formidable challenge in chemistry and materials science. Traditional quantum mechanical methods, while accurate, have impractical scaling for large molecules with thousands of atoms, which are crucial in the development of novel therapeutics, catalysts, and nanomaterials. To address this, molecular fragmentation algorithms have been proposed to improve scalability and enable extensive parallelism. In this article, we introduce a significant enhancement to the Fragment Molecular Orbital (FMO) method, termed the Coulomb-Perturbed Fragmentation (CPF) method. CPF incorporates algorithmic improvements and implementation enhancements to optimize performance on heterogeneous computing systems equipped with a large number of GPUs. Key developments include a significant simplification of iteratitve self-consistent field (SCF) algorithm, advanced data management through a one-sided communication model, topology-aware optimizations, and a hybrid communication strategy for intra-group exchanges. Moreover, CPF integrates a distributed dynamic multi-layer load balancing scheme to optimise fragment distribution and workload management across nodes and GPUs. Performance evaluations on a 420-atom benzene molecule system comprising 35 fragments reveal that CPF outperforms existing GPU/CPU-based FMO algorithms in both efficiency and accuracy. When deployed on the Gadi supercomputer, CPF achieves over 97% parallel efficiency on 20 nodes, with scalability maintaining above 98% and 90% efficiency in weak-scaling tests for smaller and larger systems, respectively. Notably, CPF matches or exceeds the computational accuracy of conventional FMO methods, marking a substantial progress in the field of computational chemistry for fragmentation-based large-scale molecular modelling.
AB - Predicting the chemico-physical properties of large molecular systems is a formidable challenge in chemistry and materials science. Traditional quantum mechanical methods, while accurate, have impractical scaling for large molecules with thousands of atoms, which are crucial in the development of novel therapeutics, catalysts, and nanomaterials. To address this, molecular fragmentation algorithms have been proposed to improve scalability and enable extensive parallelism. In this article, we introduce a significant enhancement to the Fragment Molecular Orbital (FMO) method, termed the Coulomb-Perturbed Fragmentation (CPF) method. CPF incorporates algorithmic improvements and implementation enhancements to optimize performance on heterogeneous computing systems equipped with a large number of GPUs. Key developments include a significant simplification of iteratitve self-consistent field (SCF) algorithm, advanced data management through a one-sided communication model, topology-aware optimizations, and a hybrid communication strategy for intra-group exchanges. Moreover, CPF integrates a distributed dynamic multi-layer load balancing scheme to optimise fragment distribution and workload management across nodes and GPUs. Performance evaluations on a 420-atom benzene molecule system comprising 35 fragments reveal that CPF outperforms existing GPU/CPU-based FMO algorithms in both efficiency and accuracy. When deployed on the Gadi supercomputer, CPF achieves over 97% parallel efficiency on 20 nodes, with scalability maintaining above 98% and 90% efficiency in weak-scaling tests for smaller and larger systems, respectively. Notably, CPF matches or exceeds the computational accuracy of conventional FMO methods, marking a substantial progress in the field of computational chemistry for fragmentation-based large-scale molecular modelling.
KW - GPU
KW - molecular fragmentation
KW - quantum chemistry
UR - http://www.scopus.com/inward/record.url?scp=85202452199&partnerID=8YFLogxK
U2 - 10.1145/3673038.3673087
DO - 10.1145/3673038.3673087
M3 - Conference contribution
AN - SCOPUS:85202452199
T3 - ACM International Conference Proceeding Series
SP - 1092
EP - 1102
BT - 53rd International Conference on Parallel Processing, ICPP 2024 - Main Conference Proceedings
PB - Association for Computing Machinery (ACM)
T2 - 53rd International Conference on Parallel Processing, ICPP 2024
Y2 - 12 August 2024 through 15 August 2024
ER -