TY - JOUR
T1 - High-Performance, Graphics Processing Unit-Accelerated Fock Build Algorithm
AU - Barca, Giuseppe M.J.
AU - Galvez-Vallejo, Jorge L.
AU - Poole, David L.
AU - Rendell, Alistair P.
AU - Gordon, Mark S.
N1 - Publisher Copyright:
©
PY - 2020/12/8
Y1 - 2020/12/8
N2 - We present a high-performance, GPU (graphics processing unit)-accelerated algorithm for building the Fock matrix. The algorithm is designed for efficient calculations on large molecular systems and uses a novel dynamic load balancing scheme that maximizes the GPU throughput and avoids thread divergence that could occur due to integral screening. Additionally, the code adopts a novel ERI digestion algorithm that exploits all forms of permutational symmetry, combines efficiently the evaluation of both Coulomb and exchange terms together, and eliminates explicit thread synchronization requirements. Performance results obtained using a number of large molecules reveal remarkable speedups up to 24.4× with respect to the QUICK GPU code and up to 237× with respect to the GAMESS CPU parallel code.
AB - We present a high-performance, GPU (graphics processing unit)-accelerated algorithm for building the Fock matrix. The algorithm is designed for efficient calculations on large molecular systems and uses a novel dynamic load balancing scheme that maximizes the GPU throughput and avoids thread divergence that could occur due to integral screening. Additionally, the code adopts a novel ERI digestion algorithm that exploits all forms of permutational symmetry, combines efficiently the evaluation of both Coulomb and exchange terms together, and eliminates explicit thread synchronization requirements. Performance results obtained using a number of large molecules reveal remarkable speedups up to 24.4× with respect to the QUICK GPU code and up to 237× with respect to the GAMESS CPU parallel code.
UR - http://www.scopus.com/inward/record.url?scp=85096624713&partnerID=8YFLogxK
U2 - 10.1021/acs.jctc.0c00768
DO - 10.1021/acs.jctc.0c00768
M3 - Article
SN - 1549-9618
VL - 16
SP - 7232
EP - 7238
JO - Journal of Chemical Theory and Computation
JF - Journal of Chemical Theory and Computation
IS - 12
ER -