Implementation and performance of scalable scientific library subroutines on Fujitsu's VPP500 parallel-vector supercomputer

R. Brent*, A. Cleary, M. Hegland, J. Jenkinson, Z. Leyk, M. Osborne, P. Price, S. Roberts, D. Singleton, M. Nakanishi

*Corresponding author for this work

Research output: Contribution to conferencePaperpeer-review

Abstract

We report progress to date on our project to implement high-impact scientific subroutines on Fujitsu's parallel-vector VPP500. Areas covered in the project are generally between the level of basic building blocks and complete applications, including such things as random number generators, fast Fourier transforms, various linear equation solvers, and eigenvalue solvers. Highlights so far include a suite of fast Fourier transform methods with extensive functionality and performance of approximately one third of peak; a parallel random number generator guaranteed to not repeat sequences on different processors, yet reproducible over separate runs, that produces randoms in 2.2 machine cycles; and a Gaussian elimination code that has achieved over a Gflop per processor for 32 processors of the VPP500, and 124.5 Gflops total on the Fujitsu-built Numerical Wind Tunnel, a machine very similar architecturally to the VPP500.

Original languageEnglish
Pages526-533
Number of pages8
Publication statusPublished - 1994
EventProceedings of the Scalable High-Performance Computing Conference - Knoxville, TN, USA
Duration: 23 May 199425 May 1994

Conference

ConferenceProceedings of the Scalable High-Performance Computing Conference
CityKnoxville, TN, USA
Period23/05/9425/05/94

Fingerprint

Dive into the research topics of 'Implementation and performance of scalable scientific library subroutines on Fujitsu's VPP500 parallel-vector supercomputer'. Together they form a unique fingerprint.

Cite this