A comparison of lookahead and algorithmic blocking techniques for parallel matrix factorization

P. E. Strazdins*

*Corresponding author for this work

    Research output: Contribution to journalArticlepeer-review

    20 Citations (Scopus)

    Abstract

    This article analyses and compares the techniques of algorithmic blocking and storage blocking with lookahead for distributed memory LU, LLT, and QR factorizations. Concepts and some useful properties of a simplified model of lookahead are explored. Issues in the implementation of lookahead are discussed, which are more involved for the case of LLT and QR factorizations. The article also explains how hybrid algorithmic blocking and lookahead techniques can be implemented. Results, given on the Fujitsu AP1000 and AP+ multicomputers, indicate that both methods are superior to storage blocking, and that the hybrid method is optimal for smaller matrices, due to savings in communication startups. For larger matrices, algorithmic blocking gave the best performance (excepting LLT for the AP+), due to its better load-balancing properties. Performance models, predicting the minimum matrix size where lookahead becomes effective, indicate this trend can be expected for machines with lower communication-to-computation speeds, but that the range for where lookahead is superior is extended.

    Original languageEnglish
    Pages (from-to)26-35
    Number of pages10
    JournalInternational Journal of Parallel and Distributed Systems and Networks
    Volume4
    Issue number1
    Publication statusPublished - 2001

    Fingerprint

    Dive into the research topics of 'A comparison of lookahead and algorithmic blocking techniques for parallel matrix factorization'. Together they form a unique fingerprint.

    Cite this