A robust technique to make a 2D advection solver tolerant to soft faults

Peter Strazdins, Brendan Harding, Chung Lee, Jackson R. Mayo, Jaideep Ray, Robert C. Armstrong

    Research output: Contribution to journalConference articlepeer-review

    2 Citations (Scopus)

    Abstract

    We present a general technique to solve Partial Differential Equations, called robust stencils, which make them tolerant to soft faults, i.e. bit flips arising in memory or CPU calculations. We show how it can be applied to a two-dimensional Lax-Wendroff solver. The resulting 2D robust stencils are derived using an orthogonal application of their 1D counterparts. Combinations of 3 to 5 base stencils can then be created. We describe how these are then implemented in a parallel advection solver. Various robust stencil combinations are explored, representing tradeoff between performance and robustness. The results indicate that the 3-stencil robust combinations are slightly faster on large parallel workloads than Triple Modular Redundancy (TMR). They also have one third of the memory footprint. We expect the improvement to be significant if suitable optimizations are performed. Because faults are avoided each time new points are computed, the proposed stencils are also comparably robust to faults as TMR for a large range of error rates. The technique can be generalized to 3D (or higher dimensions) with similar benefits.

    Original languageEnglish
    Pages (from-to)1917-1926
    Number of pages10
    JournalProcedia Computer Science
    Volume80
    DOIs
    Publication statusPublished - 2016
    EventInternational Conference on Computational Science, ICCS 2016 - San Diego, United States
    Duration: 6 Jun 20168 Jun 2016

    Fingerprint

    Dive into the research topics of 'A robust technique to make a 2D advection solver tolerant to soft faults'. Together they form a unique fingerprint.

    Cite this