Computer performance microscopy with SHIM

Xi Yang, Stephen M. Blackburn, Kathryn S. McKinley

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    16 Citations (Scopus)

    Abstract

    Developers and architects spend a lot of time trying to understand and eliminate performance problems. Unfortunately, the root causes of many problems occur at a fine granularity that existing continuous profiling and direct measurement approaches cannot observe. This paper presents the design and implementation of Shim, a continuous profiler that samples at resolutions as fine as 15 cycles; three to five orders of magnitude finer than current continuous profilers. Shim's fine-grain measurements reveal new behaviors, such as variations in instructions per cycle (IPC) within the execution of a single function. A Shim observer thread executes and samples autonomously on unutilized hardware. To sample, it reads hardware performance counters and memory locations that store software state. Shim improves its accuracy by automatically detecting and discarding samples affected by measurement skew. We measure Shim's observer effects and show how to analyze them. When on a separate core, Shim can continuously observe one software signal with a 2% overhead at a ∼1200 cycle resolution. At an overhead of 61%, Shim samples one software signal on the same core with SMT at a ∼15 cycle resolution. Modest hardware changes could significantly reduce overheads and add greater analytical capability to Shim. We vary prefetching and DVFS policies in case studies that show the diagnostic power of fine-grain IPC and memory bandwidth results. By repurposing existing hardware, we deliver a practical tool for fine-grain performance microscopy for developers and architects.

    Original languageEnglish
    Title of host publicationISCA 2015 - 42nd Annual International Symposium on Computer Architecture, Conference Proceedings
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages170-184
    Number of pages15
    ISBN (Electronic)9781450334020
    DOIs
    Publication statusPublished - 13 Jun 2015
    Event42nd Annual International Symposium on Computer Architecture, ISCA 2015 - Portland, United States
    Duration: 13 Jun 201517 Jun 2015

    Publication series

    NameProceedings - International Symposium on Computer Architecture
    Volume13-17-June-2015
    ISSN (Print)1063-6897

    Conference

    Conference42nd Annual International Symposium on Computer Architecture, ISCA 2015
    Country/TerritoryUnited States
    CityPortland
    Period13/06/1517/06/15

    Fingerprint

    Dive into the research topics of 'Computer performance microscopy with SHIM'. Together they form a unique fingerprint.

    Cite this