Portable performance on asymmetric multicore processors

Ivan Jibaja, Ting Cao, Stephen M. Blackburn, Kathryn S. McKinley

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    21 Citations (Scopus)

    Abstract

    Static and dynamic power constraints are steering chip manufacturers to build single-ISA Asymmetric Multicore Processors (AMPs) with big and small cores. To deliver on their energy efficiency potential, schedulers must consider core sensitivity, load balance, and the critical path. Applying these criteria effectively is challenging especially for complex and non-scalable multithreaded applications. We demonstrate that runtimes for managed languages, which are now ubiquitous, provide a unique opportunity to abstract over AMP complexity and inform scheduling with rich semantics such as thread priorities, locks, and parallelism-information not directly available to the hardware, OS, or application. We present the WASH AMP scheduler, which (1) automatically identifies and accelerates critical threads in concurrent, but non-scalable applications; (2) respects thread priorities; (3) considers core availability and thread sensitivity; and (4) proportionally schedules threads on big and small cores to optimize performance and energy. We introduce new dynamic analyses that identify critical threads and classify applications as sequential, scalable, or non-scalable. Compared to prior work, WASH improves performance by 20% and energy by 9% or more on frequency-scaled AMP hardware (not simulation). Performance advantages grow to 27% when asymmetry increases. Performance advantages are robust to a complex multithreaded adversary independently scheduled by the OS. WASH effectively identifies and optimizes a wider class of workloads than prior work.

    Original languageEnglish
    Title of host publicationProceedings of the 14th International Symposium on Code Generation and Optimization, CGO 2016
    PublisherAssociation for Computing Machinery, Inc
    Pages24-35
    Number of pages12
    ISBN (Electronic)9781450337786
    DOIs
    Publication statusPublished - 29 Feb 2016
    Event14th Annual IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2016 - Barcelona, Spain
    Duration: 12 Mar 201618 Mar 2016

    Publication series

    NameProceedings of the 14th International Symposium on Code Generation and Optimization, CGO 2016

    Conference

    Conference14th Annual IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2016
    Country/TerritorySpain
    CityBarcelona
    Period12/03/1618/03/16

    Fingerprint

    Dive into the research topics of 'Portable performance on asymmetric multicore processors'. Together they form a unique fingerprint.

    Cite this