A simple performance model for multithreaded applications executing on non-uniform memory access computers

R. Yang*, J. Antony, A. P. Rendell

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

    12 Citations (Scopus)

    Abstract

    In this work, we extend and evaluate a simple performance model to account for NUMA and bandwidth effects for single and multi-threaded calculations within the Gaussian 03 computational chemistry code on a contemporary multi-core, NUMA platform. By using the thread and memory placement APIs in Solaris, we present results for a set of calculations from which we analyze on-chip interconnect and intra-core bandwidth contention and show the importance of load-balancing between threads. The extended model predicts single threaded performance to within 1% errors and most multi-threaded experiments within 15% errors. Our results and modeling shows that accounting for bandwidth constraints within user-space code is beneficial.

    Original languageEnglish
    Title of host publication2009 11th IEEE International Conference on High Performance Computing and Communications, HPCC 2009
    Pages79-86
    Number of pages8
    DOIs
    Publication statusPublished - 2009
    Event11th IEEE International Conference on High Performance Computing and Communications, HPCC 2009 - Seoul, Korea, Republic of
    Duration: 25 Jun 200927 Jun 2009

    Publication series

    Name2009 11th IEEE International Conference on High Performance Computing and Communications, HPCC 2009

    Conference

    Conference11th IEEE International Conference on High Performance Computing and Communications, HPCC 2009
    Country/TerritoryKorea, Republic of
    CitySeoul
    Period25/06/0927/06/09

    Fingerprint

    Dive into the research topics of 'A simple performance model for multithreaded applications executing on non-uniform memory access computers'. Together they form a unique fingerprint.

    Cite this