Composing multiple StarPU applications over heterogeneous machines: a supervised approach

Andra-Ecaterina Hugo 1, 2 Abdou Guermouche 2, 3 Raymond Namyst 1, 2 Pierre-André Wacrenier 1, 2
1 RUNTIME - Efficient runtime systems for parallel architectures
Inria Bordeaux - Sud-Ouest, UB - Université de Bordeaux, CNRS - Centre National de la Recherche Scientifique : UMR5800
3 HiePACS - High-End Parallel Algorithms for Challenging Numerical Simulations
LaBRI - Laboratoire Bordelais de Recherche en Informatique, Inria Bordeaux - Sud-Ouest
Abstract : Enabling HPC applications to perform efficiently when invoking multiple parallel libraries simultaneously is a great challenge. Even if a single runtime system is used underneath, scheduling tasks or threads coming from different libraries over the same set of hardware resources introduces many issues, such as resource oversubscription, undesirable cache flushes or memory bus contention. This paper presents an extension of StarPU, a runtime system specifically designed for heterogeneous architectures, that allows multiple parallel codes to run concurrently with minimal interference. Such parallel codes run within scheduling contexts that provide confined execution environments which can be used to partition computing resources. Scheduling contexts can be dynamically resized to optimize the allocation of computing resources among concurrently running libraries. We introduce a hypervisor that automatically expands or shrinks contexts using feedback from the runtime system (e.g. resource utilization). We demonstrate the relevance of our approach using benchmarks invoking multiple high performance linear algebra kernels simultaneously on top of heterogeneous multicore machines. We show that our mechanism can dramatically improve the overall application run time (-34%), most notably by reducing the average cache miss ratio (-50%).

https://hal.inria.fr/hal-00824514
Contributor : Andra-Ecaterina Hugo
Submitted on : Tuesday, May 21, 2013 - 10:09:01 PM
Last modification on : Thursday, January 11, 2018 - 6:22:35 AM
Long-term archiving on : Thursday, August 22, 2013 - 4:08:26 AM

File

PID2692011.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00824514, version 1

Citation

Andra-Ecaterina Hugo, Abdou Guermouche, Raymond Namyst, Pierre-André Wacrenier. Composing multiple StarPU applications over heterogeneous machines: a supervised approach. Third International Workshop on Accelerators and Hybrid Exascale Systems, May 2013, Boston, United States. ⟨hal-00824514⟩
