Skip to Main content Skip to Navigation

Vers des supports d'exécution capables d'exploiter les machines multicœurs hétérogènes

Cédric Augonnet 1, 2
1 RUNTIME - Efficient runtime systems for parallel architectures
Inria Bordeaux - Sud-Ouest, UB - Université de Bordeaux, CNRS - Centre National de la Recherche Scientifique : UMR5800
Abstract : Approaching the theoretical performance of heterogeneous multicore architectures, equipped with specialized accelerators, is a challenging issue. Unlike regular CPUs that can transparently access the whole global memory address range, accelerators usually embed local memory on which they perform all their computations using a specific instruction set. While many research efforts have been devoted to offloading parts of a program over such coprocessors, the real challenge is to find a programming model providing a unified view of all available computing units. In this document, we present an original runtime system providing a high-level, unified execution model allowing seamless execution of tasks over the underlying heterogeneous hardware. The runtime is based on a hierarchical memory management facility and on a codelet scheduler. We demonstrate the efficiency of our solution with a LU decomposition for both homogeneous (3.8 speedup on 4 cores) and heterogeneous machines (95% efficiency). We also show that a "granularity aware" scheduling can improve execution time by 35%.
Complete list of metadata

Cited literature [40 references]  Display  Hide  Download
Contributor : Cédric Augonnet <>
Submitted on : Friday, June 20, 2008 - 1:54:18 PM
Last modification on : Tuesday, February 9, 2021 - 3:12:03 PM
Long-term archiving on: : Friday, September 28, 2012 - 4:20:18 PM


Files produced by the author(s)


  • HAL Id : inria-00289361, version 1



Cédric Augonnet. Vers des supports d'exécution capables d'exploiter les machines multicœurs hétérogènes. [Travaux universitaires] 2008, pp.48. ⟨inria-00289361⟩



Record views


Files downloads