An Efficient OpenMP Runtime System for Hierarchical Architectures

Samuel Thibault (1, 2), François Broquedis (1, 2), Brice Goglin (1, 2), Raymond Namyst (1, 2), Pierre-André Wacrenier (1, 2)
1 RUNTIME - Efficient runtime systems for parallel architectures
INRIA Futurs, Université Sciences et Technologies - Bordeaux 1, École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB), CNRS - Centre National de la Recherche Scientifique : UMR5800
Abstract: Exploiting the full computational power of increasingly deep hierarchical multiprocessor machines requires a very careful distribution of threads and data across the underlying non-uniform architecture. The emergence of multi-core chips and NUMA machines makes it important to minimize the number of remote memory accesses, to favor cache affinities, and to guarantee fast completion of synchronization steps. By using the BubbleSched platform as a threading backend for the GOMP OpenMP compiler, we are able to easily translate the affinities of thread teams into scheduling hints, using abstractions called bubbles. We then propose a scheduling strategy suited to nested OpenMP parallelism. Preliminary performance evaluations show a significant improvement in speedup on a typical NAS OpenMP benchmark application.

https://hal.inria.fr/inria-00154502
Contributor: Samuel Thibault
Submitted on: Wednesday, June 13, 2007 - 6:50:53 PM
Last modification on: Wednesday, May 15, 2019 - 5:24:09 PM
Long-term archiving on: Thursday, April 8, 2010 - 8:08:58 PM

Files

main.pdf
Files produced by the author(s)

Citation

Samuel Thibault, François Broquedis, Brice Goglin, Raymond Namyst, Pierre-André Wacrenier. An Efficient OpenMP Runtime System for Hierarchical Architectures. International Workshop on OpenMP (IWOMP), Jun 2007, Beijing, China. pp. 148-159, ⟨10.1007/978-3-540-69303-1_19⟩. ⟨inria-00154502⟩
