Scheduling Dynamic OpenMP Applications over Multicore Architectures

Approaching the theoretical performance of hierarchical multicore machines requires a very careful distribution of threads and data across the underlying non-uniform architecture in order to minimize cache misses and NUMA penalties. While it is acknowledged that OpenMP can enhance the quality of thread scheduling on such architectures in a portable way, by transmitting valuable information about the affinities between threads and data to the underlying runtime system, most OpenMP runtime systems are actually unable to efficiently support highly irregular, massively parallel applications on NUMA machines. In this paper, we present a thread scheduling policy suited to the execution of OpenMP programs featuring irregular and massive nested parallelism over hierarchical architectures. Our policy enforces a distribution of threads that maximizes the proximity of threads belonging to the same parallel section, and uses a NUMA-aware work stealing strategy when load balancing is needed. It has been developed as a plug-in to the ForestGOMP OpenMP platform. We demonstrate the efficiency of our approach with a highly irregular recursive OpenMP program resulting from the generic parallelization of a surface reconstruction application. We achieve a speedup of 14 on a 16-core machine with no application-level optimization.
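To make "irregular and massive nested parallelism" concrete, here is a minimal sketch in C of a recursive OpenMP function that opens a new parallel region at every level of an unbalanced tree. It is not the surface reconstruction application from the paper: the function name `count_leaves` and the branching rule (1 to 3 children depending on the node id) are assumptions chosen only to produce subtrees of very different sizes.

```c
#include <stdio.h>

/* Hypothetical irregular workload (an assumption for illustration):
 * a node with identifier `id` has (id % 3) + 1 children, so sibling
 * subtrees differ widely in size. Returns the number of leaves. */
static long count_leaves(unsigned id, int depth)
{
    if (depth == 0)
        return 1; /* a leaf counts as one unit of work */

    int children = (int)(id % 3u) + 1;
    long total = 0;

    /* Every recursion level opens a parallel region inside the one
     * opened by its caller: this is nested parallelism. The team size
     * follows the (irregular) number of children. */
    #pragma omp parallel for num_threads(children) reduction(+:total)
    for (int c = 0; c < children; c++)
        total += count_leaves(id * 3u + (unsigned)c + 1u, depth - 1);

    return total;
}
```

With these rules, `count_leaves(2, 2)` returns 6 while `count_leaves(0, 2)` returns 2, although both trees have the same depth. Many OpenMP runtimes serialize inner regions by default; setting the standard `OMP_MAX_ACTIVE_LEVELS` environment variable to a value above 1 is the usual way to let nested regions actually run in parallel. Compiled without OpenMP support, the pragma is ignored and the function computes the same result serially.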

Keywords

OpenMP; Nested Parallelism; Hierarchical Thread Scheduling; Bubbles; Multi-Core; NUMA; SMP

Domains

Distributed, Parallel, and Cluster Computing [cs.DC]

Main file

soumis.pdf (91.02 KB)

Origin: Files produced by the author(s)

Contributor: Samuel Thibault

https://inria.hal.science/inria-00329934

Submitted on: Monday, October 13, 2008, 16:34:46

Last modified on: Wednesday, April 3, 2024, 11:24:09

Long-term archiving on: Tuesday, October 9, 2012, 12:02:48

Dates and versions

inria-00329934, version 1 (13-10-2008)

Identifiers

HAL Id: inria-00329934, version 1
DOI: 10.1007/978-3-540-79561-2_15

Cite

François Broquedis, François Diakhate, Samuel Thibault, Olivier Aumage, Raymond Namyst, et al. Scheduling Dynamic OpenMP Applications over Multicore Architectures. International Workshop on OpenMP, May 2008, West Lafayette, IN, United States. ⟨10.1007/978-3-540-79561-2_15⟩. ⟨inria-00329934⟩

Collections

CEA CNRS INRIA LABRI DAM INRIA2 ANR
