A Compilation and Run-Time Framework for Maximizing Performance of Self-scheduling Algorithms

Yizhuo Wang; Laleh Aghababaie Beni; Alexandru Nicolau; Alexander V. Veidenbaum; Rosario Cammarota

doi:10.1007/978-3-662-44917-2_38

Communication Dans Un Congrès Année : 2014

A Compilation and Run-Time Framework for Maximizing Performance of Self-scheduling Algorithms

(1) , (2) , (2) , (2) , (3)

1
2
3

Yizhuo Wang

Fonction : Auteur
PersonId : 994391

Beijing Institute of Technology

Laleh Aghababaie Beni

Fonction : Auteur

University of California [Irvine]

Alexandru Nicolau

Fonction : Auteur

University of California [Irvine]

Alexander V. Veidenbaum

Fonction : Auteur

University of California [Irvine]

Rosario Cammarota

Fonction : Auteur

Qualcomm Research

Résumé

Ordinary programs contain many parallel loops which account for a significant portion of these programs’ completion time. The parallel executions of such loops can significantly speedup performance of modern multi-core systems. We propose a new framework - Locality Aware Self-scheduling (LASS) - for scheduling parallel loops to multi-core systems and boost up performance of known self-scheduling algorithms in diverse execution conditions. LASS enforces data locality, by forcing the execution of consecutive chunks of iterations to the same core, and favours load balancing with the introduction of a work-stealing mechanism. LASS is evaluated on a set of kernels on a multi-core system with 16 cores. Two execution scenarios are considered. In the first scenario our application runs alone on top of the operating system. In the second scenario our application runs in conjunction with an interfering parallel job. The average speedup achieved by LASS for first execution scenario is 11% and for the second one is 31%.

Mots clés

loop scheduling self-scheduling random forest

Domaines

Informatique [cs]

Fichier principal

978-3-662-44917-2_38_Chapter.pdf (789.95 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Hal Ifip : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01403116

Soumis le : vendredi 25 novembre 2016-14:38:16

Dernière modification le : mardi 29 novembre 2022-12:08:09

Dates et versions

hal-01403116 , version 1 (25-11-2016)

Licence

Paternité

Identifiants

HAL Id : hal-01403116 , version 1
DOI : 10.1007/978-3-662-44917-2_38

Citer

Yizhuo Wang, Laleh Aghababaie Beni, Alexandru Nicolau, Alexander V. Veidenbaum, Rosario Cammarota. A Compilation and Run-Time Framework for Maximizing Performance of Self-scheduling Algorithms. 11th IFIP International Conference on Network and Parallel Computing (NPC), Sep 2014, Ilan, Taiwan. pp.459-470, ⟨10.1007/978-3-662-44917-2_38⟩. ⟨hal-01403116⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

IFIP-LNCS IFIP IFIP-AICT IFIP-TC IFIP-LNCS-8707 IFIP-TC10 IFIP-NPC IFIP-WG10-3

46 Consultations

110 Téléchargements

A Compilation and Run-Time Framework for Maximizing Performance of Self-scheduling Algorithms

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager