Combining reduction with synchronization barrier on multi-core processors - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2022

Combining reduction with synchronization barrier on multi-core processors

Résumé

With the rise of multi-core processors with a large number of cores the need of shared memory reduction that perform efficiently on a large number of core is more pressing. Efficient shared memory reduction on these multi-core processors will help share memory programs being more efficient on these one. In this paper, we propose a reduction combined with barrier method that uses SIMD instructions to combine barriers signaling and reduction value read/write to minimize memory/cache traffic between cores thus, reducing barrier latency. We compare different barriers and reduction methods on three multi-core processors and show that proposed combining barrier/reduction method are 4 and 3.5 times faster than respectively GCC 11.1 and Intel 21.2 OpenMP 4.5 reduction.
Fichier principal
Vignette du fichier
Combining_reduction_with_synchronization_barrier_on_multi_core_processors(8).pdf (10.91 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03948901 , version 1 (16-02-2022)
hal-03948901 , version 2 (14-03-2023)

Licence

Copyright (Tous droits réservés)

Identifiants

  • HAL Id : hal-03948901 , version 1

Citer

Aboul-Karim Mohamed El Maarouf, Luc Giraud, Abdou Guermouche, Thomas Guignon. Combining reduction with synchronization barrier on multi-core processors. 2022. ⟨hal-03948901v1⟩
208 Consultations
138 Téléchargements

Partager

Gmail Facebook X LinkedIn More