Hardware/Software Helper Thread Prefetching On Heterogeneous Many Cores

Bharath Narasimha Swamy; Alain Ketterlin; André Seznec

doi:10.1109/SBAC-PAD.2014.39

Conference paper, Year: 2014



Bharath Narasimha Swamy

Role: Author
PersonId : 961848

Amdahl's Law is Forever

Alain Ketterlin

Role: Author
PersonId : 739664
IdHAL : alainketterlin
IdRef : 058563830

Laboratoire des sciences de l'ingénieur, de l'informatique et de l'imagerie

MULTICORE

Compilation pour les Architectures MUlti-coeurS

André Seznec

Role: Author
PersonId : 13729
IdHAL : andre-seznec
ORCID : 0000-0002-3058-6503
IdRef : 033236402

Amdahl's Law is Forever

Abstract

Heterogeneous Many Cores (HMC) architectures that mix many simple/small cores with a few complex/large cores are emerging as a design alternative that can provide both fast sequential performance for single-threaded workloads and power-efficient execution for throughput-oriented parallel workloads. The availability of many small cores in an HMC presents an opportunity to use them as low-power helper cores to accelerate memory-intensive sequential programs mapped to a large core. However, the latency overhead of accessing small cores in a loosely coupled system limits their utility as helper cores. Also, it is not clear whether small cores can execute helper threads sufficiently far in advance to benefit applications running on a larger, much more powerful core. In this paper, we present a hardware/software framework called core-tethering to support efficient helper threading on heterogeneous many-cores. Core-tethering provides a coprocessor-like interface to the small cores that (a) enables a large core to directly initiate and control helper execution on the helper core and (b) allows efficient transfer of execution context between the cores, thereby reducing the performance overhead of accessing small cores for helper execution. Our evaluation on a set of memory-intensive programs chosen from standard benchmark suites shows that helper threads running on moderately sized small cores can significantly accelerate a larger core compared to using a hardware prefetcher alone. We find that a small core provides a good trade-off against using an equivalent large core to run helper threads in an HMC. Additionally, helper prefetching on small cores, when used along with hardware prefetching, can provide an alternative design point to growing the instruction window size for achieving higher sequential performance on memory-intensive applications.
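As a rough illustration of the helper-prefetching idea the abstract describes, the toy model below counts demand misses for a main thread when a helper touches cache blocks a fixed number of accesses ahead. It is only a sketch under strong assumptions: the names `LRUCache`, `demand_misses`, and `lookahead`, the fully associative LRU cache, and the synthetic trace are all hypothetical and are not taken from the paper. It does not model latency, timeliness, core-tethering, or the cost of transferring execution context between cores.

```python
from collections import OrderedDict


class LRUCache:
    """Toy fully associative LRU cache that stores block addresses only."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.blocks = OrderedDict()

    def access(self, addr):
        """Touch addr. Return True on a hit, False on a miss (the block is then filled)."""
        hit = addr in self.blocks
        if hit:
            self.blocks.move_to_end(addr)  # mark as most recently used
        else:
            self.blocks[addr] = True
            if len(self.blocks) > self.capacity:
                self.blocks.popitem(last=False)  # evict the least recently used block
        return hit


def demand_misses(trace, capacity, lookahead=0):
    """Count main-thread misses on trace.

    With lookahead > 0, a hypothetical helper touches the block that the
    main thread will need `lookahead` accesses later, before each main access.
    """
    cache = LRUCache(capacity)
    misses = 0
    for i, addr in enumerate(trace):
        if lookahead and i + lookahead < len(trace):
            cache.access(trace[i + lookahead])  # helper prefetch, not counted as a demand miss
        if not cache.access(addr):
            misses += 1
    return misses


# Synthetic trace of 500 distinct block addresses with no reuse (an assumption).
trace = [(i * 7919) % 1000 for i in range(500)]
print(demand_misses(trace, capacity=64))               # no helper
print(demand_misses(trace, capacity=64, lookahead=4))  # helper runs 4 accesses ahead
```

In this idealized setting every prefetch arrives in time, so the only demand misses left with the helper are the first `lookahead` accesses. On real hardware the benefit depends on whether the helper core can stay far enough ahead of the large core, which is the question the paper evaluates.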

Domains

Computer Science [cs]


https://inria.hal.science/hal-01087752

Submitted on: Wednesday, November 26, 2014, 15:43:49

Last modified on: Thursday, April 11, 2024, 13:08:14

Dates and versions

hal-01087752 , version 1 (26-11-2014)

Identifiers

HAL Id : hal-01087752 , version 1
DOI : 10.1109/SBAC-PAD.2014.39

Cite

Bharath Narasimha Swamy, Alain Ketterlin, André Seznec. Hardware/Software Helper Thread Prefetching On Heterogeneous Many Cores. 2014 IEEE 26th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), Oct 2014, Paris, France. ⟨10.1109/SBAC-PAD.2014.39⟩. ⟨hal-01087752⟩


Collections

INSTITUT-TELECOM EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES ENGEES IRISA INSA-STRASBOURG IRISA-D3 INRIA2 INC-CNRS UR1-MATH-STIC UR1-UFR-ISTIC SITE-ALSACE UNIV-RENNES INSA-GROUPE UR1-MATH-NUM

277 views

0 downloads
