Automatic Skeleton-Driven Memory Affinity for Transactional Worklist Applications

Luís Fabrício Góes; Christiane Pousa Ribeiro; Marcio Bastos Castro; Jean-François Mehaut; Murray Cole; Marcelo Cintra

doi:10.1007/s10766-013-0253-x

Journal article, International Journal of Parallel Programming, Year: 2014

Luís Fabrício Góes

Role: Author

Pontifical Catholic University of Minas Gerais [Belo Horizonte]

Christiane Pousa Ribeiro

Role: Author
PersonId : 856332

Universität Zürich [Zürich] = University of Zurich

Marcio Bastos Castro

Role: Author

Laboratoire d'Informatique de Grenoble

Jean-François Mehaut

Role: Author
PersonId : 6046
IdHAL : jean-francois-mehaut
ORCID : 0000-0003-1047-7462
IdRef : 086451227

Compiler Optimization and Run-time Systems

Murray Cole

Role: Author

Institute for Computing Systems Architecture School of Informatics - University of Edinburgh

Marcelo Cintra

Role: Author

Institute for Computing Systems Architecture School of Informatics - University of Edinburgh

Abstract

Memory affinity has become a key element in achieving scalable performance on multi-core platforms. Mechanisms such as thread scheduling, page allocation and cache prefetching are commonly employed to enhance memory affinity, which keeps data close to the cores that access it. In particular, software transactional memory (STM) applications exhibit irregular memory access behavior, which makes it harder to determine which data each core will need and when. Additionally, existing STM runtime systems are decoupled from issues such as thread and memory management. In this paper, we therefore propose a skeleton-driven mechanism that improves memory affinity for STM applications fitting the worklist pattern, using a two-level approach. First, it addresses memory affinity at the DRAM level by automatically selecting page allocation policies. Then it employs data-prefetching helper threads to improve affinity at the cache level. The mechanism relies on a skeleton framework to exploit the application pattern in order to provide automatic memory page allocation and cache prefetching. Our experimental results on the STAMP benchmark suite show that the proposed mechanism achieves performance improvements of up to 46%, with an average of 11%, over a baseline version on two NUMA multi-core machines.
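To make the DRAM-level half of the approach concrete, the following is a minimal toy model of why the choice of page allocation policy matters when many cores touch shared data irregularly. It is not the authors' implementation and does not call any real NUMA API. The policy names "bind" and "interleave" are borrowed from common NUMA terminology, and the function names, the 4 KiB page size, and the synthetic access trace are all assumptions made for illustration.

```python
from collections import Counter

PAGE_SIZE = 4096  # bytes; assumed page size for this toy model


def place_pages(num_pages, num_nodes, policy, home_node=0):
    """Return a list mapping each page index to a NUMA node (hypothetical model)."""
    if policy == "bind":
        # Every page lives on a single node.
        return [home_node] * num_pages
    if policy == "interleave":
        # Pages are spread round-robin across all nodes.
        return [page % num_nodes for page in range(num_pages)]
    raise ValueError(f"unknown policy: {policy}")


def remote_accesses(trace, placement):
    """Count accesses whose page lives on a node other than the accessing one.

    trace is a list of (node_of_accessing_core, byte_address) pairs.
    """
    return sum(1 for node, addr in trace if placement[addr // PAGE_SIZE] != node)


def max_node_load(trace, placement):
    """Return the largest number of accesses served by any single node."""
    load = Counter(placement[addr // PAGE_SIZE] for _, addr in trace)
    return max(load.values())


# Synthetic trace: cores on 4 nodes each touch all 8 pages once,
# a crude stand-in for worklist items scattered over shared data.
trace = [(node, page * PAGE_SIZE) for node in range(4) for page in range(8)]

for policy in ("bind", "interleave"):
    placement = place_pages(num_pages=8, num_nodes=4, policy=policy)
    print(policy, remote_accesses(trace, placement), max_node_load(trace, placement))
```

In this synthetic trace both policies incur the same number of remote accesses, but interleaving spreads the requests evenly over the four nodes instead of directing all of them to one. Which policy is preferable depends on the application's access pattern, which is the kind of information the abstract says the skeleton framework exploits.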

Keywords

Memory affinity; Software transactional memory; Parallel algorithmic skeleton; Multi-core platforms

Domains

Distributed, Parallel, and Cluster Computing [cs.DC]

Contributor: Gwenaël Delaval

https://inria.hal.science/hal-00953110

Submitted on: Friday, 28 February 2014, 10:41:19

Last modified on: Thursday, 4 April 2024, 18:20:25

Dates and versions

hal-00953110 , version 1 (28-02-2014)

Identifiers

HAL Id : hal-00953110 , version 1
DOI : 10.1007/s10766-013-0253-x

Cite

Luís Fabrício Góes, Christiane Pousa Ribeiro, Marcio Bastos Castro, Jean-François Mehaut, Murray Cole, et al. Automatic Skeleton-Driven Memory Affinity for Transactional Worklist Applications. International Journal of Parallel Programming, 2014, 42 (2), pp. 365-382. ⟨10.1007/s10766-013-0253-x⟩. ⟨hal-00953110⟩

Collections

UGA CNRS INRIA LIG LIG_SRCPR PERSYVAL-LAB INRIA2 LIG-SRCPR-CORSE POLYTECH-GRENOBLE ANR LIG_SIDCH
