Efficient Code Optimization Technique for Itanium2 Cache System and Scientific Computing - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2003

Efficient Code Optimization Technique for Itanium2 Cache System and Scientific Computing

Résumé

To keep up with a large degree of ILP, Itanium2 L2 cache system uses a complex organization scheme: load/store queues, banking and interleaving. In this paper, we study the impact of this cache system on memory instruction scheduling. We demonstrate that for scientific codes, "memory access vectorization" allows to generate very efficient code (up to the maximum of 4 loads per cycle). The impact of such "vectorization" on register pressure is analyzed: various register allocation schemes are proposed and evaluated.

Domaines

Autre [cs.OH]
Fichier principal
Vignette du fichier
Efficient_Code-2.pdf (178.46 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00647124 , version 1 (01-12-2011)

Identifiants

  • HAL Id : hal-00647124 , version 1

Citer

William Jalby, Christophe Lemuet, Sid Touati. Efficient Code Optimization Technique for Itanium2 Cache System and Scientific Computing. Workshop on Compilers for Parallel Computers, Jan 2003, Amsterdam, Netherlands. ⟨hal-00647124⟩

Collections

CNRS UVSQ
59 Consultations
70 Téléchargements

Partager

Gmail Facebook X LinkedIn More