Skip to Main content Skip to Navigation
Conference papers

Efficient Code Optimization Technique for Itanium2 Cache System and Scientific Computing

Abstract : To keep up with a large degree of ILP, Itanium2 L2 cache system uses a complex organization scheme: load/store queues, banking and interleaving. In this paper, we study the impact of this cache system on memory instruction scheduling. We demonstrate that for scientific codes, "memory access vectorization" allows to generate very efficient code (up to the maximum of 4 loads per cycle). The impact of such "vectorization" on register pressure is analyzed: various register allocation schemes are proposed and evaluated.
Document type :
Conference papers
Complete list of metadata

Cited literature [8 references]  Display  Hide  Download

https://hal.inria.fr/hal-00647124
Contributor : Sid Touati <>
Submitted on : Thursday, December 1, 2011 - 3:14:44 PM
Last modification on : Friday, January 10, 2020 - 3:42:21 PM
Long-term archiving on: : Friday, March 2, 2012 - 2:30:28 AM

File

Efficient_Code-2.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00647124, version 1

Collections

Citation

William Jalby, Christophe Lemuet, Sid Touati. Efficient Code Optimization Technique for Itanium2 Cache System and Scientific Computing. Workshop on Compilers for Parallel Computers, Jan 2003, Amsterdam, Netherlands. ⟨hal-00647124⟩

Share

Metrics

Record views

188

Files downloads

107