Vectorization Technology To Improve Interpreter Performance

Erven Rohou; Kevin Williams; David Yuste

doi:10.1145/2400682.2400685

Journal Articles ACM Transactions on Architecture and Code Optimization Year : 2013

Vectorization Technology To Improve Interpreter Performance

(1) , (1) , (1)

Erven Rohou

Function : Author
PersonId : 176658
IdHAL : erven-rohou
ORCID : 0000-0002-8060-8360
IdRef : 135287065

Amdahl's Law is Forever

Kevin Williams

Function : Author

Amdahl's Law is Forever

David Yuste

Function : Author
PersonId : 885836

Amdahl's Law is Forever

Abstract

In the present computing landscape, interpreters are in use in a wide range of systems. Recent trends in consumer electronics have created a new category of portable, lightweight software applications. Typically, these applications have fast development cycles and short life spans. They run on a wide range of systems and are deployed in a target independent bytecode format over Internet and cellular networks. Their authors are untrusted third-party vendors, and they are executed in secure managed runtimes or virtual machines. Furthermore, due to security policies or development time constraints, these virtual machines often lack just-in-time compilers and rely on interpreted execution. At the other end of the spectrum, interpreters are also a reality in the field of high performance computations because of the flexibility they provide. The main performance penalty in interpreters arises from instruction dispatch. Each bytecode requires a minimum number of machine instructions to be executed. In this work, we introduce a novel approach for interpreter optimization that reduces instruction dispatch thanks to vectorization technology. We extend the split-compilation paradigm to interpreters, thus guaranteeing that our approach exhibits almost no overhead at run time.We take advantage of the vast research in vectorization and its presence in modern compilers. Complex analyses are performed ahead of time, and their results are conveyed to the executable bytecode. At run time, the interpreter retrieves this additional information to build the SIMD IR (intermediate representation) instructions that carry the vector semantics. The bytecode language remains unmodified, making this representation compatible with legacy interpreters, and previously proposed JIT compilers. We show that this approach drastically reduces the number of instructions to interpret and decreases execution time of vectorizable applications. Moreover, we map SIMD IR instructions to hardware SIMD instructions when available, with a substantial additional improvement. Finally, we finely analyze the impact of our extension on the behavior of the caches and branch predictors.

Keywords

Interpreters SIMD bytecode dispatch

Domains

Other [cs.OH]

Erven Rohou : Connect in order to contact the contributor

https://inria.hal.science/hal-00747072

Submitted on : Tuesday, October 30, 2012-2:27:36 PM

Last modification on : Friday, March 24, 2023-2:52:56 PM

Dates and versions

hal-00747072 , version 1 (30-10-2012)

Identifiers

HAL Id : hal-00747072 , version 1
DOI : 10.1145/2400682.2400685

Cite

Erven Rohou, Kevin Williams, David Yuste. Vectorization Technology To Improve Interpreter Performance. ACM Transactions on Architecture and Code Optimization, 2013, 9 (4), pp.26:1-26:22. ⟨10.1145/2400682.2400685⟩. ⟨hal-00747072⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA IRISA-D3 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

241 View

0 Download

Vectorization Technology To Improve Interpreter Performance

Abstract

Keywords

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Altmetric

Share