Challenges in Binary Translation for Desktop Supercomputing
Abstract
Because the microprocessor industry has abandoned frequency scaling due to power constraints, the development of aggressive many-core systems has accelerated in recent years. These many-core systems have taken a range of forms: graphics processors (GPUs) have emerged as the data-parallel architecture of choice; heterogeneous architectures (e.g., AMD Fusion and IBM Cell) have appeared, though they present challenges to code developers and compiler writers; and general-purpose CPUs continue to add cores following Moore's Law, though they lack the data-parallel horsepower of the other architectural styles. These many-core systems come equipped with a range of programming languages, runtime environments, and compiler frameworks. In the case of GPUs, the architecture of the programmable shader core continues to evolve as these graphics-oriented processors are adapted to the requirements of the scientific computing community. For this reason, both AMD and NVIDIA have defined intermediate representations (IRs) as part of their offerings. The main role of an IR is to provide a stable instruction set architecture that spans multiple microarchitecture generations, similar to the concept of bytecodes in Java. For AMD/ATI GPUs, the IR is called the Intermediate Language (IL); for NVIDIA GPUs, it is called Parallel Thread Execution (PTX). Our work explores the fundamental differences between PTX and IL and the benefits of each IR. We are enhancing a binary translation framework to translate PTX into IL, allowing applications compiled with NVIDIA's C for CUDA environment for NVIDIA GPUs to run on AMD/ATI GPUs. We report on our results to date and shed light on the challenges that lie before us.
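To make the contrast between the two IRs concrete, the sketch below shows how a single CUDA source statement might lower to each representation. The PTX follows the documented NVIDIA PTX syntax; the IL line is an illustrative assumption based on AMD IL's four-component vector register model, and register names in both snippets are hypothetical.

```
// CUDA source: c[i] = a[i] + b[i];  (float)

// PTX: scalar, typed instructions operating on virtual registers
ld.global.f32  %f1, [%rd1];      // load a[i]
ld.global.f32  %f2, [%rd2];      // load b[i]
add.f32        %f3, %f1, %f2;    // scalar single-precision add
st.global.f32  [%rd3], %f3;      // store c[i]

// AMD IL (illustrative): registers are float4 vectors, so one
// instruction operates on all four components at once
add r3, r1, r2                   // vector add across .xyzw lanes
```

This scalar-versus-vector register model is one example of the representational gap a PTX-to-IL binary translator must bridge.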
Domains
Hardware Architecture [cs.AR]
Origin: Files produced by the author(s)