Break Down GPU Execution Time with an Analytical Method

Junjie Lai; André Seznec

doi:10.1145/2162131.2162136

Conference paper, year: 2012

Junjie Lai

Role: Author
PersonId : 913983

Amdahl's Law is Forever

André Seznec

Role: Author
PersonId : 13729
IdHAL : andre-seznec
ORCID : 0000-0002-3058-6503
IdRef : 033236402

Amdahl's Law is Forever

Abstract

Modern GPGPUs provide significant computing power and very high memory bandwidth, and developer-friendly programming interfaces such as CUDA have been introduced, so GPGPU computing is increasingly accepted in HPC research. Much research has been done to help developers optimize GPU applications, but fully understanding GPU performance behavior remains an active research topic. We developed an analytical tool called TEG (Timing Estimation tool for GPU) to estimate GPU performance. Previous work shows that TEG gives a good approximation and can help quantify the performance effects of bottlenecks. We have improved the tool, and in this paper we use TEG to analyze GPU performance scaling behavior. TEG takes the disassembly output of the CUDA kernel binary code and an instruction trace as input. It does not execute the code, but tries to model the execution of CUDA code with timing information. Because TEG takes native GPU assembly code as input, it can estimate the execution time with a small error and gives more insight into GPU performance results.
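To make the idea of estimating time from a trace without executing it more concrete, here is a minimal sketch. It is not TEG's actual model: it assumes a single instruction stream issued in order, one instruction per cycle, with a register scoreboard. The `Instr` type, the `estimate_cycles` function, and every latency value are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple

# Hypothetical per-opcode latencies in cycles (assumed values, not from the paper).
LATENCY = {"LD": 400, "FADD": 20, "FMUL": 20, "ST": 1}


@dataclass(frozen=True)
class Instr:
    """One entry of a (hypothetical) instruction trace."""
    opcode: str
    dest: Optional[str] = None      # destination register, if any
    srcs: Tuple[str, ...] = ()      # source registers


def estimate_cycles(trace: Sequence[Instr]) -> int:
    """Estimate the cycles needed to run `trace` without executing it.

    Assumed model: in-order issue, at most one instruction per cycle,
    and an instruction issues only once all its source registers are ready.
    """
    ready_at = {}      # register -> cycle at which its value becomes available
    next_issue = 0     # earliest cycle at which the next instruction may issue
    finish = 0         # completion cycle of the latest-finishing instruction
    for ins in trace:
        # Stall until every source operand has been produced.
        start = max([next_issue] + [ready_at.get(r, 0) for r in ins.srcs])
        done = start + LATENCY[ins.opcode]
        if ins.dest is not None:
            ready_at[ins.dest] = done
        finish = max(finish, done)
        next_issue = start + 1
    return finish


# Two independent loads overlap; the add waits for both, the store waits for the add.
trace = [
    Instr("LD", "r0"),
    Instr("LD", "r1"),
    Instr("FADD", "r2", ("r0", "r1")),
    Instr("ST", None, ("r2",)),
]
print(estimate_cycles(trace))  # 422 under the assumed latencies
```

In this toy model the two loads overlap, so memory latency dominates the total; changing an entry in `LATENCY` and re-running shows how much a single bottleneck contributes to the estimate.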

https://inria.hal.science/hal-00764874

Submitted on: Thursday, December 13, 2012, 15:18:34

Last modified on: Friday, March 24, 2023, 14:52:56

Dates and versions

hal-00764874 , version 1 (13-12-2012)

Identifiers

HAL Id : hal-00764874 , version 1
DOI : 10.1145/2162131.2162136

Cite

Junjie Lai, André Seznec. Break Down GPU Execution Time with an Analytical Method. Rapido '12, Jan 2012, Paris, France. ⟨10.1145/2162131.2162136⟩. ⟨hal-00764874⟩

Collections

INSTITUT-TELECOM EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA IRISA-D3 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES ANR UR1-MATH-NUM
