TEG: GPU Performance Estimation Using a Timing Model

Junjie Lai; André Seznec

Report (Research Report) Year: 2011

Junjie Lai

Role: Author
PersonId: 913983

Amdahl's Law is Forever

André Seznec

Role: Author
PersonId: 13729
IdHAL: andre-seznec
ORCID: 0000-0002-3058-6503
IdRef: 033236402

Amdahl's Law is Forever

Abstract

Modern Graphics Processing Units (GPUs) offer significant speedups over conventional processors, and general-purpose programming on GPUs has become an important research area. The CUDA programming model provides a C-like interface and is widely accepted. However, because hardware vendors do not disclose enough details of the underlying architecture, programmers have to optimize their applications without fully understanding their performance characteristics. In this paper we present a GPU timing model that provides more insight into application performance on GPUs. Based on this model, we developed TEG, a timing estimation tool for GPU CUDA programs. In particular, TEG illustrates how performance scales from one warp (a CUDA thread group) to multiple concurrent warps on an SM (Streaming Multiprocessor). Because TEG takes native GPU assembly code as input, it can estimate execution time with only a small error. TEG can help programmers better understand performance results and quantify the performance effects of bottlenecks.

Domains

Performance and reliability [cs.PF]

Main file

RR-7804.pdf (1.09 MB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-00641726

Submitted on: Wednesday, November 16, 2011, 15:03:10

Last modified on: Friday, March 24, 2023, 14:52:55

Long-term archiving on: Monday, December 5, 2016, 04:14:30

Dates and versions

hal-00641726, version 1 (16-11-2011)

Identifiers

HAL Id: hal-00641726, version 1

Cite

Junjie Lai, André Seznec. TEG: GPU Performance Estimation Using a Timing Model. [Research Report] RR-7804, INRIA. 2011. ⟨hal-00641726⟩

Collections

INSTITUT-TELECOM EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA INRIA-RRRT IRISA-D3 INRIA2 LARA UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES ANR UR1-MATH-NUM

421 views

611 downloads
