TEG: GPU Performance Estimation Using a Timing Model

Junjie Lai 1 André Seznec 1
1 ALF - Amdahl's Law is Forever
Inria Rennes – Bretagne Atlantique , IRISA-D3 - ARCHITECTURE
Abstract : Modern Graphic Processing Units (GPUs) offer significant performance speedup over conventional processors. Programming on GPU for general purpose applications has become an important research area. CUDA programming model provides a C-like interface and is widely accepted. However, since hardware vendors do not disclose enough underlying architecture details, programmers have to optimize their applications without fully understanding the performance characteristics. In this paper we present a GPU timing model to provide more insights into the applications' performance on GPU. A GPU CUDA program timing estimation tool (TEG) is developed based on the GPU timing model. Especially, TEG illustrates how performance scales from one warp (CUDA thread group) to multiple concurrent warps on SM (Streaming Multiprocessor). Because TEG takes the native GPU assembly code as input, it allows to estimate the execution time with only a small error. TEG can help programmers to better understand the performance results and quantify bottlenecks' performance effects.
Document type :
Complete list of metadatas

Cited literature [10 references]  Display  Hide  Download

Contributor : Junjie Lai <>
Submitted on : Wednesday, November 16, 2011 - 3:03:10 PM
Last modification on : Thursday, November 15, 2018 - 11:57:43 AM
Long-term archiving on : Monday, December 5, 2016 - 4:14:30 AM


Files produced by the author(s)


  • HAL Id : hal-00641726, version 1


Junjie Lai, André Seznec. TEG: GPU Performance Estimation Using a Timing Model. [Research Report] RR-7804, INRIA. 2011. ⟨hal-00641726⟩



Record views


Files downloads