Understanding G80 behavior and performances

Cyril Crassin 1 Fabrice Neyret 1
1 ARTIS - Acquisition, representation and transformations for image synthesis
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : Criticality of optimization on GPU is very different than for CPU due to the extremely high performance contrasts. GPUs have "fast paths": Unintuitive depressing 1000x slowdown as compared to expectations or after a small change are often met when programming. Conversely, knowing these fast paths can lead to more than 1000x speedup. G80 series can be even less intuitive regarding this issue since it can very efficiently hide some unoptimalities (via threading) up to the point where suddenly it can no longer. To find and remain in these fast paths, it is important to understand how the GPUs work and behave. In particular, we were interested in how fragments are scheduled among G80 stream processors.
Document type :
Reports
Liste complète des métadonnées

https://hal.inria.fr/hal-01643579
Contributor : Fabrice Neyret <>
Submitted on : Tuesday, November 21, 2017 - 3:02:44 PM
Last modification on : Wednesday, June 27, 2018 - 3:46:11 PM

File

Understanding G80 behavior.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01643579, version 1

Collections

Citation

Cyril Crassin, Fabrice Neyret. Understanding G80 behavior and performances. [Research Report] LJK-INRIA. 2008. ⟨hal-01643579⟩

Share

Metrics

Record views

358

Files downloads

62