
Understanding G80 behavior and performances

Cyril Crassin 1 Fabrice Neyret 1
1 ARTIS - Acquisition, representation and transformations for image synthesis
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, Grenoble INP - Institut polytechnique de Grenoble - Grenoble Institute of Technology
Abstract : Optimization matters very differently on GPUs than on CPUs because the performance contrasts are extreme. GPUs have "fast paths": when programming them, one often meets unintuitive and discouraging 1000x slowdowns, either relative to expectations or after a small change. Conversely, knowing these fast paths can yield speedups of more than 1000x. The G80 series can be even less intuitive in this respect, since it hides some inefficiencies very effectively (via threading) up to the point where it suddenly no longer can. To find and stay on these fast paths, it is important to understand how GPUs work and behave. In particular, we were interested in how fragments are scheduled among the G80 stream processors.
Contributor : Fabrice Neyret
Submitted on : Tuesday, November 21, 2017 - 3:02:44 PM
Last modification on : Saturday, November 6, 2021 - 2:32:06 PM


Understanding G80 behavior.pdf
Files produced by the author(s)


  • HAL Id : hal-01643579, version 1



Cyril Crassin, Fabrice Neyret. Understanding G80 behavior and performances. [Research Report] LJK-INRIA. 2008. ⟨hal-01643579⟩


