Global Memory Access Modelling for Efficient Implementation of the Lattice Boltzmann Method on Graphics Processing Units

Christian Obrecht 1 Frédéric Kuznik 1 Bernard Tourancheau 2, 3, 4 Jean-Jacques Roux 1
2 SWING - Smart Wireless Networking
Inria Grenoble - Rhône-Alpes, CITI - CITI Centre of Innovation in Telecommunications and Integration of services
4 GRAAL - Algorithms and Scheduling for Distributed Heterogeneous Platforms
Inria Grenoble - Rhône-Alpes, LIP - Laboratoire de l'Informatique du Parallélisme
Abstract : In this work, we investigate the global memory access mech- anism on recent GPUs. For the purpose of this study, we created spe- cific benchmark programs, which allowed us to explore the scheduling of global memory transactions. Thus, we formulate a model capable of estimating the execution time for a large class of applications. Our main goal is to facilitate optimisation of regular data-parallel applications on GPUs. As an example, we finally describe our CUDA implementations of LBM flow solvers on which our model was able to estimate performance with less than 5% relative error.
Type de document :
Communication dans un congrès
J.M.L.M. Palma et al. VECPAR, 2011, Porto, Portugal. Springer, 6449, pp.151--161, 2011, LNCS
Liste complète des métadonnées

Littérature citée [12 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00563159
Contributeur : Bernard Tourancheau <>
Soumis le : vendredi 4 février 2011 - 10:23:24
Dernière modification le : mardi 16 janvier 2018 - 15:43:07
Document(s) archivé(s) le : jeudi 5 mai 2011 - 02:50:25

Fichier

obrecht11a.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00563159, version 1

Collections

Citation

Christian Obrecht, Frédéric Kuznik, Bernard Tourancheau, Jean-Jacques Roux. Global Memory Access Modelling for Efficient Implementation of the Lattice Boltzmann Method on Graphics Processing Units. J.M.L.M. Palma et al. VECPAR, 2011, Porto, Portugal. Springer, 6449, pp.151--161, 2011, LNCS. 〈inria-00563159〉

Partager

Métriques

Consultations de la notice

247

Téléchargements de fichiers

249