Optimizing 3D Convolutions for Wavelet Transforms on CPUs with SSE Units and GPUs - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Rapport (Rapport De Recherche) Année : 2013

Optimizing 3D Convolutions for Wavelet Transforms on CPUs with SSE Units and GPUs

Résumé

Nanosimulations present a big HPC challenge as they present increasing performance demands in heterogeneous execution environments. In this paper, we present our optimization methodology for BigDFT, a nanosimulation software using Density Functional Theory. We explore autotuning possibilities for BigDFT's 3D convolutions by studying optimization techniques for several architectures. Namely, we focus on processors with vector units and on GPU acceleration. We report on the portability and the performance gains of our approach (speedup x2 on CPU, x5 on GPU) and discuss the relation between algorithmic specifics, architecture and performance.
Fichier principal
Vignette du fichier
RR-LIG-032.pdf (8.17 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00953056 , version 1 (28-02-2014)

Identifiants

  • HAL Id : hal-00953056 , version 1

Citer

Brice Videau, Vania Marangozova-Martin, Luigi Genovese, Thierry Deutsch. Optimizing 3D Convolutions for Wavelet Transforms on CPUs with SSE Units and GPUs. [Research Report] RR-LIG-032, 2013. ⟨hal-00953056⟩
165 Consultations
264 Téléchargements

Partager

Gmail Facebook X LinkedIn More