Optimizing 3D Convolutions for Wavelet Transforms on CPUs with SSE Units and GPUs - Archive ouverte HAL Access content directly
Reports (Research Report) Year : 2013

Optimizing 3D Convolutions for Wavelet Transforms on CPUs with SSE Units and GPUs

(1) , (1) , (2) , (2)
1
2

Abstract

Nanosimulations present a big HPC challenge as they present increasing performance demands in heterogeneous execution environments. In this paper, we present our optimization methodology for BigDFT, a nanosimulation software using Density Functional Theory. We explore autotuning possibilities for BigDFT's 3D convolutions by studying optimization techniques for several architectures. Namely, we focus on processors with vector units and on GPU acceleration. We report on the portability and the performance gains of our approach (speedup x2 on CPU, x5 on GPU) and discuss the relation between algorithmic specifics, architecture and performance.
Fichier principal
Vignette du fichier
RR-LIG-032.pdf (8.17 Mo) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-00953056 , version 1 (28-02-2014)

Identifiers

  • HAL Id : hal-00953056 , version 1

Cite

Brice Videau, Vania Marangozova-Martin, Luigi Genovese, Thierry Deutsch. Optimizing 3D Convolutions for Wavelet Transforms on CPUs with SSE Units and GPUs. [Research Report] RR-LIG-032, 2013. ⟨hal-00953056⟩
165 View
251 Download

Share

Gmail Facebook Twitter LinkedIn More