On the Use of Small 2D Convolutions on GPUs

Shams A.H. Al Umairy; Alexander S. van Amesfoort; Irwan D. Setija; Martijn C. van Beurden; Henk J. Sips

Communication Dans Un Congrès Année : 2010

On the Use of Small 2D Convolutions on GPUs

(1) , (1) , (2) , (3) , (1)

1
2
3

Shams A.H. Al Umairy

Fonction : Auteur

Delft University of Technology

Alexander S. van Amesfoort

Fonction : Auteur

Delft University of Technology

Irwan D. Setija

Fonction : Auteur

ASML [VELDHOVEN]

Martijn C. van Beurden

Fonction : Auteur

Eindhoven University of Technology [Eindhoven]

Henk J. Sips

Fonction : Auteur

Delft University of Technology

Résumé

Computing many small 2D convolutions using FFTs is a basis for a large number of applications in many domains in science and engineering, among them electromagnetic diffraction modeling in physics. The GPU architecture seems to be a suitable architecture to accelerate these convolutions, but reaching high application performance requires substantial development time and non-portable optimizations. In this work, we present the techniques, performance results and considerations to accelerate small 2D convolutions using CUDA, and compare performance to a multi-threaded CPU implementation. To improve programmability and performance of applications that make heavy use of small convolutions, we argue that two improvements to software and hardware are needed: FFT libraries must be extended with a single convolution function and communication bandwidth between CPU and GPU needs to be drastically improved.

Domaines

Architectures Matérielles [cs.AR]

Fichier principal

A4MMC-al-umairy.pdf (202.58 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Ist Rennes : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00493873

Soumis le : lundi 21 juin 2010-15:10:44

Dernière modification le : mardi 26 mars 2024-17:44:13

Archivage à long terme le : mercredi 22 septembre 2010-18:08:40

Dates et versions

inria-00493873 , version 1 (21-06-2010)

Identifiants

HAL Id : inria-00493873 , version 1

Citer

Shams A.H. Al Umairy, Alexander S. van Amesfoort, Irwan D. Setija, Martijn C. van Beurden, Henk J. Sips. On the Use of Small 2D Convolutions on GPUs. A4MMC 2010 - 1st Workshop on Applications for Multi and Many Core Processors, Jun 2010, Saint Malo, France. ⟨inria-00493873⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ISCA2010 A4MMC

60 Consultations

735 Téléchargements

On the Use of Small 2D Convolutions on GPUs

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager