Skip to Main content Skip to Navigation
New interface
Conference papers

Generic algorithmic scheme for 2D stencil applications on hybrid machines

Abstract : Hardware accelerators are classic scientific coprocessors in HPC machines. However, the number of CPU cores on the mother board is increasing and constitutes a non negligible part of the total computing power of the machine. So, running an application both on an accelerator (like a GPU or a Xeon-Phi device) and on the CPU cores can provide the highest performance. Moreover, it is now possible to include different accelerators in a machine, in order to support and to speedup a larger set of applications. Then, running an application part on the most suitable device allows to reach high performance, but using all unused devices in the machine should permit to improve even more the performance of that part. However, the overlapping of computations with inter-device data transfers is mandatory to limit the overhead of this approach, leading to complex asynchronous algorithms and multi-paradigm optimized codes. This article introduces our research and experiments on cooperation between several CPU and both a GPU and a Xeon-Phi accelerators, all included in a same machine.
Complete list of metadata
Contributor : Sylvain Contassot-Vivier Connect in order to contact the contributor
Submitted on : Wednesday, January 27, 2016 - 3:38:25 PM
Last modification on : Wednesday, March 9, 2022 - 3:10:50 AM


  • HAL Id : hal-01263242, version 1


Stéphane Vialle, Sylvain Contassot-Vivier, Patrick Mercier. Generic algorithmic scheme for 2D stencil applications on hybrid machines. ARCS 2016 - Architecture of Computing Systems , Apr 2016, Nuremberg, Germany. ⟨hal-01263242⟩



Record views