Conference papers

Generic algorithmic scheme for 2D stencil applications on hybrid machines

Abstract : Hardware accelerators are classic scientific coprocessors in HPC machines. However, the number of CPU cores on the motherboard is increasing and now accounts for a non-negligible part of the machine's total computing power. Running an application on both an accelerator (such as a GPU or a Xeon-Phi device) and the CPU cores can therefore provide the highest performance. Moreover, it is now possible to include different accelerators in one machine in order to support and speed up a larger set of applications. Running each part of an application on the most suitable device yields high performance, and using the otherwise idle devices in the machine should improve the performance of that part even further. However, overlapping computations with inter-device data transfers is mandatory to limit the overhead of this approach, which leads to complex asynchronous algorithms and multi-paradigm optimized codes. This article introduces our research and experiments on cooperation between several CPUs, a GPU, and a Xeon-Phi accelerator, all included in the same machine.
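As an illustration only, the overlap of computations with inter-device transfers mentioned in the abstract can be sketched for a 2D stencil as follows. This is not the scheme of the paper: it is a minimal pure-Python model, assuming a 5-point Jacobi stencil, two hypothetical "devices" that each own a band of rows plus one halo row, and a thread pool standing in for asynchronous transfers. All function names (`jacobi_rows`, `hybrid_step`, `run_hybrid`, `run_reference`) are invented for this sketch. Each device first updates its frontier row, starts sending it to its neighbour, and updates its interior rows while the transfer is in flight.

```python
from concurrent.futures import ThreadPoolExecutor


def jacobi_rows(grid, first, last):
    """Return updated rows first..last-1 of one 5-point Jacobi step.

    Rows and columns on the border of `grid` are copied unchanged.
    """
    n_rows, n_cols = len(grid), len(grid[0])
    out = []
    for i in range(first, last):
        if i == 0 or i == n_rows - 1:
            out.append(list(grid[i]))
            continue
        row = [grid[i][0]]
        for j in range(1, n_cols - 1):
            row.append(0.25 * (grid[i - 1][j] + grid[i + 1][j]
                               + grid[i][j - 1] + grid[i][j + 1]))
        row.append(grid[i][n_cols - 1])
        out.append(row)
    return out


def hybrid_step(local_a, local_b, pool):
    """One step on two bands; the last row of local_a and the first row
    of local_b are halo copies of the neighbour's frontier row."""
    # 1. Update the frontier rows first, since the neighbour needs them.
    last_a = len(local_a) - 2  # local index of the last row owned by A
    frontier_a = jacobi_rows(local_a, last_a, last_a + 1)[0]
    frontier_b = jacobi_rows(local_b, 1, 2)[0]
    # 2. Start the "transfers" asynchronously (a copy stands in for a
    #    real inter-device transfer; this is an assumption of the sketch).
    send_a = pool.submit(list, frontier_a)
    send_b = pool.submit(list, frontier_b)
    # 3. Update the interior rows while the transfers are in flight.
    interior_a = jacobi_rows(local_a, 0, last_a)
    interior_b = jacobi_rows(local_b, 2, len(local_b))
    # 4. Wait for the transfers and rebuild each band with its new halo.
    new_a = interior_a + [frontier_a, send_b.result()]
    new_b = [send_a.result(), frontier_b] + interior_b
    return new_a, new_b


def run_hybrid(grid, split, steps):
    """Rows [0, split) go to device A, rows [split, n) to device B.

    Requires 1 <= split <= len(grid) - 1.
    """
    local_a = [list(r) for r in grid[:split + 1]]   # owned rows + halo
    local_b = [list(r) for r in grid[split - 1:]]   # halo + owned rows
    with ThreadPoolExecutor(max_workers=2) as pool:
        for _ in range(steps):
            local_a, local_b = hybrid_step(local_a, local_b, pool)
    return local_a[:-1] + local_b[1:]  # drop the halo rows


def run_reference(grid, steps):
    """Same computation on the whole grid, without partitioning."""
    for _ in range(steps):
        grid = jacobi_rows(grid, 0, len(grid))
    return grid
```

In this model both bands are updated by the same thread and only the transfers are asynchronous; on a real hybrid machine each band would be processed by its own device with that device's programming model, which is where the multi-paradigm complexity mentioned in the abstract comes from. Calling `run_hybrid(grid, split, steps)` and `run_reference(grid, steps)` on the same input should give identical grids.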
https://hal.inria.fr/hal-01263242
Contributor : Sylvain Contassot-Vivier
Submitted on : Wednesday, January 27, 2016 - 3:38:25 PM
Last modification on : Wednesday, September 16, 2020 - 10:43:05 AM

Identifiers

  • HAL Id : hal-01263242, version 1

Citation

Stéphane Vialle, Sylvain Contassot-Vivier, Patrick Mercier. Generic algorithmic scheme for 2D stencil applications on hybrid machines. ARCS 2016 - Architecture of Computing Systems, Apr 2016, Nuremberg, Germany. ⟨hal-01263242⟩
