Time-domain BEM for the wave equation on distributed-heterogeneous architectures: A blocking approach

Abstract : The problem of time-domain BEM for the wave equation in acoustics and electromagnetism can be expressed as a sparse linear system composed of multiple interaction/convolution matrices. It can be solved by using sparse matrix-vector products which are inefficient to achieve high Flop-rate neither on CPUs nor GPUs. In this paper we extend the approach proposed in a previous work [1] in which we re-order the computation to get a special matrix structure with one dense vector per row. This new structure is called a slice matrix and is computed with a custom matrix/vector product operator. In this study, we present an optimized implementation of this operator on Nvidia GPUs based on two blocking strategies. We explain how we can obtain multiple block-values from a slice and how these can be computed efficiently on GPUs since we target heterogeneous nodes composed of CPUs and GPUs. In order to deal with different efficiencies of the processing units we use a greedy heuristic that dynamically balances work among the workers. We demonstrate the performance of our system by studying the quality of the balancing heuristic and the sequential Flop-rate of the blocked implementations. Finally, we validate our implementation with an industrial test case on 8 heterogeneous nodes, each composed of 12 CPUs and 3 GPUs.
Liste complète des métadonnées

https://hal.inria.fr/hal-01200727
Contributeur : Bérenger Bramas <>
Soumis le : jeudi 17 septembre 2015 - 09:53:11
Dernière modification le : jeudi 11 janvier 2018 - 06:22:35

Identifiants

Collections

Citation

Bérenger Bramas, Olivier Coulaud, Guillaume Sylvand. Time-domain BEM for the wave equation on distributed-heterogeneous architectures: A blocking approach. Parallel Computing, Elsevier, 2015, pp.66-82. 〈http://www.sciencedirect.com/science/article/pii/S0167819115001088〉. 〈10.1016/j.parco.2015.07.005〉. 〈hal-01200727〉

Partager

Métriques

Consultations de la notice

242