Skip to Main content Skip to Navigation
New interface
Journal articles

Time-domain BEM for the wave equation on distributed-heterogeneous architectures: A blocking approach

Abstract : The problem of time-domain BEM for the wave equation in acoustics and electromagnetism can be expressed as a sparse linear system composed of multiple interaction/convolution matrices. It can be solved by using sparse matrix-vector products which are inefficient to achieve high Flop-rate neither on CPUs nor GPUs. In this paper we extend the approach proposed in a previous work [1] in which we re-order the computation to get a special matrix structure with one dense vector per row. This new structure is called a slice matrix and is computed with a custom matrix/vector product operator. In this study, we present an optimized implementation of this operator on Nvidia GPUs based on two blocking strategies. We explain how we can obtain multiple block-values from a slice and how these can be computed efficiently on GPUs since we target heterogeneous nodes composed of CPUs and GPUs. In order to deal with different efficiencies of the processing units we use a greedy heuristic that dynamically balances work among the workers. We demonstrate the performance of our system by studying the quality of the balancing heuristic and the sequential Flop-rate of the blocked implementations. Finally, we validate our implementation with an industrial test case on 8 heterogeneous nodes, each composed of 12 CPUs and 3 GPUs.
Complete list of metadata
Contributor : Bérenger Bramas Connect in order to contact the contributor
Submitted on : Thursday, September 17, 2015 - 9:53:11 AM
Last modification on : Saturday, June 25, 2022 - 7:41:21 PM




Bérenger Bramas, Olivier Coulaud, Guillaume Sylvand. Time-domain BEM for the wave equation on distributed-heterogeneous architectures: A blocking approach. Parallel Computing, 2015, 49, pp.66-82. ⟨10.1016/j.parco.2015.07.005⟩. ⟨hal-01200727⟩



Record views