hal-00746089, version 2
Optimized M2L Kernels for the Chebyshev Interpolation based Fast Multipole Method
Matthias Messner
1Bérenger Bramas 1Olivier Coulaud 1Eric Darve a, 2, 3
(2012)
Résumé : A fast multipole method (FMM) for asymptotically smooth kernel functions (1/r, 1/r^4, Gauss and Stokes kernels, radial basis functions, etc.) based on a Chebyshev interpolation scheme has been introduced in [Fong et al., 2009]. The method has been extended to oscillatory kernels (e.g., Helmholtz kernel) in [Messner et al., 2012]. Beside its generality this FMM turns out to be favorable due to its easy implementation and its high performance based on intensive use of highly optimized BLAS libraries. However, one of its bottlenecks is the precomputation of the multiple-to-local (M2L) operator, and its higher number of floating point operations (flops) compared to other FMM formulations. Here, we present several optimizations for that operator, which is known to be the costliest FMM operator. The most efficient ones do not only reduce the precomputation time by a factor up to 340 but they also speed up the matrix-vector product. We conclude with comparisons and numerical validations of all presented optimizations.
- a – Stanford University
- 1 : HiePACS (INRIA Bordeaux - Sud-Ouest)
- INRIA – Université de Bordeaux – CNRS : UMR5800 – CERFACS
- 2 : Mechanical Engineering Department
- Stanford University
- 3 : Institute for Computational and Mathematical Engineering (iCME)
- Stanford University
- Domaine : Informatique/Ingénierie, finance et science
Informatique/Logiciel mathématique
Mathématiques/Analyse numérique - Mots-clés : Fast Multipole Method – asymptotically smooth kernels – oscillatory kernels – black-box method – Chebyshev interpolation
- Versions disponibles : v1 (27-10-2012) v2 (20-11-2012)
- hal-00746089, version 2
- http://hal.inria.fr/hal-00746089
- oai:hal.inria.fr:hal-00746089
- Contributeur : Matthias Messner
- Soumis le : Mardi 20 Novembre 2012, 01:44:16
- Dernière modification le : Mardi 20 Novembre 2012, 18:46:12






Documents associés

Exporter