Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

Using Static Allocation Algorithms for Matrix Matrix Multiplication on Multicores and GPUs

Lionel Eyraud-Dubois 1, 2 Thomas Lambert 1, 3
1 Realopt - Reformulations based algorithms for Combinatorial Optimization
LaBRI - Laboratoire Bordelais de Recherche en Informatique, IMB - Institut de Mathématiques de Bordeaux, Inria Bordeaux - Sud-Ouest
Abstract : We consider how to allocate data when performing matrix multiplication on a heterogeneous node, with multicores and GPUs. Classical (cyclic) allocations designed for homogeneous settings are not appropriate, but the advent of task-based runtime systems makes it possible to use more general allocations. Previous theoretical work has proposed square and cube partitioning algorithms aimed at minimizing data movement for matrix multiplication. We propose techniques to adapt these continuous square partitionings to allocating discrete tiles of a matrix, and strategies to adapt the static allocation at run-time. We use these techniques in an implementation of Matrix Multiplication based on the StarPU runtime system, and we show through extensive experiments that this implementation allows to consistently obtain a lower communication volume while improving slightly the execution time, compared to standard state-of-the-art dynamic strategies.
Document type :
Preprints, Working Papers, ...
Complete list of metadatas

Cited literature [19 references]  Display  Hide  Download

https://hal.inria.fr/hal-01670678
Contributor : Lionel Eyraud-Dubois <>
Submitted on : Friday, December 22, 2017 - 1:03:16 PM
Last modification on : Tuesday, May 7, 2019 - 11:42:10 AM
Long-term archiving on: : Friday, March 23, 2018 - 12:16:45 PM

File

ipdps.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01670678, version 1

Collections

Citation

Lionel Eyraud-Dubois, Thomas Lambert. Using Static Allocation Algorithms for Matrix Matrix Multiplication on Multicores and GPUs. 2017. ⟨hal-01670678v1⟩

Share

Metrics

Record views

194

Files downloads

85