Conference Papers, Year: 2017

Dynamic Memory-Aware Task-Tree Scheduling

Guillaume Aupy, Clément Brasseur, Loris Marchal

Abstract

Factorizing sparse matrices using direct multi-frontal methods generates directed tree-shaped task graphs, where edges represent data dependencies between tasks. This paper revisits the execution of tree-shaped task graphs using multiple processors that share a bounded memory. A task can only be executed if all its input and output data fit into the memory. The key difficulty is to order the task executions so as to achieve high parallelism while staying below the memory bound. In particular, because the input data of unprocessed tasks must be kept in memory, a bad scheduling strategy might compromise the termination of the algorithm. In the single-processor case, solutions that are guaranteed to stay below a memory bound are known. The multi-processor case (when one tries to minimize the total completion time) has been shown to be NP-complete. We present in this paper a novel heuristic solution that has a low complexity and is guaranteed to complete the tree within a given memory bound. We compare our algorithm to state-of-the-art strategies and observe that, on both actual execution trees and synthetic trees, we always perform better than these solutions, with average speedups between 1.25 and 1.45 on actual assembly trees. Moreover, we show that the overhead of our algorithm is negligible even on deep trees, which would allow it to be executed at runtime.
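To illustrate the memory constraint described in the abstract, here is a minimal sketch that computes the peak memory of a sequential execution order on a small task tree. It is not the paper's heuristic. It assumes a simplified model in which each task produces a single output of known size, a task needs its children's outputs and its own output in memory while it runs, and the children's outputs are freed once it completes. The names `peak_memory`, `children`, `out_size`, and the example tree are hypothetical and chosen for illustration only.

```python
from typing import Dict, List


def peak_memory(children: Dict[str, List[str]],
                out_size: Dict[str, int],
                order: List[str]) -> int:
    """Peak memory of running the tasks one at a time in `order`.

    Assumed model (not taken from the paper): a task's inputs are the
    outputs of its children; inputs and output coexist during execution,
    and the inputs are freed when the task completes.
    """
    resident = 0   # total size of outputs produced but not yet consumed
    peak = 0
    done = set()
    for task in order:
        kids = children.get(task, [])
        if not all(k in done for k in kids):
            raise ValueError(f"{task} is scheduled before one of its children")
        # While the task runs, everything resident plus its own output is held.
        during = resident + out_size[task]
        peak = max(peak, during)
        # After completion, the children's outputs are released.
        resident = during - sum(out_size[k] for k in kids)
        done.add(task)
    return peak


# Hypothetical tree: root r has children a and b; a has children a1 and a2.
children = {"r": ["a", "b"], "a": ["a1", "a2"]}
out_size = {"a1": 4, "a2": 4, "a": 1, "b": 5, "r": 1}

subtree_first = ["a1", "a2", "a", "b", "r"]   # finish the subtree of a before b
leaf_b_first = ["b", "a1", "a2", "a", "r"]    # output of b stays resident throughout

bound = 10
for order in (subtree_first, leaf_b_first):
    peak = peak_memory(children, out_size, order)
    print(order, "peak =", peak, "fits" if peak <= bound else "exceeds bound")
```

Under these assumptions, the two orders process the same tree but reach different peaks, because the second keeps the output of `b` in memory while the whole subtree of `a` is processed. This is the effect the abstract refers to when it says that a bad scheduling strategy can compromise termination under a memory bound.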
Main file: main.pdf (823.82 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01472062, version 1 (20-02-2017)

Identifiers

  • HAL Id: hal-01472062, version 1

Cite

Guillaume Aupy, Clément Brasseur, Loris Marchal. Dynamic Memory-Aware Task-Tree Scheduling. IPDPS 2017 - 31st IEEE International Parallel & Distributed Processing Symposium, May 2017, Orlando, United States. pp.10. ⟨hal-01472062⟩