Fast Approximation Algorithms for Task-Based Runtime Systems

Olivier Beaumont 1 Lionel Eyraud-Dubois 1 Suraj Kumar 2
1 Realopt - Reformulations based algorithms for Combinatorial Optimization
LaBRI - Laboratoire Bordelais de Recherche en Informatique, IMB - Institut de Mathématiques de Bordeaux, Inria Bordeaux - Sud-Ouest
2 STORM - STatic Optimizations, Runtime Methods
LaBRI - Laboratoire Bordelais de Recherche en Informatique, Inria Bordeaux - Sud-Ouest
Abstract : In High Performance Computing, heterogeneity is now the norm with specialized accelerators like GPUs providing efficient computational power. Resulting complexity led to the development of task-based runtime systems, where complex computations are described as task graphs, and scheduling decisions are made at run-time to perform load balancing between all resources of the platforms. Developing good scheduling strategies, even at the scale of a single node, and analyzing them both theoretically and in practice is expected to have a very high impact on the performance of current HPC systems. The special case of two kinds of resources, typically CPUs and GPUs is already of great practical interest. The scheduling policy Hetero-Prio has been proposed in the context of fast multipole computations (FMM), and has been extended to general task graphs with very promising results. In this paper, we provide a theoretical study of the performance of HeteroPrio, by proving approximation bounds compared to the optimal schedule, both in the case of independent tasks and in the case of general task graphs. Interestingly, our results establish that spoliation (a technique that enables resources to restart uncompleted tasks on another resource) is enough to prove bounded approximation ratios for a list scheduling algorithm on two unrelated resources, which is known to be impossible otherwise. This result holds true both for independent and dependent tasks graphs. Additionally, we provide an experimental evaluation of HeteroPrio on real task graphs from dense linear algebra computation, that establishes its strong performance in practice.
Complete list of metadatas

Cited literature [30 references]  Display  Hide  Download

https://hal.inria.fr/hal-01878606
Contributor : Lionel Eyraud-Dubois <>
Submitted on : Friday, September 21, 2018 - 11:40:44 AM
Last modification on : Tuesday, May 7, 2019 - 11:42:03 AM
Long-term archiving on : Saturday, December 22, 2018 - 2:57:05 PM

File

heteroPrioApproxProofsJournal....
Files produced by the author(s)

Identifiers

Citation

Olivier Beaumont, Lionel Eyraud-Dubois, Suraj Kumar. Fast Approximation Algorithms for Task-Based Runtime Systems. Concurrency and Computation: Practice and Experience, Wiley, 2018, 30 (17), ⟨10.1002/cpe.4502⟩. ⟨hal-01878606⟩

Share

Metrics

Record views

285

Files downloads

67