A generic scheduler to foster data locality for GPU and out-of-core task-based applications - Inria - Institut national de recherche en sciences et technologies du numérique Access content directly
Preprints, Working Papers, ... Year : 2023

A generic scheduler to foster data locality for GPU and out-of-core task-based applications

Abstract

Hardware accelerators, such as GPUs, now provide a large part of the computational power used for scientific simulations. GPUs come with their own (limited) memory and are connected to the main memory of the machine via a bus with limited bandwidth. Scientific simulations often operate on very large data, to the point of not fitting in the limited GPU memory. In this case, one has to turn to out-of-core computing: data are kept in the CPU memory, and moved back and forth to the GPU memory when needed for the computation. This out-of-core situation also happens when processing on multicore CPUs with limited memory huge datasets stored on disk. In both cases, data movement quickly becomes a performance bottleneck. Task-based runtime schedulers have emerged as a convenient and efficient way to manage large applications on such heterogeneous platforms. They are in charge of choosing which tasks to assign on which processing unit and in which order they should be processed. In this work, we focus on this problem of scheduling for a taskbased runtime to improve data locality in an out-of-core setting, in order to reduce data movements. We design a data-aware strategy for both task scheduling and data eviction from limited memories. We compare this to existing scheduling techniques in runtime systems. Using the StarPU runtime, we show that our strategy achieves significantly better performance when scheduling tasks on multiple GPUs with limited memory, as well as on multiple CPU cores with limited main memory.
Fichier principal
Vignette du fichier
memory-aware-scheduling.pdf (1021.21 Ko) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

hal-04146714 , version 1 (30-06-2023)

Licence

Attribution

Identifiers

  • HAL Id : hal-04146714 , version 1

Cite

Maxime Gonthier, Samuel Thibault, Loris Marchal. A generic scheduler to foster data locality for GPU and out-of-core task-based applications. 2023. ⟨hal-04146714⟩
78 View
65 Download

Share

Gmail Facebook X LinkedIn More