Geometric Deep Reinforcement Learning for Dynamic DAG Scheduling

Nathan Grinsztajn; Olivier Beaumont; Emmanuel Jeannot; Philippe Preux

Communication Dans Un Congrès Année : 2020

Geometric Deep Reinforcement Learning for Dynamic DAG Scheduling

(1) , (2) , (3) , (1)

1
2
3

Nathan Grinsztajn

Fonction : Auteur
PersonId : 1083320

Scool

Olivier Beaumont

Fonction : Auteur
PersonId : 181224
IdHAL : olivier-beaumont
ORCID : 0000-0003-2741-6228
IdRef : 124577083

High-End Parallel Algorithms for Challenging Numerical Simulations

Emmanuel Jeannot

Fonction : Auteur
PersonId : 15678
IdHAL : emmanuel-jeannot
ORCID : 0000-0002-3956-2997
IdRef : 084595108

Topology-Aware System-Scale Data Management for High-Performance Computing

Philippe Preux

Fonction : Auteur
PersonId : 5488
IdHAL : preux-philippe
IdRef : 059896353

Scool

Résumé

In practice, it is quite common to face combinatorial optimization problems which contain uncertainty along with non-determinism and dynamicity. These three properties call for appropriate algorithms; reinforcement learning (RL) is dealing with them in a very natural way. Today, despite some efforts, most real-life combinatorial optimization problems remain out of the reach of reinforcement learning algorithms. In this paper, we propose a reinforcement learning approach to solve a realistic scheduling problem, and apply it to an algorithm commonly executed in the high performance computing community, the Cholesky factorization. On the contrary to static scheduling, where tasks are assigned to processors in a predetermined ordering before the beginning of the parallel execution, our method is dynamic: task allocations and their execution ordering are decided at runtime, based on the system state and unexpected events, which allows much more flexibility. To do so, our algorithm uses graph neural networks in combination with an actor-critic algorithm (A2C) to build an adaptive representation of the problem on the fly. We show that this approach is competitive with state-of-the-art heuristics used in high-performance computing runtime systems. Moreover, our algorithm does not require an explicit model of the environment, but we demonstrate that extra knowledge can easily be incorporated and improves performance. We also exhibit key properties provided by this RL approach, and study its transfer abilities to other instances.

Mots clés

Reinforcement learning scheduling task graph DAG high performance computing combinatorial optimization Reinforcement learning

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

HPC_ADPRL.pdf (370.88 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Nathan Grinsztajn : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-03028981

Soumis le : mardi 19 janvier 2021-21:42:18

Dernière modification le : mercredi 20 mars 2024-17:52:16

Archivage à long terme le : mardi 20 avril 2021-18:03:48

Dates et versions

hal-03028981 , version 1 (19-01-2021)

Identifiants

HAL Id : hal-03028981 , version 1
ARXIV : 2011.04333

Citer

Nathan Grinsztajn, Olivier Beaumont, Emmanuel Jeannot, Philippe Preux. Geometric Deep Reinforcement Learning for Dynamic DAG Scheduling. IEEE SSCI 2020 - Symposium Series on Computational Intelligence, Dec 2020, Canberra / Virtual, Australia. ⟨hal-03028981⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA CRISTAL INRIA2 UNIV-LILLE CRISTAL-SCOOL

198 Consultations

344 Téléchargements

Geometric Deep Reinforcement Learning for Dynamic DAG Scheduling

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager