Non-Markovian Reinforcement Learning for Reactive Grid scheduling

Julien Perez; Balázs Kégl; Cecile Germain-Renaud

Conference paper, Year: 2011

Julien Perez

Role: Author
PersonId : 899399

Département Informatique

Balázs Kégl

Role: Author
PersonId : 842855

Machine Learning and Optimisation

Laboratoire de l'Accélérateur Linéaire

Laboratoire de Recherche en Informatique

Cecile Germain-Renaud

Role: Author
PersonId : 5317
IdHAL : cecile-germain
IdRef : 030907837

Machine Learning and Optimisation

Laboratoire de Recherche en Informatique

Abstract

Two recurring difficulties arise when solving real-world policy search problems. First, the variables defining the Markov decision process are often continuous, which requires either discretizing the state/action space or using a regression model, often non-linear, to approximate the Q-function needed in the reinforcement learning paradigm. Second, the Markovian hypothesis is assumed, although it is often highly questionable and can lead to unacceptably suboptimal policies. In this paper, the job scheduling problem in grid infrastructures is modeled as a multi-objective reinforcement learning problem over a continuous state-action space, under realistic assumptions; the high-level goals of users, administrators, and shareholders are captured through simple utility functions. Formalizing the problem as a partially observable Markov decision process (POMDP), we detail the algorithm of fitted Q-function learning using an Echo State Network. The experiments, conducted on a simulation of real grid activity, demonstrate the significant gain of the method over the native scheduling infrastructure and over a classic feed-forward neural network (FFNN) trained with back-propagation for Q-function learning, in the most difficult cases.
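To illustrate why an Echo State Network suits a partially observable setting, the following is a minimal sketch of a reservoir update with a linear Q readout. It is not the authors' implementation: the function names, the reservoir size, the two-dimensional input (imagined here as a load observation and an action feature), and the row-normalization used to keep the recurrent weights contractive are all assumptions made for illustration. The point it shows is that the reservoir state summarizes the whole observation history, so two trajectories ending in the same observation still yield different inputs to the Q readout.

```python
import math
import random


def make_reservoir(n_inputs, n_units, rho=0.9, seed=0):
    """Draw fixed random weights; in an ESN these are never trained."""
    rng = random.Random(seed)
    w_in = [[rng.uniform(-1.0, 1.0) for _ in range(n_inputs)]
            for _ in range(n_units)]
    w = []
    for _ in range(n_units):
        row = [rng.uniform(-1.0, 1.0) for _ in range(n_units)]
        norm = sum(abs(v) for v in row)
        # Assumption: scale each row to absolute sum rho < 1, a conservative
        # sufficient condition for the influence of old inputs to fade.
        w.append([rho * v / norm for v in row])
    return w_in, w


def step(w_in, w, state, obs):
    """One reservoir update: x' = tanh(W_in u + W x)."""
    new_state = []
    for i in range(len(state)):
        pre = sum(a * u for a, u in zip(w_in[i], obs))
        pre += sum(b * x for b, x in zip(w[i], state))
        new_state.append(math.tanh(pre))
    return new_state


def run(w_in, w, observations):
    """Feed a sequence of inputs, starting from the zero state."""
    state = [0.0] * len(w)
    for obs in observations:
        state = step(w_in, w, state, obs)
    return state


def q_value(w_out, state):
    """Linear readout: the only part a fitted Q procedure would train."""
    return sum(c * x for c, x in zip(w_out, state))


w_in, w = make_reservoir(n_inputs=2, n_units=20)

# Hypothetical inputs: [observed load, action feature]. Both histories end
# with the same input but differ earlier.
history_a = [[0.1, 0.0], [0.9, 1.0], [0.5, 0.0]]
history_b = [[0.8, 1.0], [0.2, 0.0], [0.5, 0.0]]

state_a = run(w_in, w, history_a)
state_b = run(w_in, w, history_b)

# Largest per-unit difference between the two final reservoir states.
gap = max(abs(a - b) for a, b in zip(state_a, state_b))
```

A memoryless feed-forward approximator fed only the last input would see identical inputs for both histories. In a fitted Q setting, the readout weights `w_out` would be obtained by regressing target Q-values on the collected reservoir states, for example with ridge regression, while `w_in` and `w` stay fixed.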

Domains

Distributed, Parallel, and Cluster Computing [cs.DC]

Main file

esn-cap11.pdf (360.42 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/inria-00586504

Submitted on: Saturday, April 16, 2011, 15:37:16

Last modified on: Thursday, April 11, 2024, 13:18:11

Long-term archiving on: Sunday, July 17, 2011, 02:27:18

Dates and versions

inria-00586504 , version 1 (16-04-2011)

Identifiers

HAL Id : inria-00586504 , version 1

Cite

Julien Perez, Balázs Kégl, Cecile Germain-Renaud. Non-Markovian Reinforcement Learning for Reactive Grid scheduling. Conférence Francophone d'Apprentissage, May 2011, Chambéry, France. ⟨inria-00586504⟩

Collections

IN2P3 INSTITUT-TELECOM EC-PARIS LAL CNRS INRIA TELECOM-SUDPARIS UMR8623 INRIA2 LRI-AO UNIV-PARIS-SACLAY

309 Views

138 Downloads
