Combining policies: the best of human expertise and neurocontrol

Vincent Berthier; Adrien Couëtoux; Olivier Teytaud

Communication Dans Un Congrès Année : 2015

Combining policies: the best of human expertise and neurocontrol

(1, 2) , (1, 2) , (1, 2)

1
2

Vincent Berthier

Fonction : Auteur
PersonId : 5464
IdHAL : vincent-berthier

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Adrien Couëtoux

Fonction : Auteur

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Olivier Teytaud

Fonction : Auteur
PersonId : 581
IdHAL : olivier-teytaud
IdRef : 05971008X

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Résumé

We consider sequential decision making in the case where a generative model and a parametric policy are available. Such a framework is naturally tackled with Direct Policy Search, i.e. parametric op-timisation over simulations. We propose a simple method that combines this parametric policy with a more generic neural network, where all parameters are trained simultaneously. As such, our approach doesn't require any computational overhead. We show that the resulting policy significantly outperforms both the domain specific policies and the neural network on a unit commitment test problem.

Domaines

Optimisation et contrôle [math.OC] Intelligence artificielle [cs.AI]

Fichier principal

EAsource.pdf (1.36 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Olivier Teytaud : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01194516

Soumis le : lundi 7 septembre 2015-10:42:01

Dernière modification le : lundi 22 avril 2024-10:06:24

Archivage à long terme le : mardi 8 décembre 2015-11:38:11

Dates et versions

hal-01194516 , version 1 (07-09-2015)

Identifiants

HAL Id : hal-01194516 , version 1

Citer

Vincent Berthier, Adrien Couëtoux, Olivier Teytaud. Combining policies: the best of human expertise and neurocontrol. Artificial Evolution 2015, 2015, Lyon, France. To appear. ⟨hal-01194516⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UMR8623 CENTRALESUPELEC INRIA2 LRI-AO TDS-MACS UNIV-PARIS-SACLAY GS-COMPUTER-SCIENCE

221 Consultations

123 Téléchargements

Combining policies: the best of human expertise and neurocontrol

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager