Combining policies: the best of human expertise and neurocontrol - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

Combining policies: the best of human expertise and neurocontrol

Résumé

We consider sequential decision making in the case where a generative model and a parametric policy are available. Such a framework is naturally tackled with Direct Policy Search, i.e. parametric op-timisation over simulations. We propose a simple method that combines this parametric policy with a more generic neural network, where all parameters are trained simultaneously. As such, our approach doesn't require any computational overhead. We show that the resulting policy significantly outperforms both the domain specific policies and the neural network on a unit commitment test problem.
Fichier principal
Vignette du fichier
EAsource.pdf (1.36 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01194516 , version 1 (07-09-2015)

Identifiants

  • HAL Id : hal-01194516 , version 1

Citer

Vincent Berthier, Adrien Couëtoux, Olivier Teytaud. Combining policies: the best of human expertise and neurocontrol. Artificial Evolution 2015, 2015, Lyon, France. To appear. ⟨hal-01194516⟩
221 Consultations
123 Téléchargements

Partager

Gmail Facebook X LinkedIn More