Learning a Move-Generator for Upper Con dence Trees

Adrien Couetoux; Olivier Teytaud; Hassen Doghmen

Communication Dans Un Congrès Année : 2012

Learning a Move-Generator for Upper Con dence Trees

(1) , (1, 2) , (2)

1
2

Adrien Couetoux

Fonction : Auteur
PersonId : 910214

Laboratoire de Recherche en Informatique

Olivier Teytaud

Fonction : Auteur
PersonId : 581
IdHAL : olivier-teytaud
IdRef : 05971008X

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Hassen Doghmen

Fonction : Auteur

Machine Learning and Optimisation

Résumé

We experiment the introduction of machine learning tools to improve Monte-Carlo Tree Search. More precisely, we propose the use of Direct Policy Search, a classical reinforcement learning paradigm, to learn the Monte-Carlo Move Generator. We experiment our algorithm on di erent forms of unit commitment problems, including experiments on a problem with both macrolevel and microlevel decisions.

Mots clés

reinforcement learning stochastic planning direct policy search

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

uctWithDPS.pdf (220 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Adrien Couetoux : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00759822

Soumis le : lundi 3 décembre 2012-04:50:42

Dernière modification le : lundi 12 février 2024-09:48:04

Archivage à long terme le : lundi 4 mars 2013-03:44:37

Dates et versions

hal-00759822 , version 1 (03-12-2012)

Identifiants

HAL Id : hal-00759822 , version 1

Citer

Adrien Couetoux, Olivier Teytaud, Hassen Doghmen. Learning a Move-Generator for Upper Con dence Trees. International Computer Symposium 2012, Dec 2012, Hualien, Taiwan. ⟨hal-00759822⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EC-PARIS CNRS INRIA UMR8623 INRIA2 LRI-AO UNIV-PARIS-SACLAY

282 Consultations

289 Téléchargements

Learning a Move-Generator for Upper Con dence Trees

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager