Improving the exploration in Upper Confidence Trees

Adrien Couetoux; Hassen Doghmen; Olivier Teytaud

Communication Dans Un Congrès Année : 2012

Improving the exploration in Upper Confidence Trees

(1) , (2) , (1, 2, 3)

1
2
3

Adrien Couetoux

Fonction : Auteur
PersonId : 910214

Laboratoire de Recherche en Informatique

Hassen Doghmen

Fonction : Auteur

Machine Learning and Optimisation

Olivier Teytaud

Fonction : Auteur
PersonId : 581
IdHAL : olivier-teytaud
IdRef : 05971008X

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Department of Electrical Engineering and Computer Science

Résumé

In the standard version of the UCT algorithm, in the case of a continuous set of decisions, the exploration of new decisions is done through blind search. This can lead to very inefficient exploration, par- ticularly in the case of large dimension problems, which often happens in energy management problems, for instance. In an attempt to use the information gathered through past simulations to better explore new de- cisions, we propose a method named Blind Value (BV). It only requires the access to a function that randomly draws feasible decisions. We also implement it and compare it to the original version of continuous UCT. Our results show that it gives a significant increase in convergence speed, in dimensions 12 and 80.

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

BV.pdf (105.12 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Adrien Couetoux : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00745208

Soumis le : jeudi 25 octobre 2012-06:46:39

Dernière modification le : lundi 12 février 2024-09:48:04

Archivage à long terme le : samedi 26 janvier 2013-03:37:46

Dates et versions

hal-00745208 , version 1 (25-10-2012)

Identifiants

HAL Id : hal-00745208 , version 1

Citer

Adrien Couetoux, Hassen Doghmen, Olivier Teytaud. Improving the exploration in Upper Confidence Trees. Learning and Intelligent OptimizatioN Conference LION 6, Jan 2012, Paris, France. ⟨hal-00745208⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EC-PARIS CNRS INRIA GRID5000 UMR8623 INRIA2 LRI-AO UNIV-PARIS-SACLAY SILECS

182 Consultations

355 Téléchargements

Improving the exploration in Upper Confidence Trees

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager