Adding Double Progressive Widening to Upper Confidence Trees to Cope with Uncertainty in Planning Problems

Adrien Couetoux; Hassen Doghmen

Conference paper, Year: 2011


Adrien Couetoux

Role: Author
PersonId: 910214

Laboratoire de Recherche en Informatique

Hassen Doghmen

Role: Author

Machine Learning and Optimisation

Abstract

Current state-of-the-art methods in energy policy planning only approximate the problem (Linear Programming on a finite sample of scenarios, Dynamic Programming on an approximation of the problem, etc.). Monte-Carlo Tree Search (MCTS [3]) appears to be a candidate for converging to an exact solution of these problems ([2]). However, how fast it converges, and how key parameters (double/simple progressive widening) influence the rate of convergence (or even convergence itself), remain open questions. Moreover, MCTS completely ignores the features of the problem, including the scale of the objective function. In this paper, we present MCTS and its extension to continuous/stochastic domains. We show that on problems with continuous action spaces and infinite support of random variables, the "vanilla" version of MCTS fails. We also show how the success of the double progressive widening technique [2] relies on its widening coefficient. Finally, we study the impact of an unknown variance of the random variables, to see whether it affects the optimal choice of the widening coefficients.
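
To make the role of the widening coefficient concrete, here is a minimal sketch of a progressive widening test. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not give the widening rule, so the cap `ceil(coeff * n_visits ** alpha)` is an assumed, commonly used form, and the names `Node`, `should_widen`, `visit`, `coeff` and `alpha` are hypothetical. "Double" here means the same test is applied both at decision nodes (to limit sampled actions) and at chance nodes (to limit sampled random outcomes).

```python
import math
import random


def allowed_children(n_visits, coeff, alpha):
    """Cap on the number of children after n_visits (assumed rule, not from the paper)."""
    return math.ceil(coeff * n_visits ** alpha)


def should_widen(n_visits, n_children, coeff=1.0, alpha=0.5):
    """True if a new child may be sampled, False if an existing one must be reused."""
    return n_children < allowed_children(n_visits, coeff, alpha)


class Node:
    """Hypothetical tree node: a decision node (children = sampled actions)
    or a chance node (children = sampled random outcomes)."""

    def __init__(self):
        self.visits = 0
        self.children = []


def visit(node, sample_new, pick_existing, coeff=1.0, alpha=0.5):
    """One visit: widen the node if the cap allows it, otherwise reuse a child."""
    node.visits += 1
    if should_widen(node.visits, len(node.children), coeff, alpha):
        child = sample_new()            # draw a new action or random outcome
        node.children.append(child)
        return child
    return pick_existing(node.children)  # e.g. a UCB choice in a full tree search


if __name__ == "__main__":
    random.seed(0)
    for alpha in (0.25, 0.5, 1.0):
        node = Node()
        for _ in range(100):
            visit(node, random.random, random.choice, coeff=1.0, alpha=alpha)
        print(alpha, len(node.children))
```

In this sketch, `alpha = 1.0` with `coeff = 1.0` creates a new child on every visit, so no sampled action or outcome is ever revisited, while smaller values of `alpha` force existing children to be reused and their statistics to accumulate.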

Domains

Artificial Intelligence [cs.AI]

Main file

ewrl2011_submission_29.pdf (142.25 KB)

Origin: Files produced by the author(s)


https://inria.hal.science/hal-00745207

Submitted on: Thursday, October 25, 2012, 06:42:26

Last modified on: Monday, February 12, 2024, 09:48:04

Long-term archiving on: Saturday, January 26, 2013, 03:05:09

Dates and versions

hal-00745207, version 1 (25-10-2012)

Identifiers

HAL Id: hal-00745207, version 1

Cite

Adrien Couetoux, Hassen Doghmen. Adding Double Progressive Widening to Upper Confidence Trees to Cope with Uncertainty in Planning Problems. The 9th European Workshop on Reinforcement Learning (EWRL-9), Sep 2011, Athens, Greece. ⟨hal-00745207⟩


Collections

EC-PARIS CNRS INRIA GRID5000 UMR8623 INRIA2 LRI-AO UNIV-PARIS-SACLAY SILECS

259 Views

335 Downloads
