Adding Double Progressive Widening to Upper Confidence Trees to Cope with Uncertainty in Planning Problems - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2011

Adding Double Progressive Widening to Upper Confidence Trees to Cope with Uncertainty in Planning Problems

Résumé

Current state of the art methods in energy policy planning only approximate the problem (Linear Programming on a finite sample of scenarios, Dynamic Programming on an approximation of the problem, etc). Monte-Carlo Tree Search (MCTS [3]) seems to be a potential candidate to converge to an exact solution of these problems ([2]). But how fast, and how do key parameters (double/simple progressive widening) influence the rate of convergence (or even the convergence itself), are still open questions. Also, MCTS completely ignores the features of the problem, including the scale of the objective function. In this paper, we present MCTS, and its extension to continuous/stochastic domains. We show that on problems with continuous action spaces and infinite support of random variables, the "vanilla" version of MCTS fails. We also show how the double progressive widening technique success[2] relies on its widening coefficient. We also study the impact of an unknown variance of the random variables, to see if it affects the optimal choice of the widening coefficients.
Fichier principal
Vignette du fichier
ewrl2011_submission_29.pdf (142.25 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00745207 , version 1 (25-10-2012)

Identifiants

  • HAL Id : hal-00745207 , version 1

Citer

Adrien Couetoux, Hassen Doghmen. Adding Double Progressive Widening to Upper Confidence Trees to Cope with Uncertainty in Planning Problems. The 9th European Workshop on Reinforcement Learning (EWRL-9), Sep 2011, Athens, Greece. ⟨hal-00745207⟩
259 Consultations
335 Téléchargements

Partager

Gmail Facebook X LinkedIn More