Q-Learning with Double Progressive Widening : Application to Robotics - Inria - Institut national de recherche en sciences et technologies du numérique Access content directly
Conference Papers Year : 2011

Q-Learning with Double Progressive Widening : Application to Robotics

Abstract

Discretization of state and action spaces is a critical issue in $Q$-Learning. In our contribution, we propose a real-time adaptation of the discretization by the progressive widening technique which has been already used in bandit-based methods. Results are consistently converging to the optimum of the problem, without changing the parametrization for each new problem.
Fichier principal
Vignette du fichier
ICONIP-0854.pdf (202.31 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-00624832 , version 1 (20-09-2011)

Identifiers

  • HAL Id : hal-00624832 , version 1

Cite

Nataliya Sokolovska, Olivier Teytaud, Mario Milone. Q-Learning with Double Progressive Widening : Application to Robotics. ICONIP 2011, Nov 2011, China. pp.103-112. ⟨hal-00624832⟩
329 View
263 Download

Share

Gmail Facebook X LinkedIn More