Q-Learning with Double Progressive Widening : Application to Robotics

Nataliya Sokolovska; Olivier Teytaud; Mario Milone

Conference Papers Year : 2011

Q-Learning with Double Progressive Widening : Application to Robotics

(1, 2) , (1, 2) , (1)

1
2

Nataliya Sokolovska

Function : Author
PersonId : 879120

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Olivier Teytaud

Function : Author
PersonId : 581
IdHAL : olivier-teytaud
IdRef : 05971008X

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Mario Milone

Function : Author

Laboratoire de Recherche en Informatique

Abstract

Discretization of state and action spaces is a critical issue in $Q$-Learning. In our contribution, we propose a real-time adaptation of the discretization by the progressive widening technique which has been already used in bandit-based methods. Results are consistently converging to the optimum of the problem, without changing the parametrization for each new problem.

Domains

Machine Learning [cs.LG]

Fichier principal

ICONIP-0854.pdf (202.31 Ko)

Origin : Files produced by the author(s)

Nataliya Sokolovska : Connect in order to contact the contributor

https://hal.science/hal-00624832

Submitted on : Tuesday, September 20, 2011-3:23:55 AM

Last modification on : Monday, February 12, 2024-9:48:04 AM

Long-term archiving on: Tuesday, November 13, 2012-2:00:51 PM

Dates and versions

hal-00624832 , version 1 (20-09-2011)

Identifiers

HAL Id : hal-00624832 , version 1

Cite

Nataliya Sokolovska, Olivier Teytaud, Mario Milone. Q-Learning with Double Progressive Widening : Application to Robotics. ICONIP 2011, Nov 2011, China. pp.103-112. ⟨hal-00624832⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EC-PARIS CNRS INRIA UMR8623 INRIA2 LRI-AO UNIV-PARIS-SACLAY

329 View

263 Download

Q-Learning with Double Progressive Widening : Application to Robotics

Abstract

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Share