Fundamental Design Principles for Reinforcement Learning Algorithms

Adithya Devraj; Ana Bušić; Sean Meyn

doi:10.1007/978-3-030-60990-0_4

Chapitre D'ouvrage Année : 2021

Fundamental Design Principles for Reinforcement Learning Algorithms

(1) , (2, 3) , (1)

1
2
3

Adithya Devraj

Fonction : Auteur

Department of Electrical and Computer Engineering [Gainesville]

Ana Bušić

Fonction : Auteur
PersonId : 2602
IdHAL : anabusic
ORCID : 0000-0002-4133-3739
IdRef : 144488175

Dynamics of Geometric Networks

Laboratory of Information, Network and Communication Sciences

Sean Meyn

Fonction : Auteur

Department of Electrical and Computer Engineering [Gainesville]

Résumé

Along with the sharp increase in visibility of the field, the rate at which new reinforcement learning algorithms are being proposed is at a new peak. While the surge in activity is creating excitement and opportunities, there is a gap in understanding of two basic principles that these algorithms need to satisfy for any successful application. One has to do with guarantees for convergence, and the other concerns the convergence rate. The vast majority of reinforcement learning algorithms belong to a class of learning algorithms known as stochastic approximation (SA). The objective here is to review the foundations of reinforcement learning algorithm design based on recent and ancient results from SA. In particular, it was established in [Borkar and Meyn, 2000] that both stability and convergence of these algorithms are guaranteed by analyzing the stability of two associated ODEs. Moreover, if the linearized ODE passes a simple eigenvalue test, then an optimal rate of convergence is guaranteed. This chapter contains a survey of these concepts, along with a survey of the new class of Zap reinforcement learning algorithms introduced by the authors. These algorithms can achieve convergence almost universally, while also guaranteeing optimal rate of convergence.

Domaines

Optimisation et contrôle [math.OC] Apprentissage [cs.LG]

Ana Busic : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03541756

Soumis le : lundi 24 janvier 2022-19:55:33

Dernière modification le : vendredi 19 avril 2024-16:18:58

Dates et versions

hal-03541756 , version 1 (24-01-2022)

Identifiants

HAL Id : hal-03541756 , version 1
DOI : 10.1007/978-3-030-60990-0_4

Citer

Adithya Devraj, Ana Bušić, Sean Meyn. Fundamental Design Principles for Reinforcement Learning Algorithms. Kyriakos G. Vamvoudakis; Yan Wan; Frank L. Lewis; Derya Cansever. Handbook of Reinforcement Learning and Control, 325, Springer International Publishing, pp.75-137, 2021, Studies in Systems, Decision and Control, ⟨10.1007/978-3-030-60990-0_4⟩. ⟨hal-03541756⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM ENS-PARIS CNRS INRIA INRIA2 TDS-MACS PSL SORBONNE-UNIVERSITE ANR

99 Consultations

0 Téléchargements

Fundamental Design Principles for Reinforcement Learning Algorithms

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager