Non-asymptotic Analysis of Biased Stochastic Approximation Scheme

Belhal Karimi; Blazej Miasojedow; Éric Moulines; Hoi-To Wai

Communication Dans Un Congrès Année : 2019

Non-asymptotic Analysis of Biased Stochastic Approximation Scheme

(1, 2) , (3) , (1, 2) , (4)

1
2
3
4

Belhal Karimi

Fonction : Auteur

Centre de Mathématiques Appliquées - Ecole Polytechnique

Modélisation en pharmacologie de population

Blazej Miasojedow

Fonction : Auteur
PersonId : 1047262

Faculty of Mathematics, Informatics, and Mechanics [Warsaw]

Éric Moulines

Fonction : Auteur
PersonId : 1350242
ORCID : 0000-0002-2058-0693
IdRef : 076452476

Centre de Mathématiques Appliquées - Ecole Polytechnique

Modélisation en pharmacologie de population

Hoi-To Wai

Fonction : Auteur
PersonId : 1047263

Department of Systems Engineering and Engineering Management

Résumé

Stochastic approximation (SA) is a key method used in statistical learning. Recently, its non-asymptotic convergence analysis has been considered in many papers. However, most of the prior analyses are made under restrictive assumptions such as unbiased gradient estimates and convex objective function, which significantly limit their applications to sophisticated tasks such as online and reinforcement learning. These restrictions are all essentially relaxed in this work. In particular, we analyze a general SA scheme to minimize a non-convex, smooth objective function. We consider update procedure whose drift term depends on a state-dependent Markov chain and the mean field is not necessarily of gradient type, covering approximate second-order method and allowing asymptotic bias for the one-step updates. We illustrate these settings with the online EM algorithm and the policy-gradient method for average reward maximization in reinforcement learning.

Mots clés

Online expectation-maximization Policy gradient Non-convex optimization State-dependent Markov chain Biased stochastic approximation

Domaines

Machine Learning [stat.ML]

Fichier principal

colt_Revised.pdf (455.74 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Belhal Karimi : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-02127750

Soumis le : lundi 13 mai 2019-16:49:06

Dernière modification le : vendredi 1 mars 2024-13:24:17

Dates et versions

hal-02127750 , version 1 (13-05-2019)

Identifiants

HAL Id : hal-02127750 , version 1

Citer

Belhal Karimi, Blazej Miasojedow, Éric Moulines, Hoi-To Wai. Non-asymptotic Analysis of Biased Stochastic Approximation Scheme. COLT 2019 - 32nd Annual Conference on Conference on Learning Theory, Jun 2019, Phoenix, United States. pp.1 - 33. ⟨hal-02127750⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

X CNRS INRIA X-CMAP X-DEP-MATHA CMAP INRIA2 UNIV-PARIS-SACLAY IP_PARIS GS-COMPUTER-SCIENCE

149 Consultations

135 Téléchargements

Non-asymptotic Analysis of Biased Stochastic Approximation Scheme

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager