Taylor expansion of discount factors

Yunhao Tang; Mark Rowland; Rémi Munos; Michal Valko

Conference paper, Year: 2021


Yunhao Tang

Role: Author

Columbia University [New York]

Mark Rowland

Role: Author

DeepMind [London]

Rémi Munos

Role: Author

DeepMind [Paris]

Michal Valko

Role: Author
PersonId: 284
IdHAL: michal
IdRef: 22360934X

DeepMind [Paris]

Abstract

In practical reinforcement learning (RL), the discount factor used for estimating value functions often differs from the one used for defining the evaluation objective. In this work, we study the effect that this discrepancy between discount factors has during learning, and discover a family of objectives that interpolate between the value functions of two distinct discount factors. Our analysis suggests new ways of estimating value functions and performing policy optimization updates, which demonstrate empirical performance gains. This framework also leads to new insights into commonly used deep RL heuristic modifications to policy optimization algorithms.
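
As an illustration of how value functions at two discount factors can be connected, the sketch below uses the identity V_γ' = Σ_{k≥0} ((γ' − γ)(I − γP)^{-1} P)^k V_γ. It follows from factoring I − γ'P = (I − γP)(I − (γ' − γ)(I − γP)^{-1} P) and holds for a fixed policy with transition matrix P when γ < γ' < 1. Truncating the sum at order K gives a quantity that equals V_γ at K = 0 and approaches V_γ' as K grows. This is a minimal sketch and not necessarily the exact construction used in the paper; the two-state chain, the rewards, and the helper names (mat_vec, resolvent, taylor_value) are assumptions made for illustration.

```python
# Hypothetical pure-Python sketch: K-th order expansion of V_{gamma'} around V_{gamma}.

def mat_vec(P, v):
    """Multiply a row-stochastic matrix P (list of rows) by a vector v."""
    return [sum(p * x for p, x in zip(row, v)) for row in P]

def resolvent(P, x, gamma, iters=2000):
    """Approximate (I - gamma P)^{-1} x by iterating v <- x + gamma P v."""
    v = list(x)
    for _ in range(iters):
        Pv = mat_vec(P, v)
        v = [xi + gamma * pvi for xi, pvi in zip(x, Pv)]
    return v

def taylor_value(P, r, gamma, gamma_prime, order):
    """Sum of the first order+1 terms of the series for V_{gamma'}."""
    v_gamma = resolvent(P, r, gamma)      # zeroth-order term: V_gamma
    term = v_gamma
    total = list(v_gamma)
    for _ in range(order):
        # Next term: (gamma' - gamma) (I - gamma P)^{-1} P applied to the previous term.
        shifted = resolvent(P, mat_vec(P, term), gamma)
        term = [(gamma_prime - gamma) * y for y in shifted]
        total = [a + b for a, b in zip(total, term)]
    return total

# Assumed toy problem: a two-state Markov chain under a fixed policy.
P = [[0.9, 0.1], [0.2, 0.8]]
r = [1.0, 0.0]
gamma, gamma_prime = 0.9, 0.95

v_low = resolvent(P, r, gamma)            # V_gamma
v_high = resolvent(P, r, gamma_prime)     # V_gamma'

# Maximum absolute gap to V_gamma' for a few truncation orders.
errors = [
    max(abs(a - b) for a, b in zip(taylor_value(P, r, gamma, gamma_prime, K), v_high))
    for K in (0, 1, 5, 40)
]
print(errors)
```

In this toy setting the gap to V_γ' shrinks as the order K increases, since each further term is scaled by a factor of at most (γ' − γ)/(1 − γ) = 0.5.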

Domains

Machine Learning [stat.ML]

Main file

tang2021taylor.pdf (2.45 MB)

Origin: Files produced by the author(s)

Contributor: Michal Valko

https://inria.hal.science/hal-03289295

Submitted on: Friday, July 16, 2021, 17:55:16

Last modified on: Tuesday, February 15, 2022, 11:02:04

Long-term archiving on: Sunday, October 17, 2021, 19:47:28

Dates and versions

hal-03289295, version 1 (16-07-2021)

Identifiers

HAL Id: hal-03289295, version 1

Cite

Yunhao Tang, Mark Rowland, Rémi Munos, Michal Valko. Taylor expansion of discount factors. International Conference on Machine Learning, Jul 2021, Vienna / Virtual, Austria. ⟨hal-03289295⟩


