Online Natural Gradient as a Kalman Filter

Yann Ollivier
TAU - TAckling the Underspecified
LRI - Laboratoire de Recherche en Informatique, UP11 - Université Paris-Sud - Paris 11, Inria Saclay - Ile de France, CNRS - Centre National de la Recherche Scientifique : UMR8623
Abstract: We establish a full relationship between Kalman filtering and Amari's natural gradient in statistical learning. Namely, using an online natural gradient descent on the data log-likelihood to estimate the parameter of a probabilistic model from a series of observations is exactly equivalent to using an extended Kalman filter to estimate the parameter (assumed to have constant dynamics). In the i.i.d. case, this relation is a consequence of the "information filter" phrasing of the extended Kalman filter. In the recurrent (state space, non-i.i.d.) case, we prove that the joint Kalman filter over states and parameters is a natural gradient on top of real-time recurrent learning (RTRL), a classical algorithm to train recurrent models. This exact algebraic correspondence provides relevant settings for natural gradient hyperparameters such as learning rates and the initialization and regularization of the Fisher information matrix.
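
To make the i.i.d. case stated in the abstract concrete, here is a minimal sketch of the correspondence; the symbols below (learning rate $\eta_t$, running Fisher matrix $J_t$, Kalman posterior covariance $P_t$) are illustrative shorthand, and the exact rates, decay factors and initializations are those worked out in the paper. The online natural gradient step on the log-likelihood of observation $y_t$ is

    $\theta_{t+1} = \theta_t + \eta_t \, J_t^{-1} \, \nabla_\theta \log p(y_t \mid \theta_t)$,

while the extended Kalman filter for a parameter with constant dynamics can be written, in information-filter form, as

    $\theta_{t+1} = \theta_t + P_{t+1} \, \nabla_\theta \log p(y_t \mid \theta_t)$,  with  $P_{t+1}^{-1} = P_t^{-1} + \text{(Fisher information contributed by } y_t)$.

The two updates coincide when $P_{t+1} = \eta_t \, J_t^{-1}$, i.e. when the Kalman covariance is identified with the (rescaled) inverse Fisher matrix; this identification is what determines the natural gradient learning rate and the initialization and regularization of $J_t$ mentioned in the abstract.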
Document type:
Preprint, working paper
2017

https://hal.inria.fr/hal-01660622
Contributor: Yann Ollivier
Submitted on: Monday, December 11, 2017 - 10:50:31
Last modified on: Tuesday, April 17, 2018 - 09:05:05

Identifiers

  • HAL Id : hal-01660622, version 1
  • ARXIV : 1703.00209

Citation

Yann Ollivier. Online Natural Gradient as a Kalman Filter. 2017. 〈hal-01660622〉
