Skip to Main content Skip to Navigation
Conference papers

A Structure of Restricted Boltzmann Machine for Modeling System Dynamics

Guillaume Padiolleau 1 Olivier Bach 2 Alain Hugget 2 Denis Penninckx 2 Frédéric Alexandre 1
1 Mnemosyne - Mnemonic Synergy
LaBRI - Laboratoire Bordelais de Recherche en Informatique, Inria Bordeaux - Sud-Ouest, IMN - Institut des Maladies Neurodégénératives [Bordeaux]
Abstract : This paper presents a new approach for learning transition function in state representation learning (SRL) for control. While state-of-the-art methods use different deterministic neural networks to learn forward and inverse state transition functions independently with auto-supervised learning, we introduce a bidirectional stochastic model to learn both transition functions. We aim at using the uncertainty of the model on its predictions as an intrinsic motivation for exploration to enhance the representation learning. More, using the same model to learn both transition functions allows sharing the parameters, which can reduce their number and should increase the embedding quality of the representation. We use a factored restricted Boltzmann machine (fRBM) based model, enhanced with dedicated structure for learning system dynamics and transitions with shared parameters. The presented work focuses on building the structure of the bidirectional transition model for unsupervised learning. Our fRBM structure is directly inspired from physics interactions between inputs and outputs in reinforcement learning framework. We compare different training algorithms for learning the model that must be able to predict observable random variables to be used in SRL framework. Our structure is not restricted to any type of observable, nevertheless in this paper we focus on learning dynamics from the OpenAI Gym environment Swinging Pendulum. We show that the proposed structure is able to learn bidirectional transition function and performs well in prediction task.
Document type :
Conference papers
Complete list of metadata

Cited literature [27 references]  Display  Hide  Download

https://hal.inria.fr/hal-02925519
Contributor : Frédéric Alexandre <>
Submitted on : Sunday, August 30, 2020 - 9:56:18 AM
Last modification on : Tuesday, September 1, 2020 - 3:30:33 AM
Long-term archiving on: : Tuesday, December 1, 2020 - 8:37:06 AM

File

Final_A_Structure_of_RBM_for_M...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02925519, version 1

Collections

Citation

Guillaume Padiolleau, Olivier Bach, Alain Hugget, Denis Penninckx, Frédéric Alexandre. A Structure of Restricted Boltzmann Machine for Modeling System Dynamics. IJCNN 2020 - International Joint Conference on Neural Networks, IEEE, Jul 2020, Glasgow, United Kingdom. pp.8. ⟨hal-02925519⟩

Share

Metrics

Record views

52

Files downloads

179