Parallel asynchronous distributed computations of optimal control in large state space Markov Decision Processes
Abstract
This paper emphasizes the link between parallel asynchronous distributed computations (PADC) and Markov Decision Processes (MDPs), a powerful generic model for computing optimal control. We review results showing that MDPs with reasonably small state spaces can be solved with PADC. We then propose a method for extending these results to large state spaces. This shows that difficult optimal control problems admit natural neural network-like solutions, and it suggests a general methodology for constructing neural networks.
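To make the connection concrete, the standard way to solve a small-state-space MDP with asynchronous computation is asynchronous (Gauss-Seidel-style) value iteration: each processor repeatedly backs up the value of some state in place, immediately reusing the freshest values written by the others. The sketch below simulates this on a toy 4-state, 2-action MDP; the transition and reward tables are illustrative assumptions, not taken from the paper.

```python
import random

# Toy MDP (illustrative only): P[s][a] lists (next_state, probability),
# R[s][a] is the immediate reward for taking action a in state s.
P = {
    0: {0: [(1, 1.0)], 1: [(2, 1.0)]},
    1: {0: [(3, 1.0)], 1: [(0, 1.0)]},
    2: {0: [(3, 1.0)], 1: [(0, 1.0)]},
    3: {0: [(3, 1.0)], 1: [(3, 1.0)]},  # absorbing terminal state
}
R = {
    0: {0: 0.0, 1: 0.0},
    1: {0: 1.0, 1: 0.0},
    2: {0: 2.0, 1: 0.0},
    3: {0: 0.0, 1: 0.0},
}
GAMMA = 0.9  # discount factor

def async_value_iteration(updates=200, seed=0):
    """Asynchronous value iteration: back up one state at a time,
    in an arbitrary order, reusing the latest values in place."""
    rng = random.Random(seed)
    V = {s: 0.0 for s in P}
    states = list(P)
    for _ in range(updates):
        s = rng.choice(states)  # an arbitrary (here random) update order
        # Bellman backup for state s using the current values of others.
        V[s] = max(
            R[s][a] + GAMMA * sum(p * V[s2] for s2, p in P[s][a])
            for a in P[s]
        )
    return V

V = async_value_iteration()
```

Because the Bellman operator is a contraction, the values converge to the same fixed point regardless of the (sufficiently fair) update order, which is exactly the property that makes the computation parallelizable without synchronization barriers.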