Using “Social actions” and RL-algorithms to build policies in DEC-POMDP - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue IADIS International Journal on Computer Science and Information Systems Année : 2009

Using “Social actions” and RL-algorithms to build policies in DEC-POMDP

Vincent Thomas

Résumé

Building individual behaviors to solve collective problems is a major stake whose applications are found in several domains. To do so, Dec-POMDP has been proposed as a formalism for describing multi-agent problems. However, solving a Dec-POMDP turned out to be a NEXP problem. In this study, we introduced the original concept of social action to get round the inherent complexity of Dec-POMDP and we proposed three decentralized reinforcement learning algorithms which approximate the optimal policy in Dec-POMDP. This article analyses the results obtained and argues that this new approach seems promising for automatic top-down collective behavior computation.
Fichier non déposé

Dates et versions

inria-00536851 , version 1 (17-11-2010)

Identifiants

  • HAL Id : inria-00536851 , version 1
  • PRODINRA : 246986

Citer

Vincent Thomas, Mahuna Akplogan. Using “Social actions” and RL-algorithms to build policies in DEC-POMDP. IADIS International Journal on Computer Science and Information Systems, 2009, 4 (3), pp.82-98. ⟨inria-00536851⟩
160 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More