467 articles – 709 references  [version française]

inria-00535559, version 1

A Closer Look at MOMDPs

Mauricio Araya-López () a1, Vincent Thomas () b1, Olivier Buffet () a1, François Charpillet () a1

22nd International Conference on Tools with Artificial Intelligence - ICTAI 2010 (2010)

Abstract: The difficulties encountered in sequential decision-making problems under uncertainty are often linked to the large size of the state space. Exploiting the structure of the problem, for example by employing a factored representation, is usually an efficient approach but, in the case of partially observable Markov decision processes, the fact that some state variables may be visible has not been sufficiently appreciated. In this article, we present a complementary analysis and discussion about MOMDPs, a formalism that exploits the fact that the state space may be factored in one visible part and one hidden part. Starting from a POMDP description, we dig into the structure of the belief update, value function, and the consequences in value iteration, specifically how classical algorithms can be adapted to this factorization, and demonstrate the resulting benefits through an empirical evaluation.

  • a –  INRIA
  • b –  Université Nancy II
  • 1:  MAIA (INRIA Lorraine - LORIA)
  • INRIA – CNRS : UMR7503 – Université Henri Poincaré - Nancy I – Université Nancy II – Institut National Polytechnique de Lorraine (INPL)
  • Domain : Computer Science/Artificial Intelligence
  • Keywords : Partially Observable Markov Decision Processes – Mixed Observability Markov Decision Processes – POMDP – MOMDP
 
  • inria-00535559, version 1
  • oai:hal.inria.fr:inria-00535559
  • From: 
  • Submitted on: Thursday, 11 November 2010 20:16:21
  • Updated on: Monday, 15 November 2010 19:06:11