Optimal Policies Search for Sensor Management

Thomas Bréhard 1; Emmanuel Duflos 1, 2; Philippe Vanheeghe 1, 2; Pierre-Arnaud Coquelin 1
1 SEQUEL - Sequential Learning (LIFL - Laboratoire d'Informatique Fondamentale de Lille, Inria Lille - Nord Europe, LAGIS - Laboratoire d'Automatique, Génie Informatique et Signal)
2 LAGIS - Laboratoire d'Automatique, Génie Informatique et Signal
Abstract: This paper introduces a new approach to solving sensor management problems. Classically, sensor management problems can be formalized as Partially Observed Markov Decision Processes (POMDP). The original approach developed here consists in deriving the optimal parameterized policy based on a stochastic gradient estimation. We assume in this work that it is possible to learn the optimal policy off-line (in simulation) using models of the environment and of the sensor(s). The learned policy can then be used to manage the sensor(s). To approximate the gradient in a stochastic context, we introduce a new method based on Infinitesimal Perturbation Approximation (IPA). The effectiveness of this general framework is illustrated by the management of an Electronically Scanned Array Radar. Initial simulation results are presented.
Document type:
Conference paper
FUSION 2008, Jun 2008, Cologne, Germany. pp.1 - 8, 2008

Contributor: Emmanuel Duflos
Submitted on: Thursday, 19 March 2009 - 14:37:16
Last modified on: Thursday, 11 January 2018 - 06:26:40
Document(s) archived on: Tuesday, 8 June 2010 - 21:37:41


Files produced by the author(s)


  • HAL Id : inria-00368875, version 1
  • ARXIV : 0903.3329



Thomas Bréhard, Emmanuel Duflos, Philippe Vanheeghe, Pierre-Arnaud Coquelin. Optimal Policies Search for Sensor Management. FUSION 2008, Jun 2008, Cologne, Germany. pp.1 - 8, 2008. 〈inria-00368875〉


