Optimal Policies Search for Sensor Management

Thomas Bréhard; Emmanuel Duflos; Philippe Vanheeghe; Pierre-Arnaud Coquelin

Communication Dans Un Congrès Année : 2008

Optimal Policies Search for Sensor Management

(1) , (1, 2) , (1, 2) , (1)

1
2

Thomas Bréhard

Fonction : Auteur

Sequential Learning

Emmanuel Duflos

Fonction : Auteur
PersonId : 844358

Sequential Learning

LAGIS-SI

Philippe Vanheeghe

Fonction : Auteur

Sequential Learning

LAGIS-SI

Pierre-Arnaud Coquelin

Fonction : Auteur
PersonId : 844357

Sequential Learning

Résumé

This paper introduces a new approach to solve sensor management problems. Classically sensor management problems can be well formalized as Partially-Observed Markov Decision Processes (POMPD). The original approach developped here consists in deriving the optimal parameterized policy based on a stochastic gradient estimation. We assume in this work that it is possible to learn the optimal policy off-line (in simulation ) using models of the environement and of the sensor(s). The learned policy can then be used to manage the sensor(s). In order to approximate the gradient in a stochastic context, we introduce a new method to approximate the gradient, based on Infinitesimal Perturbation Approximation (IPA). The effectiveness of this general framework is illustrated by the managing of an Electronically Scanned Array Radar. First simulations results are finally proposed.

Mots clés

Sensor(s) Management Partially Observable Markov Decision Process Stochastic Gradient Estimation AESA Radar

Domaines

Traitement du signal et de l'image [eess.SP] Apprentissage [cs.LG] Applications [stat.AP] Traitement du signal et de l'image [eess.SP]

Fichier principal

Fusion2008_SensorManagement_EDuflos.pdf (131.95 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Emmanuel Duflos : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00368875

Soumis le : jeudi 19 mars 2009-14:37:16

Dernière modification le : jeudi 15 février 2024-03:30:54

Archivage à long terme le : mardi 8 juin 2010-21:37:41

Dates et versions

inria-00368875 , version 1 (19-03-2009)

Identifiants

HAL Id : inria-00368875 , version 1
ARXIV : 0903.3329

Citer

Thomas Bréhard, Emmanuel Duflos, Philippe Vanheeghe, Pierre-Arnaud Coquelin. Optimal Policies Search for Sensor Management. FUSION 2008, Jun 2008, Cologne, Germany. pp.1 - 8. ⟨inria-00368875⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 UNIV-LILLE3 CNRS INRIA IRISA LAGIS LAGIS-SI INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

193 Consultations

102 Téléchargements

Optimal Policies Search for Sensor Management

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager