Abstract : This report introduces a new approach to solve sensor management problems. Classically sensor management problems are formalized as Partially-Observed Markov Decision Process (POMPD). Our original approach consists in deriving the optimal parameterized policy based on stochastic gradient estimation. Two differents techniques nammed Infinitesimal Approximation (IPA) and Likelihood Ratio (LR) can be used to adress such a problem. This report discusses how these methods can be used for gradient estimation in the context of sensor management . The effectiveness of this general framework is illustrated by the managing of an Active Electronically Scanned Array Radar (AESA Radar).
https://hal.inria.fr/inria-00188292 Contributor : Rapport de Recherche InriaConnect in order to contact the contributor Submitted on : Monday, November 19, 2007 - 10:52:43 AM Last modification on : Thursday, January 20, 2022 - 4:12:31 PM Long-term archiving on: : Tuesday, September 21, 2010 - 2:38:02 PM