Conference paper. Year: 2011

Sensor Scheduling for Hunting Elusive Hiding Targets via Whittle's Restless Bandit Index Policy

José Nino-Mora
  • Role: Author
  • PersonId : 914488
Sofia S. Villar
  • Role: Author
  • PersonId : 914489

Abstract

We consider a sensor scheduling model where a set of identical sensors is used to hunt a larger set of heterogeneous targets, each of which is located at a corresponding site. Target states change randomly over discrete time slots between "exposed" and "hidden," according to Markovian transition probabilities that depend on whether sites are searched or not, so as to make the targets elusive. Sensors are imperfect: a search of a site fails to detect an exposed target with a positive misdetection probability. We formulate the problem of scheduling the sensors to search the sites as a partially observable Markov decision process, with the objective of maximizing the expected total discounted value of rewards earned (when targets are hunted) minus search costs incurred. Given the intractability of finding an optimal policy, we introduce a tractable heuristic search policy of priority-index type based on the Whittle index for restless bandits. Preliminary computational results are reported showing that such a policy is nearly optimal and can substantially outperform the myopic policy and other simple heuristics.
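
The model described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: all names (`Site`, `belief_after_miss`, `propagate`, `myopic_index`, `pick_sites`) and the exact timing of observation and transition are assumptions made for illustration. The sketch tracks, for each site, the belief that its target is exposed, and ranks sites by a simple myopic index (expected one-slot reward minus search cost). The Whittle index used in the paper is not reproduced here; the myopic index only stands in to show how a priority-index policy assigns sensors.

```python
from dataclasses import dataclass


@dataclass
class Site:
    """Hypothetical per-site parameters (field names are illustrative)."""
    p_expose_searched: float  # P(hidden -> exposed) when the site is searched
    p_expose_idle: float      # P(hidden -> exposed) when it is not searched
    p_hide_searched: float    # P(exposed -> hidden) when the site is searched
    p_hide_idle: float        # P(exposed -> hidden) when it is not searched
    misdetect: float          # P(sensor misses an exposed target), assumed > 0
    reward: float             # reward earned when the target is hunted
    cost: float               # cost of searching the site for one slot


def belief_after_miss(b: float, site: Site) -> float:
    """Bayes update of P(exposed) after a search that detected nothing."""
    return b * site.misdetect / (b * site.misdetect + (1.0 - b))


def propagate(b: float, site: Site, searched: bool) -> float:
    """One-slot Markov transition of P(exposed), depending on the action."""
    p_hide = site.p_hide_searched if searched else site.p_hide_idle
    p_expose = site.p_expose_searched if searched else site.p_expose_idle
    return b * (1.0 - p_hide) + (1.0 - b) * p_expose


def myopic_index(b: float, site: Site) -> float:
    """Expected one-slot net reward of searching (stand-in, not Whittle's index)."""
    return b * (1.0 - site.misdetect) * site.reward - site.cost


def pick_sites(beliefs: list[float], sites: list[Site], n_sensors: int) -> list[int]:
    """Priority-index rule: search the n_sensors sites with the largest index."""
    scores = [myopic_index(b, s) for b, s in zip(beliefs, sites)]
    ranked = sorted(range(len(sites)), key=lambda i: scores[i], reverse=True)
    return ranked[:n_sensors]
```

A priority-index policy of the kind the abstract describes would keep the same selection step in `pick_sites` and replace `myopic_index` with the index of interest, evaluated at each site's current belief.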
Main file
29-NinoMora.pdf (191.22 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-00644138, version 1 (23-11-2011)

Identifiers

  • HAL Id: hal-00644138, version 1

Cite

José Nino-Mora, Sofia S. Villar. Sensor Scheduling for Hunting Elusive Hiding Targets via Whittle's Restless Bandit Index Policy. NetGCOOP 2011: International conference on NETwork Games, COntrol and OPtimization, Telecom SudParis and Université Paris Descartes, Oct 2011, Paris, France. ⟨hal-00644138⟩

Collections

NETGCOOP2011
77 views
300 downloads
