Sensor Scheduling for Hunting Elusive Hiding Targets via Whittle's Restless Bandit Index Policy

Abstract : We consider a sensor scheduling model where a set of identical sensors are used to hunt a larger set of heterogeneous targets, each of which is located at a corresponding site. Target states change randomly over discrete time slots between "exposed" and 'hidden," according to Markovian transition probabilities that depend on whether sites are searched or not, so as to make the targets elusive. Sensors are imperfect, failing to detect an exposed target when searching its site with a positive misdetection probability. We formulate as a partially observable Markov decision process the problem of scheduling the sensors to search the sites so as to maximize the expected total discounted value of rewards earned (when targets are hunted) minus search costs incurred. Given the intractability of finding an optimal policy, we introduce a tractable heuristic search policy of priorityindex type based on the Whittle index for restless bandits. Preliminary computational results are reported showing that such a policy is nearly optimal and can substantially outperform the myopic policy and other simple heuristics.
Type de document :
Communication dans un congrès
Roberto Cominetti and Sylvain Sorin and Bruno Tuffin. NetGCOOP 2011 : International conference on NETwork Games, COntrol and OPtimization, Oct 2011, Paris, France. IEEE, 2011
Liste complète des métadonnées

Littérature citée [12 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00644138
Contributeur : Service Ist Inria Sophia Antipolis-Méditerranée / I3s <>
Soumis le : mercredi 23 novembre 2011 - 16:29:19
Dernière modification le : mercredi 23 novembre 2011 - 17:22:52
Document(s) archivé(s) le : lundi 5 décembre 2016 - 10:14:24

Fichier

29-NinoMora.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00644138, version 1

Collections

Citation

José Nino-Mora, Sofia Villar. Sensor Scheduling for Hunting Elusive Hiding Targets via Whittle's Restless Bandit Index Policy. Roberto Cominetti and Sylvain Sorin and Bruno Tuffin. NetGCOOP 2011 : International conference on NETwork Games, COntrol and OPtimization, Oct 2011, Paris, France. IEEE, 2011. 〈hal-00644138〉

Partager

Métriques

Consultations de la notice

112

Téléchargements de fichiers

224