Exponential Convergence Rate for the Asymptotic Optimality of Whittle Index Policy - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2020

Exponential Convergence Rate for the Asymptotic Optimality of Whittle Index Policy

Résumé

We evaluate the performance of Whittle index policy for restless Markovian bandits, when the number of bandits grows. It is proven in [30] that this performance is asymptotically optimal if the bandits are indexable and the associated deterministic system has a global attractor fixed point. In this paper we show that, under the same conditions, the convergence rate is exponential in the number of bandits, unless the fixed point is singular (to be defined later). Our proof is based on the nature of the deterministic equation governing the stochastic system: We show that it is a piecewise affine continuous dynamical system inside the simplex of the empirical measure of the bandits. Using simulations and numerical solvers, we also investigate the cases where the conditions for the exponential rate theorem are violated, notably when attracting limit cycles appear, or when the fixed point is singular. We illustrate our theorem on a Markovian fading channel model, which has been well studied in the literature. Finally, we extend our synchronous model results to the asynchronous model.
Fichier principal
Vignette du fichier
optimality_whittle.pdf (925.88 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03041176 , version 1 (11-12-2020)
hal-03041176 , version 2 (19-07-2023)

Identifiants

Citer

Nicolas Gast, Bruno Gaujal, Chen Yan. Exponential Convergence Rate for the Asymptotic Optimality of Whittle Index Policy. 2020. ⟨hal-03041176v1⟩
106 Consultations
180 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More