Skip to Main content Skip to Navigation
Conference papers

Online Learning to Optimize Transmission over an Unknown Gilbert-Elliott Channel

Abstract : This paper studies the optimal transmission policy for a Gilbert-Elliott Channel. The transmitter has two actions: sending aggressively or sending conservatively, with rewards depending on the action chosen and the underlying channel state. The aim is to compute the scheduling policy to determine which actions to choose at each time slot in order to maximize the expected total discounted reward. We first establish the threshold structure of the optimal policy when the underlying channel statistics are known. We then consider the more challenging case when the statistics are unknown. For this problem, we map different threshold policies to arms of a suitably defined multiarmed bandit problem. To tractably handle the complexity introduced by countably infinite arms and the infinite time horizon, we weaken our objective a little: finding a (OPT ( + ))- approximate policy instead. We present the UCB-P algorithm, which can achieve this objective with logarithmic-time regret.
Complete list of metadata

Cited literature [10 references]  Display  Hide  Download

https://hal.inria.fr/hal-00763262
Contributor : Service Ist Inria Sophia Antipolis-Méditerranée / I3s <>
Submitted on : Monday, December 10, 2012 - 2:23:16 PM
Last modification on : Sunday, December 17, 2017 - 7:04:03 AM
Long-term archiving on: : Monday, March 11, 2013 - 12:31:26 PM

File

p27-wu.pdf
Explicit agreement for this submission

Identifiers

  • HAL Id : hal-00763262, version 1

Collections

Citation

Yanting Wu, Bhaskar Krishnamachari. Online Learning to Optimize Transmission over an Unknown Gilbert-Elliott Channel. WiOpt'12: Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks, May 2012, Paderborn, Germany. pp.27-32. ⟨hal-00763262⟩

Share

Metrics

Record views

109

Files downloads

176