The Optimal Control of Partially Observable Markov Processes over a Finite Horizon, Operations Research, vol.21, issue.5, pp.1071-1088, 1973.
DOI : 10.1287/opre.21.5.1071
Applied Probability Models with Optimization Applications, 1970.
Betting on Gilbert-Elliott Channels, IEEE Transactions on Wireless Communications, vol.50, issue.3, pp.484-494, 2010.
Finite-time analysis of the multiarmed bandit problem, Machine Learning, pp.235-256, 2002.
Opportunistic file transfer over a fading channel: A POMDP search theory formulation with optimal threshold policies, IEEE Transactions on Wireless Communications, vol.5, issue.2, pp.394-405, 2006.
DOI : 10.1109/TWC.2006.1611063
Optimal and suboptimal packet scheduling over correlated time varying flat fading channels, IEEE Transactions on Wireless Communications, vol.5, issue.2, 2006.
DOI : 10.1109/TWC.2006.1611068
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.421.7605
On myopic sensing for multi-channel opportunistic access: structure, optimality, and performance, IEEE Transactions on Wireless Communications, vol.7, issue.12, pp.5431-5440, 2008.
DOI : 10.1109/T-WC.2008.071349
Optimality of Myopic Sensing in Multichannel Opportunistic Access, IEEE Transactions on Information Theory, vol.55, issue.9, pp.4040-4050, 2009.
DOI : 10.1109/TIT.2009.2025561
The non-Bayesian restless multi-armed bandit: A case of near-logarithmic regret, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011.
DOI : 10.1109/ICASSP.2011.5946273
On a restless multi-armed bandit problem with non-identical arms, 2011 49th Annual Allerton Conference on Communication, Control, and Computing (Allerton), 2011.
DOI : 10.1109/Allerton.2011.6120191