C. K. Carter and R. Kohn, On Gibbs sampling for state space models, Biometrika, vol.81, issue.3, pp.541-553, 1994.
DOI : 10.1093/biomet/81.3.541

L. Chrisman, Reinforcement learning with perceptual aliasing: The perceptual distinctions approach, Proc. AAAI, pp.183-188, 1992.

F. Doshi-Velez, The infinite partially observable Markov decision process, Proc. NIPS, 2009.

H. Ishwaran and L. F. James, Gibbs Sampling Methods for Stick-Breaking Priors, Journal of the American Statistical Association, vol.96, issue.453, pp.161-173, 2001.
DOI : 10.1198/016214501750332758

R. Jaulmes, J. Pineau, and D. Precup, Learning in non-stationary Partially Observable Markov Decision Processes, ECML Workshop on Reinforcement Learning in Non-Stationary Environments, 2005.

M. L. Littman, A. R. Cassandra, and L. P. Kaelbling, Learning policies for partially observable environments: Scaling up, Proc. ICML, 1995.
DOI : 10.1016/B978-1-55860-377-6.50052-9

P. Poupart, N. Vlassis, J. Hoey, and K. Regan, An analytic solution to discrete Bayesian reinforcement learning, Proc. ICML, pp.697-704, 2006.
DOI : 10.1145/1143844.1143932

L. Ren, L. Carin, and D. B. Dunson, The dynamic hierarchical Dirichlet process, Proc. ICML, 2008.
DOI : 10.1145/1390156.1390260

S. Ross, B. Chaib-draa, and J. Pineau, Bayes-adaptive POMDPs, Proc. NIPS, 2008.

S. Ross, B. Chaib-draa, and J. Pineau, Bayesian reinforcement learning in continuous POMDPs with application to robot navigation, 2008 IEEE International Conference on Robotics and Automation, 2008.
DOI : 10.1109/ROBOT.2008.4543641

S. Ross, J. Pineau, S. Paquet, and B. Chaib-draa, Online planning algorithms for POMDPs, Journal of Artificial Intelligence Research, vol.32, pp.663-704, 2008.

J. Sethuraman, A constructive definition of Dirichlet priors, Statistica Sinica, vol.4, pp.639-650, 1994.

G. Shani, J. Pineau, and R. Kaplow, A survey of point-based POMDP solvers, Autonomous Agents and Multi-Agent Systems, vol.27, issue.1, pp.1-51, 2013.
DOI : 10.1007/s10458-012-9200-2

D. Siegmund, Importance Sampling in the Monte Carlo Study of Sequential Tests, The Annals of Statistics, vol.4, issue.4, pp.673-684, 1976.
DOI : 10.1214/aos/1176343541

Y. W. Teh, M. I. Jordan, M. J. Beal, and D. M. Blei, Hierarchical Dirichlet Processes, Journal of the American Statistical Association, vol.101, issue.476, pp.1566-1581, 2006.
DOI : 10.1198/016214506000000302

G. Theocharous and L. P. Kaelbling, Approximate planning in POMDPs with macroactions, Proc. NIPS, 2003.