On Gibbs sampling for state space models, Biometrika, vol.81, issue.3, pp.541-553, 1994. ,
DOI : 10.1093/biomet/81.3.541
Reinforcement learning with perceptual aliasing: The perceptual distinctions approach, Proc. AAAI, pp.183-188, 1992. ,
The infinite partially observable Markov decision process, Proc. NIPS, 2009. ,
Gibbs Sampling Methods for Stick-Breaking Priors, Journal of the American Statistical Association, vol.96, issue.453, pp.161-173, 2001. ,
DOI : 10.1198/016214501750332758
Learning in non-stationary Partially Observable Markov Decision Processes, ECML Workshop on Reinforcement Learning in Non-Stationary Environments, 2005. ,
Learning policies for partially observable environments: Scaling up, Proc. ICML, 1995. ,
DOI : 10.1016/B978-1-55860-377-6.50052-9
An analytic solution to discrete Bayesian reinforcement learning, Proceedings of the 23rd international conference on Machine learning , ICML '06, pp.697-704, 2006. ,
DOI : 10.1145/1143844.1143932
The dynamic hierarchical Dirichlet process, Proceedings of the 25th international conference on Machine learning, ICML '08, 2008. ,
DOI : 10.1145/1390156.1390260
Bayes-adaptive POMDPs, Proc. NIPS, 2008. ,
Bayesian reinforcement learning in continuous POMDPs with application to robot navigation, 2008 IEEE International Conference on Robotics and Automation, 2008. ,
DOI : 10.1109/ROBOT.2008.4543641
Online planning algorithms for pomdps, Journal of Artificial Intelligence Research, vol.32, pp.663-704, 2008. ,
A constructive definition of the Dirichlet prior, Statistica Sinica, vol.2, pp.639-650, 1994. ,
A survey of point-based POMDP solvers, Autonomous Agents and Multi-Agent Systems, vol.17, issue.2, pp.1-51, 2012. ,
DOI : 10.1007/s10458-012-9200-2
Importance Sampling in the Monte Carlo Study of Sequential Tests, The Annals of Statistics, vol.4, issue.4, pp.673-684, 1976. ,
DOI : 10.1214/aos/1176343541
Hierarchical Dirichlet Processes, Journal of the American Statistical Association, vol.101, issue.476, pp.1566-1581, 2006. ,
DOI : 10.1198/016214506000000302
Approximate planning in POMDPs with macroactions, Proc. NIPS, 2003. ,