. References, Solving uncertain markov decision problems, 2001.

. Barto, Learning to act using realtime dynamic programming Dynamic Programming, Artificial Intelligence, vol.72, 1957.

G. Bonet, H. Bonet, and . Geffner, Labeled rtdp: Improving the convergence of real time dynamic programming, Proceedings of the Thirteenth International Conference on Automated Planning and Scheduling (ICAPS'03), 2003.

]. R. Bryant, O. Buffet, and D. Aberdeen, Symbolic manipulation of boolean functions using a graphical representation Planning with robust (l)rtdp, National ICT Australia, november 2004. [Buffet and Aberdeen, 2005] O. Buffet and D. Aberdeen. Robust planning with (l)rtdp. In Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI'05), pp.688-694, 1985.

. Coudert, Verifying temporal properties of sequential machines without building their state diagrams, Proceedings of the Workshop on Computer-Aided Verification, 1990.
DOI : 10.1007/BFb0023716

. Givan, Bounded-parameter Markov decision processes, Artificial Intelligence, vol.122, issue.1-2, pp.71-109, 2000.
DOI : 10.1016/S0004-3702(00)00047-3

. Hosaka, Controlled Markov set-chains under average criteria, Applied Mathematics and Computation, vol.120, issue.1-3, pp.1-3195, 2001.
DOI : 10.1016/S0096-3003(99)00241-6

]. R. Munos, Efficient resources allocation for markov decision processes, Advances in Neural Information Processing Systems 13 (NIPS'01), 2001.

G. Nilim, L. Nilim, and . Ghaoui, Robustness in markov decision problems with uncertain transition matrices, Advances in Neural Information Processing Systems 16 (NIPS'03), 2004.

G. Peret, F. Peret, and . Garcia, On-line search for solving markov decision processes via heuristic sampling, Proceedings of the 16th European Conference on Artificial Intelligence, 2004.

L. L. Strehl, M. L. Strehl, and . Littman, An empirical evaluation of interval estimation for Markov decision processes, 16th IEEE International Conference on Tools with Artificial Intelligence, 2004.
DOI : 10.1109/ICTAI.2004.28

B. Sutton, G. Sutton, . Barto, J. Robert, and . Vanderbei, Reinforcement Learning: an introduction Optimal sailing strategies, statistics and operations research program, 1996.
DOI : 10.1007/978-1-4615-3618-5