. Aberdeen, Decision-theoretic military operations planning, Proceedings of the Fourteenth International Conference on Automated Planning and Scheduling (ICAPS'04), 2004.

S. Abramowitz, I. A. Abramowitz, and . Stegunbagnell, Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables Solving uncertain markov decision problems, 1972.

. Barto, Learning to act using realtime dynamic programming, Artificial Intelligence, vol.72, 1995.

G. Bonet, H. Bonet, and . Geffner, Labeled rtdp: Improving the convergence of real time dynamic programming, Proceedings of the Thirteenth International Conference on Automated Planning and Scheduling (ICAPS'03), 2003.

. Givan, Bounded-parameter Markov decision processes, Artificial Intelligence, vol.122, issue.1-2, pp.71-109, 2000.
DOI : 10.1016/S0004-3702(00)00047-3

URL : http://doi.org/10.1016/s0004-3702(00)00047-3

. Hosaka, Controlled Markov set-chains under average criteria, Applied Mathematics and Computation, vol.120, issue.1-3, pp.1-3195, 2001.
DOI : 10.1016/S0096-3003(99)00241-6

]. R. Munos, Efficient resources allocation for markov decision processes, Advances in Neural Information Processing Systems 13 (NIPS'01), 2001.

G. Nilim, L. Nilim, and . Ghaoui, Robustness in markov decision problems with uncertain transition matrices, Advances in Neural Information Processing Systems 16 (NIPS'03), 2004.

B. D. Patek, D. P. Patek, and . Bertsekas, Stochastic Shortest Path Games, SIAM Journal on Control and Optimization, vol.37, issue.3, pp.804-824, 1999.
DOI : 10.1137/S0363012996299557

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.15.4053

G. Peret, F. Peret, and . Garcia, On-line search for solving markov decision processes via heuristic sampling, Proceedings of the 16th European Conference on Artificial Intelligence, 2004.

L. L. Strehl, M. L. Strehl, and . Littman, An empirical evaluation of interval estimation for Markov decision processes, 16th IEEE International Conference on Tools with Artificial Intelligence, 2004.
DOI : 10.1109/ICTAI.2004.28

B. Sutton, G. Sutton, . Barto, J. Robert, and . Vanderbei, Reinforcement Learning: an introduction Optimal sailing strategies, statistics and operations research program, 1996.
DOI : 10.1007/978-1-4615-3618-5

. Weissman, Inequalities for the l1 deviation of the empirical distribution, 2003.

L. L. Younes, M. L. Younes, and . Littman, Ppddl1.0: An extension to pddl for expressing planning domains with probabilistic effects, 2004.