Decision-theoretic military operations planning, Proceedings of the Fourteenth International Conference on Automated Planning and Scheduling (ICAPS'04), 2004. ,
Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables Solving uncertain markov decision problems, 1972. ,
Learning to act using realtime dynamic programming, Artificial Intelligence, vol.72, 1995. ,
Labeled rtdp: Improving the convergence of real time dynamic programming, Proceedings of the Thirteenth International Conference on Automated Planning and Scheduling (ICAPS'03), 2003. ,
Bounded-parameter Markov decision processes, Artificial Intelligence, vol.122, issue.1-2, pp.71-109, 2000. ,
DOI : 10.1016/S0004-3702(00)00047-3
URL : http://doi.org/10.1016/s0004-3702(00)00047-3
Controlled Markov set-chains under average criteria, Applied Mathematics and Computation, vol.120, issue.1-3, pp.1-3195, 2001. ,
DOI : 10.1016/S0096-3003(99)00241-6
Efficient resources allocation for markov decision processes, Advances in Neural Information Processing Systems 13 (NIPS'01), 2001. ,
Robustness in markov decision problems with uncertain transition matrices, Advances in Neural Information Processing Systems 16 (NIPS'03), 2004. ,
Stochastic Shortest Path Games, SIAM Journal on Control and Optimization, vol.37, issue.3, pp.804-824, 1999. ,
DOI : 10.1137/S0363012996299557
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.15.4053
On-line search for solving markov decision processes via heuristic sampling, Proceedings of the 16th European Conference on Artificial Intelligence, 2004. ,
An empirical evaluation of interval estimation for Markov decision processes, 16th IEEE International Conference on Tools with Artificial Intelligence, 2004. ,
DOI : 10.1109/ICTAI.2004.28
Reinforcement Learning: an introduction Optimal sailing strategies, statistics and operations research program, 1996. ,
DOI : 10.1007/978-1-4615-3618-5
Inequalities for the l1 deviation of the empirical distribution, 2003. ,
Ppddl1.0: An extension to pddl for expressing planning domains with probabilistic effects, 2004. ,