E. Altman, Applications of Markov Decision Processes in Communication Networks, 2000.
DOI : 10.1007/978-1-4615-0805-2_16
URL : https://hal.archives-ouvertes.fr/inria-00072663

R. Becker, S. Zilberstein, V. Lesser, and C. V. Goldman, Solving transition independent decentralized markov decision processes, Journal of Artificial Intelligence Research, vol.22, pp.423-455, 2004.
DOI : 10.1145/860581.860583
URL : http://anytime.cs.umass.edu/shlomo/papers/aamas03a.pdf

D. S. Bernstein, R. Givan, N. Immerman, and S. Zilberstein, The Complexity of Decentralized Control of Markov Decision Processes, Mathematics of Operations Research, vol.27, issue.4, pp.819-840, 2002.
DOI : 10.1287/moor.27.4.819.297

D. S. Bernstein, E. A. Hansen, and S. Zilberstein, Bounded policy iteration for decentralized POMDPs, Proceedings of the 24th International Joint Conference on Artificial Intelligence, 2005.

C. Claus and C. Boutilier, The dynamics of reinforcement learning in cooperative multiagent systems, AAAI/IAAI, pp.746-752, 1998.

R. Emery-montemerlo, G. Gordon, J. Schneider, S. Thrun, P. J. Gmytrasiewicz et al., Approximate solutions for partially observable stochastic games with common payoffs A framework for sequential planning in multi-agent settings, Proceedings of the 3rd AAMAS, pp.49-79, 2004.

E. A. Hansen, D. S. Bernstein, and S. Zilberstein, Dynamic programming for partially observable stochastic games, Proceedings of the 19th National Conference on Artificial Intelligence, 2004.

W. S. Lovejoy, Computationally Feasible Bounds for Partially Observed Markov Decision Processes, Operations Research, vol.39, issue.1, pp.162-175, 1991.
DOI : 10.1287/opre.39.1.162

R. Nair, M. Tambe, M. Yokoo, D. Pynadath, and S. Marsella, Taming decentralized pomdps: Towards efficient policy computation for multiagent settings, Proceedings of the 18th International Joint Conference on Artificial Intelligence, 2003.

L. Peshkin, K. Kim, N. Meuleau, and L. Kaelbling, Learning to cooperate via policy search, Proceedings of the 16th Conference on Uncertainty in Artificial In- telligence, 2000.

J. Pineau, G. Gordon, and S. Thrun, Point-based value iteration: An anytime algorithm for pomdps, Proceedings of the 18th International Joint Conference on Artificial Intelligence, 2003.

Y. Shirai, A. J. Osgood, Y. Zhao, K. F. Kelly, and J. M. Tour, Directional control in thermally driven singlemolecule nanocars, Nano Letters, vol.5, issue.11, 2005.

D. Szer, C. , and F. , An Optimal Best-First Search Algorithm for Solving Infinite Horizon DEC-POMDPs, Proceedings of the 16st European Conference on Machine Learning, 2005.
DOI : 10.1007/11564096_38
URL : https://hal.archives-ouvertes.fr/inria-00000205

D. Szer, F. Charpillet, and S. Zilberstein, MAA*: A heuristic search algorithm for solving decentralized POMDPs Winning back the cup for distributed pomdps: Planning over continuous belief spaces, Proceedings of the 21st Conference on Uncertainty in Artificial Intelligence Proceedings of the 5th International Joint Conference on Autonomous Agents and Multiagent Systems, 2005.