A. C. and D. J. Zilberstein-s, Incremental policy generation for finite-horizon DEC-POMDPs, ICAPS, 2009.

A. R. Dutech-a, An investigation into mathematical programming for finite horizon decentralized POMDPs, JAIR, vol.37, pp.329-396, 2010.

B. J. , K. S. Ng-a, and . Schneider-j, Policy search by dynamic programming, NIPS, 2003.

B. D. Amato-c, . A. Hansen-e, and . Zilberstein-s, Policy iteration for decentralized control of Markov decision processes, JAIR, vol.34, pp.89-132, 2009.

B. D. Givan-r and I. N. Zilberstein-s, The complexity of decentralized control of Markov decision processes, Mathematics of Operations Research, vol.27, issue.4, 2002.

B. A. Chaib-draa, Exact dynamic programming for decentralized POMDPs with lossless policy compression, ICAPS, pp.20-27, 2008.

C. A. Mouaddib-a.-i, Collective decision under partial observability -a dynamic local interaction model, IJCCI (ECTA-FCTA), pp.146-155, 2011.

D. Farias, D. P. Van, and R. B. , The Linear Programming Approach to Approximate Dynamic Programming, Operations Research, vol.51, issue.6, pp.850-865, 2003.
DOI : 10.1287/opre.51.6.850.24925

D. J. Amato-c and . Doniec-a, Scaling up decentralized MDPs through heuristic search, UAI, pp.217-226, 2012.

D. J. Amato-c and D. A. Charpillet-f, Producing efficient error-bounded solutions for transition independent decentralized MDPs, AAMAS, 2013.

H. E. Bernstein-d and . Zilberstein-s, Dynamic programming for partially observable stochastic games, AAAI, pp.709-715, 2004.

K. A. Zilberstein-s, Constraint-based dynamic programming for decentralized POMDPs with structured interactions, AAMAS, pp.561-568, 2009.

N. R. Varakantham-p and T. M. Yokoo-m, Networked distributed POMDPs : A synthesis of distributed constraint optimization and POMDPs, AAAI, pp.133-139, 2005.

O. F. Spaan-m, . Amato-c, and . Whiteson-s, Incremental clustering and expansion for faster optimal planning in Dec-POMDPs, pp.449-509, 2013.

O. F. Spaan-m and . A. Vlassis-n, Optimal and approximate Q-value functions for decentralized POMDPs, JAIR, vol.32, pp.289-353, 2008.

O. F. and W. S. Spaan-m, Lossless clustering of histories in decentralized POMDPs, AAMAS, pp.577-584, 2009.

S. G. and P. J. Kaplow-r, A survey of point-based POMDP solvers, JAAMAS, pp.1-51, 2012.

S. R. Sondik-e, The optimal control of partially observable Markov decision processes over a finite horizon, Operations Research, vol.21, issue.5, pp.1071-1088, 1973.

S. T. Simmons-r, Heuristic search value iteration for POMDPs, UAI, pp.520-527, 2004.

S. M. Oliehoek-f and . Amato-c, Scaling up optimal heuristic search in Dec- POMDPs via incremental expansion, IJCAI, pp.2027-2032, 2011.

S. D. and C. F. Zilberstein-s, MAA* : A heuristic search algorithm for solving decentralized POMDPs, UAI, pp.568-576, 2005.