Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs, Autonomous Agents and Multi-Agent Systems, vol.21, issue.3, pp.293-320, 2010. ,
DOI : 10.1007/s10458-009-9103-z
Decentralized control of partially observable Markov decision processes, 52nd IEEE Conference on Decision and Control, 2013. ,
DOI : 10.1109/CDC.2013.6760239
Incremental policy generation for finite-horizon DEC-POMDPs, Proceedings of the Nineteenth International Conference on Automated Planning and Scheduling, 2009. ,
Policy search for multi-robot coordination under uncertainty, Proceedings of the Robotics: Science and Systems Conference, 2015. ,
DOI : 10.1007/11564096_38
Planning with macro-actions in decentralized POMDPs, Proceedings of the Thirteenth International Conference on Autonomous Agents and Multiagent Systems, 2014. ,
An investigation into mathematical programming for finite horizon decentralized POMDPs, Journal of Artificial Intelligence Research, vol.37, pp.329-396, 2010. ,
URL : https://hal.archives-ouvertes.fr/inria-00439627
Sample bounded distributed reinforcement learning for decentralized POMDPs, Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, pp.1256-1262, 2012. ,
Learning to act using real-time dynamic programming, Artificial Intelligence, vol.72, issue.1-2, pp.81-138, 1995. ,
DOI : 10.1016/0004-3702(94)00011-O
Solving transition independent decentralized Markov decision processes, Journal of Artificial Intelligence Research, vol.22, pp.423-455, 2004. ,
Dynamic Programming, 1957. ,
The Complexity of Decentralized Control of Markov Decision Processes, Mathematics of Operations Research, vol.27, issue.4, 2002. ,
DOI : 10.1287/moor.27.4.819.297
Exact dynamic programming for decentralized POMDPs with lossless policy compression, Proceedings of the Eighteenth International Conference on Automated Planning and Scheduling, pp.20-27, 2008. ,
Collective decision under partial observability - a dynamic local interaction model, IJCCI (ECTA-FCTA), pp.146-155, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00969318
Value-based observation compression for DEC-POMDPs, Proceedings of the Seventh International Conference on Autonomous Agents and Multiagent Systems, 2008. ,
The Linear Programming Approach to Approximate Dynamic Programming, Operations Research, vol.51, issue.6, pp.850-865, 2003. ,
DOI : 10.1287/opre.51.6.850.24925
Exploiting separability in multiagent planning with continuous-state MDPs, Proceedings of the Thirteenth International Conference on Autonomous Agents and Multiagent Systems, pp.1281-1288, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01092066
Exploiting separability in multiagent planning with continuous-state MDPs (extended abstract), Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, pp.4254-4260, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01188483
Scaling up decentralized MDPs through heuristic search, Proceedings of the Twenty-Eighth Conference on Uncertainty in Artificial Intelligence, pp.217-226, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00765221
Producing efficient error-bounded solutions for transition independent decentralized MDPs, Proceedings of the Twelfth International Conference on Autonomous Agents and Multiagent Systems, pp.539-546, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00918066
Structural results for cooperative decentralized control models, Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, pp.46-52, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01188481
Point-based incremental pruning heuristic for solving finite-horizon DEC-POMDPs, Proceedings of the Eighth International Conference on Autonomous Agents and Multiagent Systems, pp.569-576, 2009. ,
Toward error-bounded algorithms for infinite-horizon Dec-POMDPs, Proceedings of the Tenth International Conference on Autonomous Agents and Multiagent Systems, pp.947-954, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00969579
Topological order planner for POMDPs, Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, pp.1684-1689, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-00965737
Optimally solving Dec-POMDPs as continuous-state MDPs, Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00907338
Error-bounded approximations for infinite-horizon discounted decentralized POMDPs, Proceedings of the Twenty-Fourth European Conference on Machine Learning, pp.338-353, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01096610
A novel prioritization technique for solving Markov decision processes, Proceedings of the 21st International Conference of the Florida Artificial Intelligence Research Society, pp.537-542, 2008. ,
Decentralized control of a multiaccess broadcast network, 1981 20th IEEE Conference on Decision and Control including the Symposium on Adaptive Processes, pp.390-391, 1981. ,
DOI : 10.1109/CDC.1981.269554
Dynamic programming for partially observable stochastic games, Proceedings of the Nineteenth National Conference on Artificial Intelligence, pp.709-715, 2004. ,
LAO*: A heuristic search algorithm that finds solutions with loops, Artificial Intelligence, vol.129, issue.1-2, pp.35-62, 2001. ,
DOI : 10.1016/S0004-3702(01)00106-0
A Formal Basis for the Heuristic Determination of Minimum Cost Paths, IEEE Transactions on Systems Science and Cybernetics, vol.4, issue.2, pp.100-107, 1968. ,
DOI : 10.1109/TSSC.1968.300136
Value-function approximations for partially observable Markov decision processes, Journal of Artificial Intelligence Research, vol.13, pp.33-94, 2000. ,
Dynamic Programming and Markov Processes, The M.I.T. Press, 1960. ,
DCOPs meet the real world: Exploring unknown reward matrices with applications to mobile sensor networks, Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence, pp.181-186, 2009. ,
Planning and acting in partially observable stochastic domains, Artificial Intelligence, vol.101, issue.1-2, pp.99-134, 1998. ,
DOI : 10.1016/S0004-3702(98)00023-X
Real-time heuristic search, Artificial Intelligence, vol.42, issue.2-3, pp.189-211, 1990. ,
DOI : 10.1016/0004-3702(90)90054-4
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.161.809
Constraint-based dynamic programming for decentralized POMDPs with structured interactions, Proceedings of the Eighth International Conference on Autonomous Agents and Multiagent Systems, pp.561-568, 2009. ,
Point-based backup for decentralized POMDPs: complexity and new algorithms, Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems, pp.1315-1322, 2010. ,
Taming decentralized POMDPs: Towards efficient policy computation for multiagent settings, Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence, pp.705-711, 2003. ,
Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs, Proceedings of the Twentieth National Conference on Artificial Intelligence, pp.133-139, 2005. ,
Decentralized POMDPs, Reinforcement Learning: State of the Art, pp.471-503, 2012. ,
DOI : 10.1007/978-3-642-27645-3_15
Sufficient plan-time statistics for decentralized POMDPs, Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2013. ,
Incremental clustering and expansion for faster optimal planning in Dec-POMDPs, Journal of Artificial Intelligence Research, vol.46, pp.449-509, 2013. ,
Optimal and approximate Q-value functions for decentralized POMDPs, Journal of Artificial Intelligence Research, vol.32, pp.289-353, 2008. ,
DOI : 10.1145/1329125.1329390
URL : http://orbilu.uni.lu/handle/10993/11032
Tree-based solution methods for multiagent POMDPs with delayed communication, Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012. ,
Lossless clustering of histories in decentralized POMDPs, Proceedings of the Eighth International Conference on Autonomous Agents and Multiagent Systems, pp.577-584, 2009. ,
Decentralized control of a multiple access broadcast channel: performance bounds, Proceedings of 35th IEEE Conference on Decision and Control, pp.293-298, 1996. ,
DOI : 10.1109/CDC.1996.574318
Optimizing Spatial and Temporal Reuse in Wireless Networks by Decentralized Partially Observable Markov Decision Processes, IEEE Transactions on Mobile Computing, vol.13, issue.4, 2013. ,
DOI : 10.1109/TMC.2013.39
Task allocation learning in a multiagent environment: Application to the RoboCupRescue simulation, Multiagent and Grid Systems, vol.6, issue.4, pp.293-314, 2010. ,
DOI : 10.3233/MGS-2010-0153
Anytime point-based approximations for large POMDPs, Journal of Artificial Intelligence Research, vol.27, pp.335-380, 2006. ,
Approximate Dynamic Programming: Solving the Curses of Dimensionality, 2007. ,
DOI : 10.1002/9781118029176
Markov Decision Processes: Discrete Stochastic Dynamic Programming, 1994. ,
Finding approximate POMDP solutions through belief compression, Journal of Artificial Intelligence Research, vol.23, pp.1-40, 2005. ,
Improved memory-bounded dynamic programming for DEC-POMDPs, Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligence, 2007. ,
A survey of point-based POMDP solvers, Autonomous Agents and Multi-Agent Systems, vol.27, issue.1, pp.1-51, 2013. ,
DOI : 10.1007/s10458-012-9200-2
The Optimal Control of Partially Observable Markov Processes over a Finite Horizon, Operations Research, vol.21, issue.5, pp.1071-1088, 1973. ,
DOI : 10.1287/opre.21.5.1071
Probabilistic Planning for Robotic Exploration, The Robotics Institute, 2007. ,
Heuristic search value iteration for POMDPs, Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence, pp.520-527, 2004. ,
Focused real-time dynamic programming for MDPs: Squeezing more out of a heuristic, Proceedings of the Twenty-First AAAI Conference on Artificial Intelligence, pp.1227-1232, 2006. ,
Scaling up optimal heuristic search in Dec-POMDPs via incremental expansion, Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence, pp.2027-2032, 2011. ,
MAA*: A heuristic search algorithm for solving decentralized POMDPs, Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence, pp.568-576, 2005. ,
URL : https://hal.archives-ouvertes.fr/inria-00000204
Feature-based methods for large scale dynamic programming, Machine Learning, vol.22, issue.1-3, pp.59-94, 1996. ,
Distributed model shaping for scaling to decentralized POMDPs with hundreds of agents, Proceedings of the Tenth International Conference on Autonomous Agents and Multiagent Systems, pp.955-962, 2011. ,
TCP ex Machina: Computer-generated congestion control, 2013. ,
Point-based policy generation for decentralized POMDPs, Proceedings of the Ninth International Conference on Autonomous Agents and Multiagent Systems, pp.1307-1314, 2010. ,
Online planning for multi-agent systems with bounded communication, Artificial Intelligence, vol.175, issue.2, pp.487-511, 2011. ,
DOI : 10.1016/j.artint.2010.09.008
Decision-Theoretic Control of Planetary Rovers, Revised Papers from the International Seminar on Advances in Plan-Based Control of Robotic Agents, pp.270-289, 2002. ,
DOI : 10.1007/3-540-37724-7_16