Multiagent reinforcement learning: algorithm converging to nash equilibrium in general-sum discounted stochastic games, Proc. of AAMAS, 2009. ,
Neuro- Dynamic Programming, 1996. ,
Beyond the Interface: Co-evolution Inside Interactive Systems ??? A Proposal Founded on Activity Theory, People and Computers XV-Interaction without Frontiers, pp.297-310, 2001. ,
DOI : 10.1007/978-1-4471-0353-0_18
Multiagent learning using a variable learning rate, Artificial Intelligence, vol.136, issue.2, pp.215-250, 2002. ,
DOI : 10.1016/S0004-3702(02)00121-2
A comprehensive survey of multiagent reinforcement learning. Systems, Man, and Cybernetics , Part C: Applications and Reviews, IEEE Transactions on, vol.38, issue.2, pp.156-172, 2008. ,
Dialogue et théorie des jeux, Congrés international SPeD, 2011. ,
User Simulation in Dialogue Systems using Inverse Reinforcement Learning, Proc. of Interspeech, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00652446
Behavior Specific User Simulation in Spoken Dialogue Systems, Proc. of ITG Conference on Speech Com- munication, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00749421
Coadaptation in Spoken Dialogue Systems, Proc. of IWSDS, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00778752
A Comprehensive Reinforcement Learning Framework for Dialogue Management Optimization, IEEE Journal of Selected Topics in Signal Processing, vol.6, issue.8, pp.891-902, 2012. ,
DOI : 10.1109/JSTSP.2012.2229257
Learning non-cooperative dialogue behaviours, Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL), 2014. ,
DOI : 10.3115/v1/W14-4308
Dinasti : Dialogues with a negotiating appointment setting interface, Proc. of LREC, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01107496
Learning mixed initiative dialog strategies by using reinforcement learning on both conversants, Proc. of HLT/EMNLP, 2005. ,
Tree-based batch mode reinforcement learning, pp.503-556, 2005. ,
Competitive Markov decision processes, 1996. ,
DOI : 10.1007/978-1-4612-4054-9
Tracking in Reinforcement Learning, Proc. of ICONIP, 2009. ,
DOI : 10.1007/978-3-642-10677-4_57
URL : https://hal.archives-ouvertes.fr/hal-00439316
Single-Agent vs. Multi-Agent Techniques for Concurrent Reinforcement Learning of Negotiation Dialogue Policies, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2014. ,
DOI : 10.3115/v1/P14-1047
Approximate Solutions to Markov Decision Processes, 1999. ,
Stationary equilibria in stochastic games: structure, selection and computation, 2000. ,
Nash qlearning for general-sum stochastic games, Journal of Machine Learning Research, vol.4, pp.1039-1069, 2003. ,
Optimising a handcrafted dialogue system design, Proc. of Interspeech, 2010. ,
Machine learning for spoken dialogue systems, Proc. of Interspeech, 2007. ,
URL : https://hal.archives-ouvertes.fr/hal-00216035
A stochastic model of computer-human interaction for learning dialogue strategies, Proc. of Eurospeech, 1997. ,
Markov games as a framework for multi-agent reinforcement learning, Proc. of ICML, 1994. ,
Friend-or-foe q-learning in general-sum games, Proc. of ICML, 2001. ,
Games against nature, 1951. ,
Stochastic games and applications, 2003. ,
DOI : 10.1007/978-94-010-0189-2
A course in game theory, 1994. ,
Stochastic shortest path games, SIAM Journal on Control and Optimization, vol.37, issue.3, 1999. ,
Approximate dynamic programming for two-player zero-sum markov games, Proc. of ICML, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01153270
A survey on metrics for the evaluation of user simulations. The knowledge engineering review, pp.59-73, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00771654
Sampleefficient batch reinforcement learning for dialogue management optimization, ACM Transactions on Speech and Language Processing, vol.7, issue.3, 2011. ,
URL : https://hal.archives-ouvertes.fr/hal-00617517
Consistent goal-directed user model for realistic man-machine task-oriented spoken dialogue simulation, Proc of ICME, 2006. ,
URL : https://hal.archives-ouvertes.fr/hal-00215968
Algorithms for nash equilibria in general-sum stochastic games, Proc. of AAMAS, 2015. ,
Markov decision processes: discrete stochastic dynamic programming, 1994. ,
Agenda-based user simulation for bootstrapping a POMDP dialogue system, Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers on XX, NAACL '07, 2007. ,
DOI : 10.3115/1614108.1614146
Effects of the user model on simulation-based learning of dialogue strategies, Proc. of ASRU, 2005. ,
Stochastic games, Proc. of the National Academy of Sciences of the United States of America, pp.1095-1100, 1953. ,
Reinforcement learning for spoken dialogue systems, Proc. of NIPS, 1999. ,
Reinforcement learning: An introduction, 1998. ,
POMDP-Based Statistical Spoken Dialog Systems: A Review, Proceedings of the IEEE, pp.1160-1179, 2013. ,
DOI : 10.1109/JPROC.2012.2225812
Cyclic equilibria in markov games, Proc. of NIPS, 2006. ,