R. Charton, A. Boyer, and F. Et-charpillet, Providing users with adapted services: Dynamic building of dialogues to make heterogeneous agents cooperate, 2002.
URL : https://hal.archives-ouvertes.fr/inria-00107577

S. Young, Probabilistic Methods in Spoken Dialog Systems, 1999.

E. Levin, R. Pieraccini, and W. Et-eckert, Using Markov decision process for learning dialogue strategies, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181)
DOI : 10.1109/ICASSP.1998.674402

C. Watkins, Learning from Delayed Rewards, 1989.

D. Goddeau, H. Meng, J. Polifroni, S. Seneff, and S. Et-busayapongchaiy, A form-based dialogue manager for spoken language applications, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96
DOI : 10.1109/ICSLP.1996.607458

R. S. Sutton and A. G. Et-barto, Reinforcement Learning: An Introduction, IEEE Transactions on Neural Networks, vol.9, issue.5, 1998.
DOI : 10.1109/TNN.1998.712192