Reinforcement Learning: An Introduction, IEEE Transactions on Neural Networks, vol.9, issue.5, 1998.
DOI : 10.1109/TNN.1998.712192
Residual algorithms: Reinforcement learning with function approximation, International Conference on Machine Learning, pp.30-37, 1995.
Least-squares temporal difference learning, Proc. 16th International Conference on Machine Learning, pp.49-56, 1999.
Least squares policy evaluation algorithms with linear function approximation, Discrete Event Dynamic Systems, pp.79-110, 2003.
Incremental least-squares temporal difference learning, Proceedings of the American Association for Artificial Intelligence (AAAI), pp.356-361, 2006.
iLSTD: Eligibility traces & convergence analysis, Proceedings of the Neural Information Processing Systems Conference, 2006.
A unified view of TD algorithms; introducing full-gradient TD and equi-gradient descent TD, 2006.
URL : https://hal.archives-ouvertes.fr/inria-00116936
Equi-gradient descent, 2006.
URL : https://hal.archives-ouvertes.fr/inria-00116936
An analysis of temporal-difference learning with function approximation, 1996.