Neuro-dynamic programming, athena scientific, 1996. ,
Learning to act using real-time dynamic programming, Artificial Intelligence, vol.72, issue.1-2, 1993. ,
DOI : 10.1016/0004-3702(94)00011-O
Reinforcement Learning: An Introduction, IEEE Transactions on Neural Networks, vol.9, issue.5, 1998. ,
DOI : 10.1109/TNN.1998.712192
High-accuracy value-function approximation with neural networks, 2004. ,
URL : https://hal.archives-ouvertes.fr/inria-00107776
Reinforcement Learning Using Neural Networks, with Applications to Motor Control, 2002. ,
URL : https://hal.archives-ouvertes.fr/tel-00003985
Variable resolution discretization in optimal control, 1999. ,
Generalization in reinforcement learning: Successful examples using sparse coarse coding, Advances in Neural Information Processing Systems, pp.1038-1044 ,
Exponentiated gradient methods for reinforcement learning, Proc. 14th International Conference on Machine Learning, pp.272-277, 1997. ,
Application of reinforcement learning to balancing of Acrobot, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028), 1999. ,
DOI : 10.1109/ICSMC.1999.815605
Q-learning with hidden-unit restarting, Advances in Neural Information Processing Systems 5, pp.81-88, 1993. ,
Comparison of CMACs and radial basis functions for local function approximators in reinforcement learning, Proceedings of International Conference on Neural Networks (ICNN'97), 1997. ,
DOI : 10.1109/ICNN.1997.616132
Sparse Distributed Memories for On-Line Value-Based Reinforcement Learning, pp.347-358, 2004. ,
DOI : 10.1007/978-3-540-30115-8_33
A tutorial survey of reinforcement learning, 1995. ,
Error bounds for approximate value iteration, Proceedings of AAAI 2005, 2005. ,
DOI : 10.1137/040614384
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.466.4205
Data mining, ACM SIGMOD Record, vol.31, issue.1, 2005. ,
DOI : 10.1145/507338.507355
The power of decision tables, Proceedings of the European Conference on Machine Learning, pp.174-189, 1995. ,
DOI : 10.1007/3-540-59286-5_57
K*: an instance-based learner using an entropic distance measure, Proc. 12th International Conference on Machine Learning, pp.108-114, 1995. ,
Robust Regression and Outlier Detection, 1987. ,
DOI : 10.1002/0471725382
Induction of descriptive fuzzy classifiers with the logitboost algorithm. soft computing, 2005. ,
A tutorial on support vector regression, Statistics and Computing, vol.14, issue.3, 1998. ,
DOI : 10.1023/B:STCO.0000035301.49549.88
Improvements to the SMO algorithm for SVM regression, IEEE Transactions on Neural Networks, vol.11, issue.5, 1999. ,
DOI : 10.1109/72.870050
Random Number Generation and Quasi-Monte Carlo Methods, 1992. ,
DOI : 10.1137/1.9781611970081