Linear least-squares algorithms for temporal difference learning, pp.33-57, 1996.
Shaping multi-agent systems with gradient reinforcement learning, Autonomous Agents and Multi-Agent Systems Journal (AAMASJ), pp.197-220, 2007.
Processus Décisionnels de Markov en Intelligence Artificielle, edited by Olivier Buffet and Olivier Sigaud, 2008.
Least-squares methods in reinforcement learning for control, Proc. of the 2nd Hellenic Conference on Artificial Intelligence (SETN-02), number 2308 in Lecture Notes in Artificial Intelligence, pp.249-260, 2002.
Developmental robotics: a survey, Connection Science, vol.15, issue.4, pp.151-190, 2003.
Some philosophical problems from the standpoint of artificial intelligence, Machine Intelligence, vol.4, pp.463-502, 1969.
Policy invariance under reward transformations: Theory and application to reward shaping, Proceedings of the Sixteenth International Conference on Machine Learning, ICML-99, pp.278-287, 1999.
Intrinsic motivation systems for autonomous mental development, IEEE Transactions on Evolutionary Computation, vol.11, issue.2, pp.265-286, 2007.
Dynamic Self-Organising Map, Neurocomputing, vol.74, issue.11, pp.1840-1847, 2011.
URL: https://hal.archives-ouvertes.fr/inria-00495827
Apprentissage par renforcement développemental en robotique autonome, Conférence Francophone d'Apprentissage, 2011.
Reinforcement Learning, 1998.
DOI: 10.1016/B978-012526430-3/50003-9
The role of developmental limitations of sensory input on sensory/perceptual organization, Developmental & Behavioral Pediatrics, vol.6, issue.5, pp.302-306, 1985.