Fitted Q-iteration in continuous actionspace MDPs, Neural Information Processing Systems, 2007. ,
URL : https://hal.archives-ouvertes.fr/inria-00185311
A theory of learning from different domains, Machine Learning, vol.60, issue.1-2, pp.151-175, 2010. ,
DOI : 10.1007/s10994-009-5152-4
Learning from multiple sources, Journal of Machine Learning Research, vol.9, pp.1757-1774, 2008. ,
Tree-based batch mode reinforcement learning, J. Mach. Learn. Res, vol.6, pp.503-556, 2005. ,
A distribution-free theory of nonparametric regression, 2002. ,
DOI : 10.1007/b97848
Least-squares policy iteration, The Journal of Machine Learning Research, vol.4, pp.1107-1149, 2003. ,
Finite-sample analysis of LSTD, 2010. ,
URL : https://hal.archives-ouvertes.fr/inria-00482189
Transfer of samples in batch reinforcement learning, Proceedings of the 25th international conference on Machine learning, ICML '08, pp.544-551, 2008. ,
DOI : 10.1145/1390156.1390225
Knowledge Transfer in Reinforcement Learning, 2008. ,
Domain adaptation: Learning bounds and algorithms, Proceedings of the 22nd Conference on Learning Theory, 2009. ,
Finite time bounds for fitted value iteration, Journal of Machine Learning Research, vol.9, pp.815-857, 2008. ,
URL : https://hal.archives-ouvertes.fr/inria-00120882
Reinforcement Learning: An Introduction, 1998. ,
Transferring Instances for Model-Based Reinforcement Learning, Proceedings of the European Conference on Machine Learning (ECML'08), pp.488-505, 2008. ,
DOI : 10.1007/978-3-540-87481-2_32
Transfer learning for reinforcement learning domains: A survey, Journal of Machine Learning Research, vol.10, issue.1, pp.1633-1685, 2009. ,