Simultaneous analysis of Lasso and Dantzig selector, Annals of Statistics, pp.1705-1732, 2009.
Linear Least-Squares algorithms for temporal difference learning, Machine Learning, pp.33-57, 1996.
The Dantzig selector: statistical estimation when p is much larger than n, Annals of Statistics, vol.35, issue.6, pp.2313-2351, 2007.
Least Angle Regression, Annals of Statistics, vol.32, issue.2, pp.407-499, 2004.
Regularized Policy Iteration, Proc. of NIPS 21, 2008.
Model selection in reinforcement learning, Machine Learning Journal, vol.85, issue.3, pp.299-332, 2011.
A Brief Survey of Parametric Value Function Approximation, 2010.
ℓ1-penalized projected Bellman residual, Proc. of EWRL 9, 2011.
Finite-Sample Analysis of Lasso-TD, Proc. of ICML, 2011.
Regularized Least Squares Temporal Difference learning with nested ℓ2 and ℓ1 penalization, Proc. of EWRL 9, 2011.
Linear Complementarity for Regularized Policy Evaluation and Improvement, Proc. of NIPS 23, pp.1009-1017, 2010.
Regularization and Feature Selection in Least-Squares Temporal Difference Learning, Proc. of ICML, 2009.
Reinforcement Learning, 1998.
DOI: 10.1016/B978-012526430-3/50003-9
Error Bounds for Approximations from Projected Linear Equations, Mathematics of Operations Research, vol.35, pp.306-329, 2010.