33869 articles – 26719 Notices  [english version]
.:. Consultation > Par auteur > Antos .:.
5 documents classés par :

fulltext access Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
Antos A., Szepesvari C., Munos R.
Machine Learning Journal (2008) 71:89-129 [hal-00830201 - version 1]
fulltext access Fitted Q-iteration in continuous action-space MDPs
Antos A., Munos R., Szepesvari C.
In Neural Information Processing Systems (2007) [inria-00203359 - version 1]
fulltext access Fitted Q-iteration in continuous action-space MDPs
Antos A., Munos R., Szepesvari C.
(2007) [inria-00185311 - version 2]
fulltext access Value-Iteration Based Fitted Policy Iteration: Learning with a Single Trajectory
Antos A., Szepesvari C., Munos R.
In IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (2007) 2007 [inria-00124833 - version 1]
fulltext access Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
Antos A., Szepesvari C., Munos R.
In Conference On Learning Theory (2006) [inria-00117130 - version 1]