Abstract : We provide a proof of the performance bound theorem published in "Least-Squares λ Policy Iteration: Bias-Variance Trade-off in Control Problems" (ICML 2010).
https://hal.inria.fr/inria-00480952 Contributor : Christophe ThieryConnect in order to contact the contributor Submitted on : Wednesday, May 5, 2010 - 2:56:27 PM Last modification on : Wednesday, February 2, 2022 - 3:51:27 PM Long-term archiving on: : Thursday, September 16, 2010 - 1:36:05 PM