A. Agarwal, S. Kakade, and L. Yang, Model-based reinforcement learning with a generative model is minimax optimal, Conference on Learning Theory, 2020.

R. Mohammad-gheshlaghi-azar, B. Munos, and . Kappen, On the sample complexity of reinforcement learning with a generative model, International Conference on Machine Learning, 2012.

R. Mohammad-gheshlaghi-azar, H. J. Munos, and . Kappen, Minimax PAC bounds on the sample complexity of reinforcement learning with a generative model, Machine Learning, vol.91, issue.3, pp.325-349, 2013.