To Bonferroni or not to Bonferroni: when and how are the questions, Bulletin of the Ecological Society of America, vol.81, issue.3, pp.246-248, 2000.
Deep reinforcement learning that matters, 2017.
Reproducibility of benchmarked deep reinforcement learning tasks for continuous control, Proceedings of the ICML 2017 workshop on Reproducibility in Machine Learning (RML), 2017.
Continuous control with deep reinforcement learning, 2015.
Simple random search provides a competitive approach to reinforcement learning, 2018.
Parameter space noise for exploration, 2017.
Analyzing tables of statistical tests, Evolution, vol.43, issue.1, pp.223-225, 1989.
Trust region policy optimization, 2015.
Proximal policy optimization algorithms, 2017.
The generalization of 'Student's' problem when several different population variances are involved, Biometrika, vol.34, issue.1/2, pp.28-35, 1947.
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation, 2017.