Sequential parameter optimization, IEEE Congress on Evolutionary Computation, 2005. ,
, Probabilistic Integration: A Role for Statisticians in Numerical Analysis? ArXiv e-prints, 2015.
A tutorial on bayesian optimization of expensive cost functions, with application to active user modeling and hierarchical reinforcement learning, 2010. ,
Bayesian optimization for learning gaits under uncertainty, Annals of Mathematics and Artificial Intelligence, 2015. ,
Using Parameterized Black-Box Priors to Scale Up Model-Based Policy Search for Robotics, International Conference on Robotics and Automation (ICRA), 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01768285
Black-Box Data-efficient Policy Search for Robotics, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01576683
A survey on policy search algorithms for learning robot controllers in a handful of trials, IEEE Transactions on Robotics, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02393432
Path integral guided policy search, IEEE International Conference on Robotics and Automation (ICRA), 2017. ,
Offer: Off-environment reinforcement learning, AAAI Conference on Artificial Intelligence, 2017. ,
A statistical method for global optimization, IEEE International Conference on Systems, Man and Cybernetics, 1992. ,
Sdo: A statistical method for global optimization, Multidisciplinary Design Optimization: State-of-the-Art, 1997. ,
Evolving a behavioral repertoire for a walking robot, Evolutionary Computation, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01095543
Robots that can adapt like animals, Nature, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01158243
Pilco: A model-based and data-efficient approach to policy search, International Conference on Machine Learning (ICML), 2011. ,
Reinforcement learning in the presence of rare events, International Conference on Machine Learning (ICML), 2008. ,
Likelihood ratio gradient estimation for stochastic systems, Communications of the ACM, 1990. ,
Sampling for inference in probabilistic models with fast bayesian quadrature, Neural Information Processing Systems (NIPS), 2014. ,
Probabilistic numerics and uncertainty in computations, Proceedings of the Royal Society of London A: Mathematical, Physical and Robust RL with Bayesian Optimisation & Quadrature Engineering Sciences, 2015. ,
An experimental investigation of model-based parameter optimisation: Spo and beyond, Proceedings of the 11th Annual Conference on Genetic and Evolutionary Computation, 2009. ,
Evolutionary robotics and the radical envelope-of-noise hypothesis, Adaptive Behavior, 1997. ,
Noise and the reality gap: The use of simulation in evolutionary robotics, Advances in Artificial Life, 1995. ,
Lipschitzian optimization without the lipschitz constant, Journal of Optimization Theory and Applications, 1993. ,
Efficient global optimization of expensive black-box functions, Journal of Global Optimization, 1998. ,
Data-efficient reinforcement learning with probabilistic model predictive control, International Conference on Artificial Intelligence and Statistics, pp.1701-1710, 2018. ,
Convergence guarantees for kernel-based quadrature rules in misspecified settings, Neural Information Processing Systems (NIPS), 2016. ,
The transferability approach: Crossing the reality gap in evolutionary robotics, IEEE Transactions on Evolutionary Computation, 2013. ,
Contextual gaussian process bandit optimization, Neural Information Processing Systems (NIPS), 2011. ,
DART: Dynamic Animation and Robotics Toolkit, The Journal of Open Source Software, 2018. ,
Guided policy search, International Conference on International Conference on Machine Learning (ICML), 2013. ,
End-to-end training of deep visuomotor policies, Journal of Machine Learning Research, 2016. ,
Automatic design and manufacture of robotic lifeforms, Nature, 2000. ,
Automatic gait optimization with gaussian process regression, International Joint Conference on Artificial Intelligence (IJCAI), 2007. ,
Virtual vs. real: Trading off simulations and physical experiments in reinforcement learning with Bayesian optimization, International Conference on Robotics and Automation (ICRA), 2017. ,
Active policy learning for robot planning and exploration under uncertainty, Robotics: Science and Systems, 2007. ,
A bayesian exploration-exploitation approach for optimal online sensing and planning with a visually guided mobile robot, Autonomous Robots, 2009. ,
On bayesian methods for seeking the extremum, Optimization Techniques IFIP Technical Conference, 1975. ,
Illuminating search spaces by mapping elites, 2015. ,
Slice sampling, Annals of Statistics, 2000. ,
Monte carlo is fundamentally unsound, Journal of the Royal Statistical Society. Series D, 1987. ,
Bayes-hermite quadrature, Journal of Statistical Planning and Inference, 1991. ,
Active learning of model evidence using bayesian quadrature, Neural Information Processing Systems (NIPS), 2012. ,
Alternating optimisation and quadrature for robust control, AAAI Conference on Artificial Intelligence, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01644063
Fingerprint policy optimisation for robust reinforcement learning, International Conference on Machine Learning (ICML), 2019. ,
Bayesian optimization with automatic prior selection for data-efficient direct policy search, Proceedings 2018 IEEE International Conference on Robotics and Automation (ICRA), 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01768279
Policy gradient methods for robotics, IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006. ,
Robust adversarial reinforcement learning, 2017. ,
Multi-information source optimization, Neural Information Processing Systems (NIPS), 2017. ,
, Learning robust neural network policies using model ensembles. International Conference on Learning Representations (ICLR, 2017.
Bayesian monte carlo, Neural Information Processing Systems (NIPS), 2003. ,
Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning), 2005. ,
Bayesianly justifiable and relevant frequency calculations for the applied statistician. The Annals of Statistics, 1984. ,
Trust region policy optimization, International Conference on Machine Learning (ICML), 2015. ,
Active learning literature survey, 2010. ,
Input warping for bayesian optimization of non-stationary functions, International Conference on International Conference on Machine Learning (ICML), 2014. ,
Gaussian process optimization in the bandit setting: no regret and experimental design, International Conference on Machine Learning (ICML), 2010. ,
Inferring coalescence times from dna sequence data, Genetics, 1997. ,
Bayesian Optimization with Expensive Integrands, 2018. ,
Sequential design of computer experiments to minimize integrated response functions, Statistica Sinica, 2000. ,
Simple statistical gradient-following algorithms for connectionist reinforcement learning, Machine Learning, 1992. ,
Collective robot reinforcement learning with distributed asynchronous guided policy search, International Conference on Intelligent Robots and Systems (IROS), 2017. ,