No falls, no resets: Reliable humanoid behavior in the DARPA robotics challenge, 2015 IEEE-RAS 15th International Conference on Humanoid Robots (Humanoids), 2015. ,
DOI : 10.1109/HUMANOIDS.2015.7363436
Deep learning, Nature, vol.9, issue.7553, pp.436-444, 2015. ,
DOI : 10.1007/s10994-013-5335-x
ImageNet: A large-scale hierarchical image database, 2009 IEEE Conference on Computer Vision and Pattern Recognition, 2009. ,
DOI : 10.1109/CVPR.2009.5206848
Human-level control through deep reinforcement learning, Nature, vol.101, issue.7540, pp.529-533, 2015. ,
DOI : 10.1016/S0004-3702(98)00023-X
A Survey on Policy Search for Robotics, Foundations and Trends in Robotics, vol.2, issue.1-2, pp.1-142, 2013. ,
DOI : 10.1561/2300000021
URL : http://www.ias.tu-darmstadt.de/uploads/Publications/Deisenroth_ICRA_2014.pdf
Survey of Model-Based Reinforcement Learning: Applications on Robotics, Journal of Intelligent & Robotic Systems, vol.84, issue.3, pp.1-21, 2017. ,
DOI : 10.1109/IROS.2015.7353857
Gaussian Processes for Data-Efficient Learning in Robotics and Control, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.37, issue.2, pp.408-423, 2015. ,
DOI : 10.1109/TPAMI.2013.218
Black-box data-efficient policy search for robotics, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2017. ,
DOI : 10.1109/IROS.2017.8202137
URL : https://hal.archives-ouvertes.fr/hal-01576683
Curse of Dimensionality, Encyclopedia of Machine Learning, pp.257-258, 2011. ,
DOI : 10.14778/1454159.1454226
Efficient reinforcement learning for robots using informative simulated priors, 2015 IEEE International Conference on Robotics and Automation (ICRA), 2015. ,
DOI : 10.1109/ICRA.2015.7139550
URL : http://dspace.mit.edu/bitstream/1721.1/109303/1/How_Efficient%20reinforcement.pdf
GP-ILQG: Data-driven Robust Optimal Control for Uncertain Nonlinear Dynamical Systems, 2017. ,
Data-efficient control policy search using residual dynamics learning, 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2017. ,
DOI : 10.1109/IROS.2017.8206343
Policy search for learning robot control using sparse data, 2014 IEEE International Conference on Robotics and Automation (ICRA), 2014. ,
DOI : 10.1109/ICRA.2014.6907422
Robots that can adapt like animals, Nature, vol.26, issue.7553, pp.503-507, 2015. ,
DOI : 10.1038/nrn2332
URL : https://hal.archives-ouvertes.fr/hal-01158243
Virtual vs. real: Trading off simulations and physical experiments in reinforcement learning with Bayesian optimization, 2017 IEEE International Conference on Robotics and Automation (ICRA), 2017. ,
DOI : 10.1109/ICRA.2017.7989186
Using model knowledge for learning inverse dynamics, 2010 IEEE International Conference on Robotics and Automation, 2010. ,
DOI : 10.1109/ROBOT.2010.5509858
Incremental semiparametric inverse dynamics learning, 2016 IEEE International Conference on Robotics and Automation (ICRA), 2016. ,
DOI : 10.1109/ICRA.2016.7487177
URL : http://arxiv.org/pdf/1601.04549
Continuous control with deep reinforcement learning, 2015. ,
Trust region policy optimization, Proc. of ICML, 2015. ,
Policy search for motor primitives in robotics, Machine Learning, pp.171-203, 2011. ,
A generalized path integral control approach to reinforcement learning, JMLR, vol.11, pp.3137-3181, 2010. ,
Natural evolution strategies Completely derandomized selfadaptation in evolution strategies, Evolutionary computation, vol.1523, issue.9 2, pp.949-980, 2001. ,
Reinforcement Learning: An Introduction, IEEE Transactions on Neural Networks, vol.9, issue.5, 1998. ,
DOI : 10.1109/TNN.1998.712192
Imitation and Reinforcement Learning, IEEE Robotics & Automation Magazine, vol.17, issue.2, pp.55-62, 2010. ,
DOI : 10.1109/MRA.2010.936952
Abstract, Paladyn, Journal of Behavioral Robotics, vol.4, issue.1, pp.49-61, 2013. ,
DOI : 10.2478/pjbr-2013-0003
Behavioral repertoire learning in robotics, Proceeding of the fifteenth annual conference on Genetic and evolutionary computation conference, GECCO '13, 2013. ,
DOI : 10.1145/2463372.2463399
URL : https://hal.archives-ouvertes.fr/hal-00841958
Funnel libraries for real-time robust feedback motion planning, The International Journal of Robotics Research, vol.15, issue.8, pp.947-982, 2017. ,
DOI : 10.1109/CDC.2012.6426684
URL : http://dspace.mit.edu/bitstream/1721.1/106033/1/965380239-MIT.pdf
Sample efficient optimization for learning controllers for bipedal locomotion, 2016 IEEE-RAS 16th International Conference on Humanoid Robots (Humanoids), 2016. ,
DOI : 10.1109/HUMANOIDS.2016.7803249
URL : http://arxiv.org/pdf/1610.04795
Illuminating search spaces by mapping elites, 2015. ,
Using Centroidal Voronoi Tessellations to Scale Up the Multi-dimensional Archive of Phenotypic Elites Algorithm, IEEE Transactions on Evolutionary Computation, 2017. ,
DOI : 10.1109/TEVC.2017.2735550
URL : https://hal.archives-ouvertes.fr/hal-01630627
Taking the Human Out of the Loop: A Review of Bayesian Optimization, Proc. of the IEEE, pp.148-175, 2016. ,
DOI : 10.1109/JPROC.2015.2494218
Gaussian Processes and Reinforcement Learning for Identification and Control of an Autonomous Blimp, Proceedings 2007 IEEE International Conference on Robotics and Automation, 2007. ,
DOI : 10.1109/ROBOT.2007.363075
A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems, Proceedings of the 2005, American Control Conference, 2005., 2005. ,
DOI : 10.1109/ACC.2005.1469949
Model identification, Springer Handbook of Robotics, pp.113-138, 2016. ,
Exciting trajectories for the identification of base inertial parameters of robots, pp.362-375, 1992. ,
A Modular and High-Precision Motion Control System With an Integrated Motor, IEEE/ASME Transactions on Mechatronics, vol.12, issue.3, pp.317-329, 2007. ,
DOI : 10.1109/TMECH.2007.897273
Modelbased reinforcement learning with parametrized physical models and optimism-driven exploration, Proc. of ICRA, 2016. ,
DOI : 10.1109/icra.2016.7487172
URL : http://arxiv.org/pdf/1509.06824
20 years of reality gap, Proceedings of the Genetic and Evolutionary Computation Conference Companion on , GECCO '17, p.2017 ,
DOI : 10.1109/JPROC.2015.2494218
URL : https://hal.archives-ouvertes.fr/hal-01518764
Resetfree Trial-and-Error Learning for Robot Damage Recovery, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01654641
Reinforcement learning with Gaussian processes, Proceedings of the 22nd international conference on Machine learning , ICML '05, 2005. ,
DOI : 10.1145/1102351.1102377
URL : http://www-ee.technion.ac.il/~rmeir/Publications/EngelMannorMeirICML05.pdf
Model learning for robot control: a survey, Cognitive Processing, vol.11, issue.11, pp.319-340, 2011. ,
DOI : 10.1016/S0893-6080(98)00066-5
Gaussian Processes in Machine Learning, 2006. ,
DOI : 10.1162/089976602317250933
Optimization of Gaussian process hyperparameters using Rprop, Proc. of ESANN, 2013. ,
Limbo: A fast and flexible library for Bayesian optimization, pp.1611-07343, 2016. ,
Functional stability analysis of numerical algorithms, 1990. ,
The NLopt nonlinear-optimization package ,
Model-based contextual policy search for data-efficient generalization of robot skills, Artificial Intelligence, vol.247, 2014. ,
DOI : 10.1016/j.artint.2014.11.005
The Pendubot: a mechatronic system for control research and education, Proceedings of 1995 34th IEEE Conference on Decision and Control, 1995. ,
DOI : 10.1109/CDC.1995.478951
DART: Dynamic Animation and Robotics Toolkit, The Journal of Open Source Software, vol.3, issue.22, 2018. ,
DOI : 10.1177/027836499501400606
Fault-diagnosis systems: an introduction from fault detection to fault tolerance, 2006. ,
DOI : 10.1007/3-540-30368-5