, Emergence of locomotion behaviours in rich environments, 2017.
Human-level control through deep reinforcement learning, Nature, vol.518, issue.7540, pp.529-533, 2015. ,
Model-agnostic meta-learning for fast adaptation of deep networks, Proc. of ICML. JMLR. org, pp.1126-1135, 2017. ,
A survey on policy search algorithms for learning robot controllers in a handful of trials, IEEE Transactions on Robotics, vol.36, issue.2, pp.328-347, 2020. ,
URL : https://hal.archives-ouvertes.fr/hal-02393432
PILCO: A model-based and data-efficient approach to policy search, Proc. of ICML, 2011. ,
Black-Box Data-efficient Policy Search for Robotics, Proc. of IROS, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01576683
Multi-objective model-based policy search for data-efficient learning with sparse rewards, Conference on Robot Learning, pp.839-855, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01884294
Information theoretic mpc for model-based reinforcement learning, Proc. of ICRA, 2017. ,
Learning to adapt: Meta-learning for modelbased control, Proc. of ICLR, 2019. ,
Deep reinforcement learning in a handful of trials using probabilistic dynamics models, Proc. of NIPS, pp.4754-4765, 2018. ,
Encyclopedia of machine learning, pp.257-258, 2010. ,
Neural network dynamics for model-based deep reinforcement learning with model-free fine-tuning, Proc. of ICRA, pp.7559-7566, 2018. ,
Robots that can adapt like animals, Nature, vol.521, issue.7553, pp.503-507, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01158243
Natural language processing (almost) from scratch, JMLR, vol.12, pp.2493-2537, 2011. ,
, Proximal policy optimization algorithms, 2017.
Model Identification, pp.113-138, 2016. ,
Efficient reinforcement learning for robots using informative simulated priors, Proc. of ICRA, 2015. ,
Data-efficient control policy search using residual dynamics learning, Proc. of IROS, 2017. ,
Using Parameterized Black-Box Priors to Scale Up Model-Based Policy Search for Robotics, Proc. of ICRA, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01768285
Illuminating search spaces by mapping elites, 2015. ,
Quality diversity: A new frontier for evolutionary computation, Frontiers in Robotics and AI, vol.3, p.40, 2016. ,
Quality and diversity optimization: A unifying modular framework, IEEE Trans. on Evolutionary Computation, vol.22, issue.2, pp.245-259, 2018. ,
Evolving a behavioral repertoire for a walking robot, Evolutionary Computation, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01095543
Reset-free trial-and-error learning for robot damage recovery, Robotics and Autonomous Systems, vol.100, pp.236-250, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01654641
Adaptive prior selection for repertoire-based online adaptation in robotics, Frontiers in Robotics and AI, vol.6, p.151, 2020. ,
URL : https://hal.archives-ouvertes.fr/hal-02462935
Bayesian optimization with automatic prior selection for data-efficient direct policy search, Proc. of ICRA, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01768279
Efficient transfer learning and online adaptation with latent variable models for continuous control, 2018. ,
Meta reinforcement learning with latent variable Gaussian processes, Conference on Uncertainty in Artificial Intelligence, vol.34, pp.642-652, 2018. ,
On first-order meta-learning algorithms, 2018. ,
A survey of numerical methods for optimal control, Advances in the Astronautical Sciences, vol.135, issue.1, pp.497-528, 2009. ,
The cross-entropy method for optimization, Handbook of statistics, vol.31, pp.35-59, 2013. ,
Bullet physics library, Open source: bulletphysics. org, vol.15, p.5, 2013. ,