P. Auer, N. Cesa-bianchi, Y. Freund, and R. E. Schapire, The Nonstochastic Multiarmed Bandit Problem, SIAM Journal on Computing, vol.32, issue.1, pp.48-77, 2002.
DOI : 10.1137/S0097539701398375

Y. Baram, R. El-yaniv, and K. Luz, Online choice of active learning algorithms, J. Mach. Learn. Res, vol.5, pp.255-291, 2004.

A. Baranes and P. Oudeyer, Intrinsically motivated goal exploration for active motor learning in robots: A case study, 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2010.
DOI : 10.1109/IROS.2010.5651385
URL : https://hal.archives-ouvertes.fr/inria-00541769

B. Clement, D. Roy, P. Oudeyer, and M. Lopes, Multiarmed bandits for intelligent tutoring systems, p.2015
URL : https://hal.archives-ouvertes.fr/hal-00913669

C. Cook, N. D. Goodman, and L. E. Schulz, Where science starts: Spontaneous experiments in preschoolers??? exploratory play, Cognition, vol.120, issue.3, pp.341-349, 2011.
DOI : 10.1016/j.cognition.2011.03.003

P. Delarboulas, M. Schoenauer, and M. Sebag, Open-Ended Evolutionary Robotics: An Information Theoretic Approach, Parallel Problem Solving from Nature, PPSN XI, pp.334-343, 2010.
DOI : 10.1007/978-3-642-15844-5_34
URL : https://hal.archives-ouvertes.fr/inria-00494237

S. Doncieux and J. Mouret, Beyond black-box optimization: a review of selective pressures for evolutionary robotics, Evolutionary Intelligence, vol.50, issue.1, pp.71-93, 2014.
DOI : 10.1007/s12065-014-0110-x
URL : https://hal.archives-ouvertes.fr/hal-01150254

M. Dorigo and M. Colombetti, Robot shaping: developing autonomous agents through learning, Artificial Intelligence, vol.71, issue.2, pp.321-370, 1994.
DOI : 10.1016/0004-3702(94)90047-7

A. Garivier and E. Moulines, On upper-confidence bound policies for nonstationary bandit problems, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00281392

E. J. Gibson, Exploratory Behavior in the Development of Perceiving, Acting, and the Acquiring of Knowledge, Annual Review of Psychology, vol.39, issue.1, pp.1-42, 1988.
DOI : 10.1146/annurev.ps.39.020188.000245

D. E. Goldberg, Simple genetic algorithms and the minimal, deceptive problem, pp.74-88, 1987.

F. Gomez and R. Miikkulainen, Incremental Evolution of Complex General Behavior, Adaptive Behavior, vol.5, issue.3-4, pp.317-342, 1997.
DOI : 10.1177/105971239700500305

F. J. Gomez, Sustaining diversity using behavioral information distance, Proceedings of the 11th Annual conference on Genetic and evolutionary computation, GECCO '09, pp.113-120, 2009.
DOI : 10.1145/1569901.1569918

A. Gopnik, Scientific Thinking in Young Children: Theoretical Advances, Empirical Research, and Policy Implications, Science, vol.337, issue.6102, pp.1623-1627, 2012.
DOI : 10.1126/science.1223416

H. Gweon and L. Schulz, Stretching to learn: Ambiguous evidence and variability in preschoolers exploratory play, 2008.

T. Hester, M. Lopes, and P. Stone, Learning exploration strategies in model-based reinforcement learning, Proceedings of the 2013 International Conference on Autonomous Agents and Multi-agent Systems, AAMAS '13 International Foundation for Autonomous Agents and Multiagent Systems, pp.1069-1076
URL : https://hal.archives-ouvertes.fr/hal-00871861

A. Jauffret, N. Cuperlier, P. Tarroux, and P. Gaussier, From self-assessment to frustration, a small step toward autonomy in robotic navigation, Frontiers in Neurorobotics, vol.7, 2013.
DOI : 10.3389/fnbot.2013.00016

J. Kodjabachian and J. A. Meyer, Evolution and development of neural controllers for locomotion, gradient-following, and obstacle-avoidance in artificial insects, IEEE Transactions on Neural Networks, vol.9, issue.5, pp.796-812, 1998.
DOI : 10.1109/72.712153
URL : https://hal.archives-ouvertes.fr/hal-01184992

G. Konidaris and A. Barto, Sensorimotor abstraction selection for efficient, autonomous robot skill acquisition, 2008 7th IEEE International Conference on Development and Learning, 2008.
DOI : 10.1109/DEVLRN.2008.4640821

A. Krause and D. Golovin, Submodular Function Maximization, Practical Approaches to Hard Problems, pp.71-104, 2014.
DOI : 10.1017/CBO9781139177801.004

J. Lehman and K. O. Stanley, Exploiting open-endedness to solve problems through the search for novelty, Proc. of the Eleventh Intl. Conf. on Artificial Life (ALIFE XI), 2008.

J. Lenarcic, On the quantification of robot redundancy, Proceedings 1999 IEEE International Conference on Robotics and Automation (Cat. No.99CH36288C), 1999.
DOI : 10.1109/ROBOT.1999.774079

M. Lopes and P. Oudeyer, The strategic student approach for lifelong exploration and learning, Proc. ICDL-Epirob 2012, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00755216

J. Maja and . Mataric, Reward functions for accelerated learning, Machine Learning: Proceedings of the Eleventh international conference, pp.181-189, 1994.

J. B. Mouret and S. Doncieux, Encouraging Behavioral Diversity in Evolutionary Robotics: An Empirical Study, Evolutionary Computation, vol.341, issue.1, pp.91-133, 2012.
DOI : 10.1016/0020-0190(92)90136-J
URL : https://hal.archives-ouvertes.fr/hal-00687609

G. L. Nemhauser, L. A. Wolsey, and M. L. Fisher, An analysis of approximations for maximizing submodular set functions???I, Mathematical Programming, pp.265-294, 1978.
DOI : 10.1007/BF01588971

S. Mai-nguyen and P. Oudeyer, Abstract, Paladyn, Journal of Behavioral Robotics, vol.3, issue.3, pp.136-146, 2012.
DOI : 10.2478/s13230-013-0110-z

P. Oudeyer, F. Kaplan, and V. V. Hafner, Intrinsic Motivation Systems for Autonomous Mental Development, IEEE Transactions on Evolutionary Computation, vol.11, issue.2, pp.265-286, 2007.
DOI : 10.1109/TEVC.2006.890271

H. Robbins, Some aspects of the sequential design of experiments, Bulletin of the American Mathematical Society, vol.58, issue.5, pp.527-535
DOI : 10.1090/S0002-9904-1952-09620-8

M. Rolf, J. J. Steil, and M. Gienger, Online Goal Babbling for rapid bootstrapping of inverse models in high dimensions, 2011 IEEE International Conference on Development and Learning (ICDL), pp.1-8, 2011.
DOI : 10.1109/DEVLRN.2011.6037368

B. Sareni and L. Krahenbuhl, Fitness sharing and niching methods revisited, IEEE Transactions on Evolutionary Computation, vol.2, issue.3, pp.97-106, 1998.
DOI : 10.1109/4235.735432
URL : https://hal.archives-ouvertes.fr/hal-00359799

J. Schmidhuber, On learning how to learn learning strategies, 1994.

E. Laura, E. B. Schulz, and . Bonawitz, Serious fun: Preschoolers engage in more exploratory play when evidence is confounded, Developmental Psychology, vol.43, issue.4, pp.1045-1050, 2007.

L. Trujillo, G. Olague, E. Lutton, F. Fernndez, and . Vega, Discovering Several Robot Behaviors through Speciation, Applications of Evolutionary Computing, pp.164-174, 2008.
DOI : 10.1007/978-3-540-78761-7_17

J. Urzelai, D. Floreano, M. Dorigo, and M. Colombetti, Incremental Robot Shaping, Connection Science, vol.10, issue.3-4, pp.341-360, 1998.
DOI : 10.1080/095400998116486

P. Whittle, Restless bandits: activity allocation in a changing world, Journal of Applied Probability, vol.1, issue.A, p.287, 1988.
DOI : 10.1214/aop/1176994469