R. Arkin, Moving Up the Food Chain, pp.245-270, 2005.
DOI : 10.1093/acprof:oso/9780195166194.003.0009

J. Audibert, R. Munos, and C. Szepesvari, Exploration???exploitation tradeoff using variance estimates in multi-armed bandits, Theoretical Computer Science, vol.410, issue.19, pp.4101876-1902, 2009.
DOI : 10.1016/j.tcs.2009.01.016

URL : https://hal.archives-ouvertes.fr/hal-00711069

B. Bakker and J. Schmidhuber, Hierarchical reinforcement learning based on subgoal discovery and subpolicy specialization, Proc. of the 8-­?th Conf. on Intelligent Autonomous Systems, pp.438-445, 2004.

G. Baldassare and M. Mirolli, Intrinsically motivated learning in natural and artificial systems, 2013.
DOI : 10.1007/978-3-642-32375-1

A. Baranes, P. Y. Oudeyer, and J. Gottlieb, Eye movements reveal epistemic curiosity in human observers, Vision Research, vol.117, pp.81-90, 2015.
DOI : 10.1016/j.visres.2015.10.009

URL : https://hal.archives-ouvertes.fr/hal-01250727

A. F. Baranes, P. Y. Oudeyer, and J. Gottlieb, The effects of task difficulty, novelty and the size of the search space on intrinsically motivated exploration, Frontiers in Neuroscience, vol.86, issue.6, 2014.
DOI : 10.1016/j.neuroimage.2013.08.019

URL : https://hal.archives-ouvertes.fr/hal-01087227

M. Bardo and R. Bevins, Conditioned place preference: what does it add to our preclinical understanding of drug reward?, Psychopharmacology, vol.153, issue.1, pp.31-43, 2000.
DOI : 10.1007/s002130000569

A. Barto, Intrinsic Motivation and Reinforcement Learning, Intrinsically Motivated Learning in Natural and Artificial Systems, pp.17-47, 2013.
DOI : 10.1007/978-3-642-32375-1_2

F. Benureau and P. Oudeyer, Behavioral Diversity Generation in Autonomous Exploration through Reuse of Past Experience, Frontiers in Robotics and AI, vol.23, 2016.
DOI : 10.1145/279232.279236

URL : https://hal.archives-ouvertes.fr/hal-01404329

D. Berlyne, Structure and Direction in Thinking, 1965.

K. Beuls, Towards an agent-­?based tutoring system for Spanish verb conjugation, 2013.

K. Beuls and J. Loeckx, Steps towards intelligent MOOCs : A case study for learning counterpoint Music Learning With Massive Open Online Courses, Luc Steels, pp.119-144, 2015.

T. C. Blanchard, B. Y. Hayden, and E. S. Bromberg-­?martin, Orbitofrontal Cortex Uses Distinct Codes for Different Choice Attributes in Decisions Motivated by Curiosity, Neuron, vol.85, issue.3, pp.602-614, 2015.
DOI : 10.1016/j.neuron.2014.12.050

R. Blanchard, M. Kelley, and D. Blanchard, Defensive reactions and exploratory behavior in rats., Journal of Comparative and Physiological Psychology, vol.87, issue.6, pp.1129-1133, 1974.
DOI : 10.1037/h0037591

E. S. Bromberg-­?martin and O. Hikosaka, Midbrain Dopamine Neurons Signal Preference for Advance Information about Upcoming Rewards, Neuron, vol.63, issue.1, pp.119-126, 2009.
DOI : 10.1016/j.neuron.2009.06.009

E. Bromberg-­?martin, M. Matsumoto, and O. Hikosaka, Dopamine in Motivational Control: Rewarding, Aversive, and Alerting, Neuron, vol.68, issue.5, pp.815-849, 2009.
DOI : 10.1016/j.neuron.2010.11.022

P. Cardoso-­?leite and D. Bavelier, Video game play, attention, and learning, Current Opinion in Neurology, vol.27, issue.2, pp.185-191, 2014.
DOI : 10.1097/WCO.0000000000000077

P. Dayan and T. J. Sejnowski, Exploration bonuses and dual control, Machine Learning, vol.18, issue.1, pp.5-22, 1996.
DOI : 10.1007/BF00115298

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.164.3455

A. Barto, Intrinsic Motivation and Reinforcement Learning, Intrinsically Motivated Learning in Natural and Artificial Systems, pp.17-47, 2013.
DOI : 10.1007/978-3-642-32375-1_2

A. Barto, M. Mirolli, and G. Baldasarre, Novelty or surprise? Frontiers in Cognitive Science, 2013.

B. Clement, D. Roy, P. Oudeyer, and M. Lopes, Multi-­?Armed Bandits for Intelligent Tutoring Systems, Journal of Educational Data Mining (JEDM), vol.7, issue.2, 2015.
URL : https://hal.archives-ouvertes.fr/hal-00913669

R. Bevins, Novelty Seeking and Reward: Implications for the Study of High-Risk Behaviors, Current Directions in Psychological Science, vol.144, issue.6, p.189, 2001.
DOI : 10.1111/1467-8721.00146

D. I. Cordova and M. R. Lepper, Intrinsic motivation and the process of learning: Beneficial effects of contextualization, personalization, and choice., Journal of Educational Psychology, vol.88, issue.4, p.715, 1996.
DOI : 10.1037/0022-0663.88.4.715

M. Csikszenthmihalyi, Flow-­?the Psychology of Optimal Experience (Harper Peren-­? nial), 1991.

D. Charms and R. , Personal Causation: The Internal Affective Determinants of Behav-­? ior, 1968.

E. L. Deci, R. Koestner, and R. M. Ryan, Extrinsic Rewards and Intrinsic Motivation in Education: Reconsidered Once Again, Review of Educational Research, vol.71, issue.1, pp.1-27, 2001.
DOI : 10.3102/00346543071001001

W. N. Dember and R. W. Earl, Analysis of exploratory, manipulatory, and curiosity behaviors., Psychological Review, vol.64, issue.2, pp.91-96, 1957.
DOI : 10.1037/h0046861

N. C. Foley, D. C. Jangraw, C. Peck, J. G. Gottlieb, S. M. Mcclure et al., Novelty enhances visual salience independently of reward in the parietal lobe The wick in the candle of learning: epistemic curiosity activates reward circuitry and enhances memory, J neurosci Psychol Sci, vol.34, issue.208, pp.7947-7957, 2009.

F. Kaplan and P. Oudeyer, Motivational principles for visual know-­?how development, 2003.

F. Kaplan and P. Oudeyer, The progress-­?drive hypothesis: an interpretation of early imitation editor, Models and mechanisms of imitation and social learning: Behavioural, social and communication dimensions, pp.361-377, 2007.

F. Kaplan and P. Oudeyer, In search of the neural circuits of intrinsic motivation, Frontiers in Neuroscience, vol.1, issue.1, pp.225-236, 2007.
DOI : 10.3389/neuro.01.1.1.017.2007

C. Kidd, S. T. Piantadosi, and R. N. Aslin, The Goldilocks Effect: Human Infants Allocate Attention to Visual Sequences That Are Neither Too Simple Nor Too Complex, PLoS ONE, vol.19, issue.6, p.36399, 2012.
DOI : 10.1371/journal.pone.0036399.s003

C. Kidd, S. T. Piantadosi, and R. N. Aslin, The Goldilocks effect in infant auditory cognition, Child Development, vol.85, issue.5, pp.1795-804, 2014.

C. Kidd and B. Y. Hayden, The Psychology and Neuroscience of Curiosity, Neuron, vol.88, issue.3, pp.449-460, 2015.
DOI : 10.1016/j.neuron.2015.09.010

G. D. Konidaris, A. G. Barto, R. Cnr, I. Lake, B. M. Ullman et al., Building machines that learn and think like people, An adaptive robot motivational system Animals to Animats 9: Proceedings of the 9th International Conference on Simulation of Adaptive Behavior (SAB-­?06), p.46, 2006.

J. Lehman and K. O. Stanley, Abandoning Objectives: Evolution Through the Search for Novelty Alone, Evolutionary Computation, vol.7, issue.3, pp.189-223, 2011.
DOI : 10.1016/0165-6074(93)90215-7

T. R. Liyanagunawardena, A. A. Adams, and S. A. Williams, MOOCs: A systematic study of the published literature 2008-2012, The International Review of Research in Open and Distributed Learning, vol.14, issue.3, pp.202-227, 2013.
DOI : 10.19173/irrodl.v14i3.1455

M. Lopes and P. Y. Oudeyer, The strategic student approach for life-long exploration and learning, 2012 IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL), pp.1-8, 2012.
DOI : 10.1109/DevLrn.2012.6400807

URL : https://hal.archives-ouvertes.fr/hal-00755216

B. Meder and J. D. Nelson, Information search with situation-­?specific reward functions, Judgment and Decision Making, vol.7, pp.119-148, 2012.

K. E. Merrick and M. L. Maher, Motivated reinforcement learning: curious characters for multiuser games, 2009.
DOI : 10.1007/978-3-540-89187-1

M. Mirolli and G. Baldassarre, Functions and Mechanisms of Intrinsic Motivations, Intrinsically Motivated Learning in Natural and Artificial Systems, pp.49-72, 2013.
DOI : 10.1007/978-3-642-32375-1_3

M. Montessori, The discovery of the child, 1948.

K. Montgomery, The role of the exploratory drive in learning., Journal of Comparative and Physiological Psychology, vol.47, issue.1, pp.60-64, 1954.
DOI : 10.1037/h0054833

C. Moulin-­?frier, M. Nguyen, and P. Oudeyer, Self-organization of early vocal development in infants and machines: the role of intrinsic motivation, Frontiers in Psychology, vol.4, 2014.
DOI : 10.3389/fpsyg.2013.01006

URL : https://hal.archives-ouvertes.fr/hal-00927940

A. Myers and N. Miller, Failure to find a learned drive based on hunger; evidence for learning motivated by "exploration.", Journal of Comparative and Physiological Psychology, vol.47, issue.6, p.428, 1954.
DOI : 10.1037/h0062664

R. Nkambou, R. Mizoguchi, and J. Bourdeay, Advances in intelligent tutoring systems, 2010.
DOI : 10.1007/978-3-642-14363-2

URL : https://hal.archives-ouvertes.fr/hal-00699845

P. Oudeyer, F. Kaplan, and V. Hafner, Intrinsic Motivation Systems for Autonomous Mental Development, IEEE Transactions on Evolutionary Computation, vol.11, issue.2, pp.265-286, 2007.
DOI : 10.1109/TEVC.2006.890271

P. Oudeyer and F. Kaplan, What is intrinsic motivation? A typology of computational approaches, Frontiers in Neurorobotics, vol.1, issue.6, 2007.
DOI : 10.3389/neuro.12.006.2007

P. Oudeyer and F. Kaplan, Discovering communication, Connection Science, vol.66, issue.2, pp.189-206, 2006.
DOI : 10.1080/09540090600768567

P. Oudeyer and L. Smith, How Evolution May Work Through Curiosity-Driven Developmental Process, Topics in Cognitive Science, vol.8, issue.11, pp.492-502, 2016.
DOI : 10.3389/neuro.12.006.2007

URL : https://hal.archives-ouvertes.fr/hal-01404334

S. Papert, Mindstorms: Children, computers, and powerful ideas, 1980.
DOI : 10.1007/978-3-0348-5357-6

M. Resnick, J. Maloney, A. Monroy-­?hernández, N. Rusk, E. Eastmond et al., Scratch, Communications of the ACM, vol.52, issue.11, pp.60-67, 2009.
DOI : 10.1145/1592761.1592779

D. Roy, G. Gerber, S. Magnenat, F. Riedo, M. Chevalier et al., IniRobot: a pedagogical kit to initiate children to concepts of robotics and computer science, proceedings of RIE 2015, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01144435

S. P. Singh, A. G. Barto, and N. Chentanez, Intrinsically motivated reinforcement learning, Advances in neural information processing systems, pp.1281-1288, 2004.

S. P. Singh, R. L. Lewis, A. G. Barto, and J. Sorg, Intrinsically motivated reinforcement learning: An evolutionary perspective. Autonomous Mental Development, IEEE Transactions on, vol.2, issue.2, pp.70-82, 2010.
DOI : 10.1109/tamd.2010.2051031

F. Pachet, On the Design of a Musical Flow Machine A learning zone of one's own, 2004.

C. J. Peck, D. C. Jangraw, M. Suzuki, R. Efem, and J. Gottlieb, Reward Modulates Attention Independently of Action Value in Posterior Parietal Cortex, Journal of Neuroscience, vol.29, issue.36, pp.29-11182, 2009.
DOI : 10.1523/JNEUROSCI.1929-09.2009

R. A. Rescorla and A. R. Wagner, A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. Classical conditioning II: Current research and theory, pp.64-99, 1972.

R. Ryan and E. Deci, Intrinsic and Extrinsic Motivations: Classic Definitions and New Directions, Contemporary Educational Psychology, vol.25, issue.1, pp.54-67, 2000.
DOI : 10.1006/ceps.1999.1020

J. Schmidhuber, Curious model-building control systems, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks, 1991.
DOI : 10.1109/IJCNN.1991.170605

S. Singh, A. G. Barto, C. , N. Vancouver, B. C. et al., Observing the unexpected enhances infants' learning and exploration, Intrinsically motivated reinforcement learning 18th Annual Conference on Neural Information Processing Systems (NIPS), pp.348-91, 2004.

L. Steels, Social Flow in Social MOOCs, L. Steels, Music Learning with Massive Open Online Courses, pp.119-144, 2015.

L. Steels, Music Learning with Massive Open Online Courses (pp. 119-­?144), 2015.

R. S. Sutton and A. G. Barto, Toward a modern theory of adaptive networks: Expectation and prediction., Psychological Review, vol.88, issue.2, p.135, 1981.
DOI : 10.1037/0033-295X.88.2.135

R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, IEEE Transactions on Neural Networks, vol.9, issue.5, 1998.
DOI : 10.1109/TNN.1998.712192

R. S. Sutton, R. S. Sutton, J. Modayil, M. Delp, T. Degris et al., Integrated architectures for learning, planning, and reacting based on approximating dynamic programming Horde: A scalable real-­?time architecture for learning knowledge from unsupervised sensorimotor interaction, Proceedings of the 7th International Conference on Machine Learning The 10th International Conference on Autonomous Agents and Multiagent Systems-­?Volume 2, pp.216-224, 1990.

F. Taffoni, E. Tamilia, V. Focaroli, D. Formica, L. Ricci et al., Development of goal-directed action selection guided by intrinsic motivations: an experiment with children, Experimental Brain Research, vol.66, issue.5, pp.2167-2177, 2014.
DOI : 10.1007/s00221-014-3907-z

L. Weiskrantz and A. Cowey, The aetiology of food reward in monkeys, Animal Behaviour, vol.11, issue.2-3, pp.225-234, 1963.
DOI : 10.1016/S0003-3472(63)80104-9

F. Weizmann, L. Cohen, and R. Pratt, Novelty, familiarity, and the development of infant attention., Developmental Psychology, vol.4, issue.2, pp.149-154, 1971.
DOI : 10.1037/h0030432

R. White, Motivation reconsidered: The concept of competence., Psychological Review, vol.66, issue.5, pp.297-333, 1959.
DOI : 10.1037/h0040934

P. Waelti, A. Dickinson, and W. Schultz, Dopamine responses comply with basic assumptions of formal learning theory, Nature, issue.6842, pp.412-455, 2001.