R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, IEEE Transactions on Neural Networks, vol.9, issue.5, 1998.
DOI : 10.1109/TNN.1998.712192

P. Dayan and N. D. Daw, Decision theory, reinforcement learning, and the brain, Cognitive, Affective, & Behavioral Neuroscience, vol.8, issue.4, pp.429-453, 2008.
DOI : 10.3758/CABN.8.4.429

J. D. Cohen, S. M. Mcclure, and A. J. Yu, Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration, Philosophical Transactions of the Royal Society B: Biological Sciences, vol.46, issue.4, pp.933-942, 2007.
DOI : 10.1037/0033-295X.111.4.939

K. Doya, Metalearning and neuromodulation, Neural Networks, vol.15, issue.4-6, pp.495-506, 2002.
DOI : 10.1016/S0893-6080(02)00044-8

G. Aston-jones and J. D. Cohen, AN INTEGRATIVE THEORY OF LOCUS COERULEUS-NOREPINEPHRINE FUNCTION: Adaptive Gain and Optimal Performance, Annual Review of Neuroscience, vol.28, issue.1, pp.403-450, 2005.
DOI : 10.1146/annurev.neuro.28.061604.135709

E. Brown, J. Gao, P. Holmes, R. Bogacz, M. Gilzenrat et al., SIMPLE NEURAL NETWORKS THAT OPTIMIZE DECISIONS, International Journal of Bifurcation and Chaos, vol.15, issue.03, pp.803-826, 2005.
DOI : 10.1142/S0218127405012478

A. Pouget, J. M. Beck, W. J. Ma, and P. E. Latham, Probabilistic brains: knowns and unknowns, Nature Neuroscience, vol.22, issue.9, pp.1170-1178, 2013.
DOI : 10.1016/j.neuron.2012.03.016

URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4487650

N. D. Daw, J. P. O-'doherty, P. Dayan, B. Seymour, and R. J. Dolan, Cortical substrates for exploratory decisions in humans, Nature, vol.15, issue.7095, pp.876-879, 2006.
DOI : 10.1038/nature04766

URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2635947

D. Joel, Y. Niv, and E. Ruppin, Actor???critic models of the basal ganglia: new anatomical and computational perspectives, Neural Networks, vol.15, issue.4-6, pp.535-547, 2002.
DOI : 10.1016/S0893-6080(02)00047-3

R. O. Reilly and Y. Munakata, Computational Explorations in Cognitive Neuroscience: Understanding the Mind by Simulating the Brain, 2000.

Y. Niv, R. N. Cardinal, J. A. Parkinson, J. Hall, and B. J. Everitt, Reinforcement learning in the brain Available: http://linkinghub.elsevier.com/retrieve/pii Emotion and motivation: the role of the amygdala, ventral striatum, and prefrontal cortex, Journal of Mathematical Psychology Neuroscience & Biobehavioral Reviews, vol.53, issue.26 302, pp.139-154, 2002.

F. Mannella, K. Gurney, and G. Baldassarre, The nucleus accumbens as a nexus between values and goals in goal-directed behavior: a review and a new hypothesis, Frontiers in Behavioral Neuroscience, vol.7, 2013.
DOI : 10.3389/fnbeh.2013.00135

B. W. Balleine, M. Liljeholm, and S. B. Ostlund, The integrative function of the basal ganglia in instrumental conditioning, Behavioural Brain Research, vol.199, issue.1, pp.43-52, 2009.
DOI : 10.1016/j.bbr.2008.10.034

B. W. Balleine and S. Killcross, Parallel incentive processing: an integrated view of amygdala function, Trends in Neurosciences, vol.29, issue.5, pp.272-279, 2006.
DOI : 10.1016/j.tins.2006.03.002

A. M. Bornstein and N. D. Daw, Multiplicity of control in the basal ganglia: computational roles of striatal subregions, Current Opinion in Neurobiology, vol.21, issue.3, pp.374-380, 2011.
DOI : 10.1016/j.conb.2011.02.009

M. R. Penner and S. J. Mizumori, Neural systems analysis of decision making during goal-directed navigation, Progress in Neurobiology, vol.96, issue.1, pp.96-135, 2012.
DOI : 10.1016/j.pneurobio.2011.08.010

S. Killcross and E. Coutureau, Coordination of Actions and Habits in the Medial Prefrontal Cortex of Rats, Cerebral Cortex, vol.13, issue.4, pp.400-408, 2003.
DOI : 10.1093/cercor/13.4.400

N. D. Daw, Y. Niv, and P. Dayan, Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control, Nature Neuroscience, vol.58, issue.12, pp.1704-1711, 2005.
DOI : 10.1038/nn1560

A. J. Yu and P. Dayan, Uncertainty, Neuromodulation, and Attention, Neuron, vol.46, issue.4, 2005.
DOI : 10.1016/j.neuron.2005.04.026

URL : http://doi.org/10.1016/j.neuron.2005.04.026

W. M. Pauli and R. C. O-'reilly, Attentional control of associative learning???A possible role of the central cholinergic system, Brain Research, vol.1202, pp.43-53, 2008.
DOI : 10.1016/j.brainres.2007.06.097

S. M. Nicola and R. C. Malenka, Modulation of synaptic transmission by dopamine and norepinephrine in ventral but not dorsal striatum, Journal of neurophysiology, vol.79, issue.4, pp.1768-17769535946, 1998.

S. Mcclure, M. Gilzenrat, and J. Cohen, An exploration-exploitation model based on norepinepherine and dopamine activity, Advances in Neural Information Processing Systems 18, pp.867-874, 2006.

M. Guthrie, A. Leblois, A. Garenne, and T. Boraud, Interaction between cognitive and motor cortico-basal ganglia loops during decision making: a computational study, Journal of Neurophysiology, vol.109, issue.12, pp.3025-3040, 2013.
DOI : 10.1152/jn.00026.2013

URL : https://hal.archives-ouvertes.fr/hal-00828004

M. L. Kringelbach, The human orbitofrontal cortex: linking reward to hedonic experience, Nature Reviews Neuroscience, vol.8, issue.9, pp.691-702, 2005.
DOI : 10.1073/pnas.0402680101

R. C. O-'reilly, M. J. Frank, T. E. Hazy, and B. Watz, PVLV: The Primary Value and Learned Value Pavlovian Learning Algorithm., Behavioral Neuroscience, vol.121, issue.1, pp.31-49, 2007.
DOI : 10.1037/0735-7044.121.1.31

N. P. Rougier and J. Fix, DANA: Distributed (asynchronous) Numerical and Adaptive modelling framework, Network: Computation in Neural Systems, vol.23, issue.4, pp.237-253, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00718780

M. M. Botvinick, J. D. Cohen, and C. S. Carter, Conflict monitoring and anterior cingulate cortex: an update, Trends in Cognitive Sciences, vol.8, issue.12, pp.539-546, 2004.
DOI : 10.1016/j.tics.2004.10.003

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.335.6481

G. Aston-jones, J. Rajkowski, and P. Kubiak, Conditioned responses of monkey locus coeruleus neurons anticipate acquisition of discriminative behavior in a vigilance task, Neuroscience, vol.80, issue.3, pp.697-715, 1997.
DOI : 10.1016/S0306-4522(97)00060-2

S. T. Mason and S. D. Iversen, An investigation of the role of cortical and cerebellar noradrenaline in associative motor learning in the rat, Brain Research, vol.134, issue.3, pp.513-5270006, 1977.
DOI : 10.1016/0006-8993(77)90826-5

S. J. Sara and S. Bouret, Orienting and Reorienting: The Locus Coeruleus Mediates Cognition through Arousal, Neuron, vol.76, issue.1, pp.130-141, 2012.
DOI : 10.1016/j.neuron.2012.09.011

M. D. Humphries, M. Khamassi, and K. Gurney, Dopaminergic control of the exploration-exploitation trade-off via the basal ganglia, Frontiers in Neuroscience, vol.6, issue.9, 2012.
DOI : 10.3389/fnins.2012.00009

URL : https://hal.archives-ouvertes.fr/hal-00688928