Reinforcement Learning: An Introduction, IEEE Transactions on Neural Networks, vol.9, issue.5, 1998. ,
DOI : 10.1109/TNN.1998.712192
Decision theory, reinforcement learning, and the brain, Cognitive, Affective, & Behavioral Neuroscience, vol.8, issue.4, pp.429-453, 2008. ,
DOI : 10.3758/CABN.8.4.429
Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration, Philosophical Transactions of the Royal Society B: Biological Sciences, vol.46, issue.4, pp.933-942, 2007. ,
DOI : 10.1037/0033-295X.111.4.939
Metalearning and neuromodulation, Neural Networks, vol.15, issue.4-6, pp.495-506, 2002. ,
DOI : 10.1016/S0893-6080(02)00044-8
AN INTEGRATIVE THEORY OF LOCUS COERULEUS-NOREPINEPHRINE FUNCTION: Adaptive Gain and Optimal Performance, Annual Review of Neuroscience, vol.28, issue.1, pp.403-450, 2005. ,
DOI : 10.1146/annurev.neuro.28.061604.135709
SIMPLE NEURAL NETWORKS THAT OPTIMIZE DECISIONS, International Journal of Bifurcation and Chaos, vol.15, issue.03, pp.803-826, 2005. ,
DOI : 10.1142/S0218127405012478
Probabilistic brains: knowns and unknowns, Nature Neuroscience, vol.22, issue.9, pp.1170-1178, 2013. ,
DOI : 10.1016/j.neuron.2012.03.016
URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4487650
Cortical substrates for exploratory decisions in humans, Nature, vol.15, issue.7095, pp.876-879, 2006. ,
DOI : 10.1038/nature04766
URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2635947
Actor???critic models of the basal ganglia: new anatomical and computational perspectives, Neural Networks, vol.15, issue.4-6, pp.535-547, 2002. ,
DOI : 10.1016/S0893-6080(02)00047-3
Computational Explorations in Cognitive Neuroscience: Understanding the Mind by Simulating the Brain, 2000. ,
Reinforcement learning in the brain Available: http://linkinghub.elsevier.com/retrieve/pii Emotion and motivation: the role of the amygdala, ventral striatum, and prefrontal cortex, Journal of Mathematical Psychology Neuroscience & Biobehavioral Reviews, vol.53, issue.26 302, pp.139-154, 2002. ,
The nucleus accumbens as a nexus between values and goals in goal-directed behavior: a review and a new hypothesis, Frontiers in Behavioral Neuroscience, vol.7, 2013. ,
DOI : 10.3389/fnbeh.2013.00135
The integrative function of the basal ganglia in instrumental conditioning, Behavioural Brain Research, vol.199, issue.1, pp.43-52, 2009. ,
DOI : 10.1016/j.bbr.2008.10.034
Parallel incentive processing: an integrated view of amygdala function, Trends in Neurosciences, vol.29, issue.5, pp.272-279, 2006. ,
DOI : 10.1016/j.tins.2006.03.002
Multiplicity of control in the basal ganglia: computational roles of striatal subregions, Current Opinion in Neurobiology, vol.21, issue.3, pp.374-380, 2011. ,
DOI : 10.1016/j.conb.2011.02.009
Neural systems analysis of decision making during goal-directed navigation, Progress in Neurobiology, vol.96, issue.1, pp.96-135, 2012. ,
DOI : 10.1016/j.pneurobio.2011.08.010
Coordination of Actions and Habits in the Medial Prefrontal Cortex of Rats, Cerebral Cortex, vol.13, issue.4, pp.400-408, 2003. ,
DOI : 10.1093/cercor/13.4.400
Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control, Nature Neuroscience, vol.58, issue.12, pp.1704-1711, 2005. ,
DOI : 10.1038/nn1560
Uncertainty, Neuromodulation, and Attention, Neuron, vol.46, issue.4, 2005. ,
DOI : 10.1016/j.neuron.2005.04.026
URL : http://doi.org/10.1016/j.neuron.2005.04.026
Attentional control of associative learning???A possible role of the central cholinergic system, Brain Research, vol.1202, pp.43-53, 2008. ,
DOI : 10.1016/j.brainres.2007.06.097
Modulation of synaptic transmission by dopamine and norepinephrine in ventral but not dorsal striatum, Journal of neurophysiology, vol.79, issue.4, pp.1768-17769535946, 1998. ,
An exploration-exploitation model based on norepinepherine and dopamine activity, Advances in Neural Information Processing Systems 18, pp.867-874, 2006. ,
Interaction between cognitive and motor cortico-basal ganglia loops during decision making: a computational study, Journal of Neurophysiology, vol.109, issue.12, pp.3025-3040, 2013. ,
DOI : 10.1152/jn.00026.2013
URL : https://hal.archives-ouvertes.fr/hal-00828004
The human orbitofrontal cortex: linking reward to hedonic experience, Nature Reviews Neuroscience, vol.8, issue.9, pp.691-702, 2005. ,
DOI : 10.1073/pnas.0402680101
PVLV: The Primary Value and Learned Value Pavlovian Learning Algorithm., Behavioral Neuroscience, vol.121, issue.1, pp.31-49, 2007. ,
DOI : 10.1037/0735-7044.121.1.31
DANA: Distributed (asynchronous) Numerical and Adaptive modelling framework, Network: Computation in Neural Systems, vol.23, issue.4, pp.237-253, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00718780
Conflict monitoring and anterior cingulate cortex: an update, Trends in Cognitive Sciences, vol.8, issue.12, pp.539-546, 2004. ,
DOI : 10.1016/j.tics.2004.10.003
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.335.6481
Conditioned responses of monkey locus coeruleus neurons anticipate acquisition of discriminative behavior in a vigilance task, Neuroscience, vol.80, issue.3, pp.697-715, 1997. ,
DOI : 10.1016/S0306-4522(97)00060-2
An investigation of the role of cortical and cerebellar noradrenaline in associative motor learning in the rat, Brain Research, vol.134, issue.3, pp.513-5270006, 1977. ,
DOI : 10.1016/0006-8993(77)90826-5
Orienting and Reorienting: The Locus Coeruleus Mediates Cognition through Arousal, Neuron, vol.76, issue.1, pp.130-141, 2012. ,
DOI : 10.1016/j.neuron.2012.09.011
Dopaminergic control of the exploration-exploitation trade-off via the basal ganglia, Frontiers in Neuroscience, vol.6, issue.9, 2012. ,
DOI : 10.3389/fnins.2012.00009
URL : https://hal.archives-ouvertes.fr/hal-00688928