R. A. Armstrong, When to use the Bonferroni correction, In: Ophthalmic and Physiological Optics, vol.34, issue.5, pp.502-508, 2014.

A. E. Roth and I. Erev, Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term, pp.164-212, 1995.

M. Osborne, Exponential versus hyperbolic discounting: a theoretical analysis, 2016.

L. Green, N. Fristoe, and J. Myerson, Temporal discounting and preference reversals in choice between delayed outcomes, In: Psychonomic Bulletin & Review, vol.1, issue.3, pp.383-389, 1994.

S. Palminteri, Confirmation bias in human reinforcement learning: Evidence from counterfactual feedback processing, In: PLoS Computational Biology, vol.13, 2017.

G. Lefebvre, Behavioural and neural characterization of optimistic reinforcement learning, In: Nature Human Behaviour, vol.1, p.67, 2017.