V. Amrhein, F. Korner-nievergelt, R. , and T. , The earth is flat (p > 0.05): significance thresholds and the crisis of unreplicable research, PeerJ, vol.5, p.3544, 2017.

M. Baker, Is there a reproducibility crisis, Nature, vol.533, pp.452-454, 2016.

C. G. Begley and L. M. Ellis, Raise standards for preclinical cancer research, Nature, vol.483, p.531, 2012.

D. Benjamin, J. Berger, M. Johannesson, B. Nosek, E. Wagenmakers et al., , 2017.

R. F. Boisvert, Incentivizing reproducibility, Commun. ACM, vol.59, pp.5-5, 2016.

D. Boyd, C. , and K. , Critical questions for big data. Information, Communication & Society, vol.15, pp.662-679, 2012.

S. B. Bruns and J. P. Ioannidis, A. p-curve and p-hacking in observational research, PLOS One, vol.11, issue.2, pp.1-13, 2016.
URL : https://hal.archives-ouvertes.fr/hal-02620326

A. Cockburn, C. Gutwin, D. , and A. , HARK no more: On the preregistration of CHI experiments, Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, vol.141, p.12, 2018.

C. Collberg and T. A. Proebsting, Repeatability in computer systems research, Commun. ACM, vol.59, pp.62-69, 2016.

I. A. Cristea and J. P. Ioannidis, A. P values in display items are ubiquitous and almost invariably significant: A survey of top science journals, PLOS One, vol.13, p.197440, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01580531

G. Cumming, Understanding the New Statistics: Effect Sizes, Confidence Intervals, and Metaanalysis. Multivariate applications series, 2012.

G. Cumming, The new statistics: Why and how, Psychological Science, vol.25, pp.7-29, 2014.

P. J. Denning and . Acm, President's letter: What is experimental computer science?, Commun. ACM, vol.23, pp.543-544, 1980.

P. Dragicevic, Fair statistical communication in HCI, Modern Statistical Methods for HCI, pp.291-330, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01377894

A. M. Durik, M. A. Britt, R. Reynolds, and J. Storey, The effects of hedges in persuasive arguments: A nuanced analysis of language, Journal of Language and Social Psychology, vol.27, pp.217-234, 2008.

A. Franco, N. Malhotra, and G. Simonovits, Publication bias in the social sciences: Unlocking the file drawer, Science, vol.345, pp.1502-1505, 2014.

S. N. Goodman, A comment on replication, p-values and evidence, Statistics in medicine, vol.11, pp.875-879, 1992.

O. E. Gundersen and S. Kjensmo, State of the art: Reproducibility in artificial intelligence, Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence, (AAAI-18), the 30th innovative Applications of Artificial Intelligence (IAAI-18), and the 8th AAAI Symposium on Educational Advances in Artificial Intelligence (EAAI-18), pp.1644-1651, 2018.

J. P. Ioannidis, Why most published research findings are false, PLOS Medicine, vol.2, issue.8, 2005.

L. K. John, G. Loewenstein, and D. Prelec, Measuring the prevalence of questionable research practices with incentives for truth telling, Psychological Science, vol.23, pp.524-532, 2012.

R. M. Kaplan and V. L. Irvin, Likelihood of null effects of large NHLBI clinical trials has increased over time, PLOS One, vol.10, pp.1-12, 2015.

M. Kay, G. L. Nelson, and E. B. Hekler, Researcher-centered design of statistics: Why Bayesian statistics better fit the culture and incentives of HCI, Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, pp.4521-4532, 2016.

N. L. Kerr, Harking: Hypothesizing after the results are known, Personality & Social Psychology Review, vol.2, p.196, 1998.

R. Kosara and S. Haroz, Skipping the Replication Crisis in Visualization: Threats to Study Validity and How to Address Them, Evaluation and Beyond -Methodological Approaches for Visualization, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01947436

S. Krishnamurthi and J. Vitek, The real software crisis: Repeatability as a core value, Commun. ACM, vol.58, pp.34-36, 2015.

J. K. Kruschke and T. M. Liddell, The Bayesian new statistics: Hypothesis testing, estimation, meta-analysis, and power analysis from a Bayesian perspective, Psychonomic Bulletin & Review, vol.25, pp.178-206, 2018.

E. Loken and A. Gelman, Measurement error and the replication crisis, Science, vol.355, pp.584-585, 2017.

B. B. Mcshane, D. Gal, A. Gelman, C. Robert, and J. L. Tackett, Abandon statistical significance, The American Statistician, vol.73, pp.235-245, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02274876

I. Monogan and J. E. , A case for registering studies of political outcomes: An application in the 2010 house elections, Political Analysis, vol.21, p.21, 2013.

R. S. Nickerson, Confirmation bias: A ubiquitous phenomenon in many guises, Review of General Psychology, vol.2, issue.2, pp.175-220, 1998.

, Open Science Collaboration and others. Estimating the reproducibility of psychological science, Science, vol.349, p.4716, 2015.

H. Pashler and E. Wagenmakers, Editors' introduction to the special section on replicability in psychological science: A crisis of confidence?, Perspectives on Psychological Science, vol.7, issue.6, pp.528-530, 2012.

J. D. Perezgonzalez and D. Frias-navarro, Retract 0.005 and propose using JASP, instead, 2017.

D. Rennie, Trial registration: A great idea switches from ignored to irresistible, JAMA, vol.292, pp.1359-1362, 2004.

R. Rosenthal, The file drawer problem and tolerance for null results, Psychological Bulletin, vol.86, pp.638-641, 1979.

D. M. Sanbonmatsu, S. S. Posavac, F. R. Kardes, and S. P. Mantel, Selective hypothesis testing, Psychonomic Bulletin & Review, vol.5, issue.2, pp.197-220, 1998.

V. Savalei and E. Dunn, Is the call to abandon p-values the red herring of the replicability crisis?, Frontiers in Psychology, vol.6, p.245, 2015.

J. P. Simmons, L. D. Nelson, and U. Simonsohn, False-positive psychology: Undisclosed flexibility in data collection and analysis allows presenting anything as significant, Psychological Science, vol.22, pp.1359-1366, 2011.

U. Simonsohn, Posterior-hacking: Selective reporting invalidates Bayesian results also, SSRN, 2014.

U. Simonsohn, L. D. Nelson, and J. P. Simmons, P-curve: A key to the file-drawer, Journal of Experimental Psychology: General, vol.143, pp.534-547, 2014.

T. D. Sterling, Publication decisions and their possible effects on inferences drawn from tests of significance -or vice versa, Journal of the American Statistical Association, vol.54, pp.30-34, 1959.

D. Trafimow and M. Marks, Basic and Applied Social Psychology, vol.37, pp.1-2, 2015.