A. G. Reece, A. J. Reagan, K. L. Lix, P. S. Dodds, C. M. Danforth et al., Forecasting the onset and course of mental illness with Twitter data, Nature Scientific Reports, vol.7, issue.1, pp.13-19, 2017.

J. W. Pennebaker, C. K. Chung, J. Frazee, G. M. Lavergne, and D. I. Beaver, When small words foretell academic success: The case of college admissions essays, PLOS ONE, vol.9, issue.12, 2014.

K. Niederhoffer, J. Schler, P. Crutchley, K. Loveys, and G. Coppersmith, In your wildest dreams: The language and psychological features of dreams, Proceedings of the Fourth Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, Vancouver, pp.13-25, 2017.

J. Serrà, I. Leontiadis, D. Spathis, G. Stringhini, J. Blackburn et al., Class-based prediction errors to detect hate speech with out-of-vocabulary words, Proceedings of the First Workshop on Abusive Language Online, pp.36-40, 2017.

C. Homan, R. Johar, T. Liu, M. Lytle, V. Silenzio et al., Toward macro-insights for suicide prevention: Analyzing fine-grained distress at scale, Proceedings of the Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, pp.107-117, 2014.

M. Tomasello, The New Psychology of Language: Cognitive and Functional Approaches to Language Structure, 2nd, vol.376, pp.978-979, 2002.

N. D. Goodman and M. C. Frank, Pragmatic language interpretation as probabilistic inference, Trends in Cognitive Sciences, vol.20, issue.11, pp.818-829, 2016.

J. C. Goodman, P. S. Dale, and P. Li, Does frequency count? Parental input and the acquisition of vocabulary, Journal of Child Language, vol.35, issue.3, pp.515-531, 2008.

M. Morales, S. Scherer, and R. Levitan, A cross-modal review of indicators for depression detection systems, Proceedings of the Fourth Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, pp.1-12, 2017.

G. Coppersmith, K. Ngo, R. Leary, and A. Wood, Exploratory analysis of social media prior to a suicide attempt, pp.106-117, 2016.

G. Coppersmith, M. Dredze, C. Harman, and K. Hollingshead, From ADHD to SAD: Analyzing the language of mental health on Twitter through self-reported diagnoses, 2015.

R. Kshirsagar, R. Morris, and S. Bowman, Detecting and explaining crisis, Proceedings of the Fourth Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, pp.66-73, 2017.

J. H. Shen and F. Rudzicz, Detecting anxiety on Reddit, Proceedings of the Fourth Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, pp.58-65, 2017.

C. Cortes and V. Vapnik, Support-vector networks, Mach. Learn., vol.20, issue.3, pp.273-297, 1995.

D. M. Blei, A. Y. Ng, and M. I. Jordan, Latent Dirichlet Allocation, J. Mach. Learn. Res., vol.3, pp.993-1022, 2003.

J. Pennebaker, C. Chung, M. Ireland, A. Gonzales, and R. J. Booth, The development and psychometric properties of LIWC2007, Software Manual, 2007.

V. Masrani, G. Murray, T. Field, and G. Carenini, Detecting Dementia through retrospective analysis of routine blog posts by bloggers with Dementia, pp.232-237, 2017.

R. Hawkins and R. Boyd, Such stuff as dreams are made on: Dream language, LIWC norms, & personality correlates, Dreaming, vol.27, 2017.

M. Oak, A. Behera, T. Thomas, C. O. Alm, E. Prud'hommeaux et al., Generating clinically relevant texts: A case study on life-changing events, Proceedings of the Third Workshop on Computational Linguistics and Clinical Psychology, pp.85-94, 2016.

I. Lancashire and G. Hirst, Vocabulary changes in Agatha Christie's mysteries as an indication of Dementia: A case study, Cognitive Aging: Research and Practice, pp.8-10, 2009.

C. Pool and M. Nissim, Distant supervision for emotion detection using Facebook reactions, Proceedings of the Workshop on Computational Modeling of People's Opinions, Personality, and Emotions in Social Media (PEOPLES), pp.30-39, 2016.

D. Benikova, M. Wojatzki, and T. Zesch, What does this imply? Examining the impact of implicitness on the perception of hate speech, pp.978-981, 2018.

W. Warner and J. Hirschberg, Detecting hate speech on the World Wide Web, Proceedings of the Second Workshop on Language in Social Media, pp.19-26, 2012.

A. Schmidt and M. Wiegand, A survey on hate speech detection using natural language processing, Proceedings of the Fifth International Workshop on Natural Language Processing for Social Media, pp.1-10, 2017.

V. Pérez-Rosas, R. Mihalcea, K. Resnicow, S. Singh, and L. An, Building a motivational interviewing dataset, Proceedings of the Third Workshop on Computational Linguistics and Clinical Psychology, pp.42-51, 2016.

M. Wolf, A. Horn, M. Mehl, S. Haug, J. Pennebaker et al., Computergestützte quantitative Textanalyse: Äquivalenz und Robustheit der deutschen Version des Linguistic Inquiry and Word Count, Diagnostica, vol.54, pp.85-98, 2008.

R. H. Baayen, R. Piepenbrock, and H. van Rijn, The CELEX lexical data base, Linguistic Data Consortium, University of Pennsylvania, 1993.

A. Fine, A. F. Frank, T. F. Jaeger, and B. Van Durme, Biases in predicting the human language model, 52nd Annual Meeting of the Association for Computational Linguistics, ACL 2014: Proceedings of the Conference, vol.2, pp.7-12, 2014.

A. Stolcke, SRILM: An extensible language modeling toolkit, Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002), vol.2, 2002.

F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion et al., Scikit-learn: Machine learning in Python, J. Mach. Learn. Res., vol.12, pp.2825-2830, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00650905

R. N. Jørgensen, P. S. Dale, D. Bleses, and L. Fenson, CLEX: A cross-linguistic lexical norms database, Journal of Child Language, vol.37, issue.2, pp.305-314, 2010.

M. Võ, M. Conrad, L. Kuchinke, K. Urton, M. Hofmann et al., The Berlin affective word list reloaded (BAWL-R), Behavior Research Methods, vol.41, pp.534-542, 2009.

. Psychometric, The Free Dictionary, M. O Toole, 2018.

T. Mikolov, K. Chen, G. Corrado, and J. Dean, Efficient estimation of word representations in vector space, Proceedings of Workshop at ICLR, 2013.

K. Cho, B. van Merriënboer, C. Gulcehre, D. Bahdanau, F. Bougares et al., Learning phrase representations using RNN encoder-decoder for statistical machine translation, arXiv:1406.1078 [cs, stat], pp.1724-1734, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01433235

S. Hochreiter and J. Schmidhuber, Long short-term memory, Neural Comput, vol.9, issue.8, pp.1735-1780, 1997.

D. Scheffer and J. Kuhl, Der Operante Motiv-Test (OMT): Ein neuer Ansatz zur Messung impliziter Motive, pp.129-150, 2003.