A. Abeillé, L. Clément, and F. Toussenel, Building a Treebank for French, pp.165-187, 2003.

A. Akbik, D. Blythe, and R. Vollgraf, Contextual string embeddings for sequence labeling, Proceedings of the 27th International Conference on Computational Linguistics, pp.1638-1649, 2018.

G. Antoniadis, Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, vol.2, 2012.

A. Baevski, S. Edunov, Y. Liu, L. Zettlemoyer, and M. Auli, Cloze-driven pretraining of self-attention networks, 2019.

M. Baroni, S. Bernardini, A. Ferraresi, and E. Zanchetta, The WaCky wide web: A collection of very large linguistically processed web-crawled corpora, Language Resources and Evaluation, vol.43, pp.209-226, 2009.

R. Bawden, M. Botalla, K. Gerdes, and S. Kahane, Correcting and validating syntactic dependency in the spoken French treebank Rhapsodie, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pp.2320-2325, 2014.
URL : https://hal.archives-ouvertes.fr/halshs-01011059

O. Bonami and S. Beniamine, Implicative structure and joint predictiveness, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01178211

L. Burnard, The British National Corpus, version 3 (BNC XML Edition), 2007.

M. Candito and B. Crabbé, Improving generative statistical parsing with semi-supervised word clustering, Proc. of IWPT'09, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00495267

M. Candito and D. Seddah, Le corpus Sequoia : annotation syntaxique et exploitation pour l'adaptation d'analyseur par pont lexical (The Sequoia corpus: Syntactic annotation and use for a parser lexical domain adaptation method), Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, vol.2, pp.321-334, 2012.

M. Candito, G. Perrier, B. Guillaume, C. Ribeyre, K. Fort et al., Deep syntax annotation of the Sequoia French treebank, Proceedings of the Ninth International Conference on Language Resources and Evaluation, LREC 2014, pp.2298-2305, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00969191

M. Davies, The Corpus of Contemporary American English (COCA): 520 million words, 1990-present, 2008.

J. Devlin, M. Chang, K. Lee, and K. Toutanova, BERT: pre-training of deep bidirectional transformers for language understanding, Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2019, pp.4171-4186, 2019.

T. Dozat and C. D. Manning, Deep biaffine attention for neural dependency parsing, 5th International Conference on Learning Representations, 2017.

Y. Dupont, Exploration de traits pour la reconnaissance d'entités nommées du français par apprentissage automatique, 24e Conférence sur le Traitement Automatique des Langues Naturelles (TALN), p.42, 2017.

M. Fabre, S. Bhattasali, and J. Hale, Processing MWEs: Neurocognitive bases of verbal MWEs and lexical cohesiveness within MWEs, Proceedings of the 14th Workshop on Multiword Expressions (COLING 2018), 2018.

M. Fabre, S. Bhattasali, W. Luh, H. A. Saied, M. Constant et al., Localising memory retrieval and syntactic composition: an fMRI study of naturalistic language comprehension, Language, Cognition and Neuroscience, vol.34, pp.491-510, 2019.
URL : https://hal.archives-ouvertes.fr/hal-01930201

M. Fabre, S. Bhattasali, C. Pallier, and J. Hale, Modeling conventionalization and predictability in multiword expressions at the brain level, Proceedings of the Society for Computation in Linguistics, pp.2331-2336, 2020.

E. Grave, T. Mikolov, A. Joulin, and P. Bojanowski, Bag of tricks for efficient text classification, Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics, vol.2, pp.427-431, 2017.

E. Grave, P. Bojanowski, P. Gupta, A. Joulin, and T. Mikolov, Learning word vectors for 157 languages, Proceedings of the Eleventh International Conference on Language Resources and Evaluation, LREC 2018, 2018.

F. Hill, A. Bordes, S. Chopra, and J. Weston, The Goldilocks principle: Reading children's books with explicit memory representations, 2015.

Z. Huang, W. Xu, and K. Yu, Bidirectional LSTM-CRF models for sequence tagging, 2015.

A. Joulin, E. Grave, P. Bojanowski, M. Douze, H. Jégou et al., FastText.zip: Compressing text classification models, 2016.

P. Koehn, Europarl: A Parallel Corpus for Statistical Machine Translation, Conference Proceedings: the tenth Machine Translation Summit, pp.79-86, 2005.

A. Lacheret, S. Kahane, J. Beliao, A. Dister, K. Gerdes et al., Rhapsodie: a prosodic-syntactic treebank for spoken French, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pp.295-301, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00968959

J. D. Lafferty, A. McCallum, and F. C. Pereira, Conditional random fields: Probabilistic models for segmenting and labeling sequence data, Proceedings of the Eighteenth International Conference on Machine Learning, pp.282-289, 2001.

G. Lample, M. Ballesteros, S. Subramanian, K. Kawakami, and C. Dyer, Neural architectures for named entity recognition, The 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp.260-270, 2016.

Y. Liu, M. Ott, N. Goyal, J. Du, M. Joshi et al., RoBERTa: A robustly optimized BERT pretraining approach, 2019.

L. Martin, B. Muller, P. J. Ortiz Suárez, Y. Dupont, L. Romary et al., CamemBERT: a Tasty French Language Model, arXiv e-prints, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02445946

R. McDonald, J. Nivre, Y. Quirmbach-Brundage, Y. Goldberg, D. Das et al., Universal dependency annotation for multilingual parsing, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, vol.2, pp.92-97, 2013.

J. Nivre, M. Abrams, Ž. Agić, L. Ahrenberg, L. Antonsen et al., Faculty of Mathematics and Physics, 2018.

P. J. Ortiz Suárez, B. Sagot, and L. Romary, Asynchronous Pipeline for Processing Huge Corpora on Medium to Low Resource Infrastructures, 7th Workshop on the Challenges in the Management of Large Corpora, 2019.

M. E. Peters, M. Neumann, M. Iyyer, M. Gardner, C. Clark et al., Deep contextualized word representations, Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, vol.1, pp.2227-2237, 2018.

S. Petrov, D. Das, and R. McDonald, A universal part-of-speech tagset, 2011.

S. Pradhan, A. Moschitti, N. Xue, O. Uryupina, and Y. Zhang, CoNLL-2012 shared task: Modeling multilingual unrestricted coreference in OntoNotes, Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning - Proceedings of the Shared Task: Modeling Multilingual Unrestricted Coreference in OntoNotes, EMNLP-CoNLL, pp.1-40, 2012.

S. Pradhan, A. Moschitti, N. Xue, H. T. Ng, A. Björkelund et al., Towards robust linguistic analysis using OntoNotes, Proceedings of the Seventeenth Conference on Computational Natural Language Learning, pp.143-152, 2013.

C. Raffel, N. Shazeer, A. Roberts, K. Lee, S. Narang et al., Exploring the limits of transfer learning with a unified text-to-text transformer, 2019.

B. Sagot, M. Richard, and R. Stern, Annotation référentielle du corpus arboré de Paris 7 en entités nommées (Referential named entity annotation of the Paris 7 French Treebank), Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, vol.2, pp.535-542, 2012.

E. F. Tjong Kim Sang and F. De Meulder, Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition, Proceedings of the Seventh Conference on Natural Language Learning, pp.142-147, 2003.

E. F. Tjong Kim Sang, Introduction to the CoNLL-2002 shared task: Language-independent named entity recognition, Proceedings of the 6th Conference on Natural Language Learning, 2002.

M. Sanguinetti and C. Bosco, Harmonization and Development of Resources and Tools for Italian Natural Language Processing within the PARLI Project, Studies in Computational Intelligence, vol.589, pp.51-69, 2015.

A. Seker, A. More, and R. Tsarfaty, Universal morpho-syntactic parsing and the contribution of lexica: Analyzing the ONLP lab submission to the CoNLL 2018 shared task, Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pp.208-215, 2018.

M. Straka, J. Straková, and J. Hajič, Evaluating contextualized embeddings on 54 languages in POS tagging, lemmatization and dependency parsing, 2019.

M. Straka, UDPipe 2.0 prototype at CoNLL 2018 UD shared task, Proceedings of the CoNLL 2018 Shared Task: Multilingual Parsing from Raw Text to Universal Dependencies, pp.197-207, 2018.

J. Straková, M. Straka, and J. Hajič, Neural architectures for nested NER through linearization, Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019, vol.1, pp.5326-5331, 2019.