T. Baldwin, P. Cook, M. Lui, A. Mackinlay, and L. Wang, How noisy social media text, how diffrnt social media sources?, Proceedings of the Sixth International Joint Conference on Natural Language Processing, pp.356-364, 2013.

T. Baldwin, M.-C. de Marneffe, B. Han, Y. Kim, A. Ritter et al., Shared tasks of the 2015 workshop on noisy user-generated text: Twitter lexical normalization and named entity recognition, Proceedings of the Workshop on Noisy User-generated Text, pp.126-135, 2015.

R. Beckley, Bekli: A simple approach to Twitter text normalization, Proceedings of the Workshop on Noisy User-generated Text, pp.82-86, 2015.

G. Berend and E. Tasnádi, USZEGED: Correction type-sensitive normalization of English tweets using efficiently indexed n-gram statistics, Proceedings of the Workshop on Noisy User-generated Text, pp.120-125, 2015.

J. Devlin, M. Chang, K. Lee, and K. Toutanova, BERT: Pre-training of deep bidirectional transformers for language understanding, 2018.

J. Eisenstein, What to do about bad language on the internet, HLT-NAACL, 2013.

J. Foster, "cba to check the spelling": Investigating parser performance on discussion forum posts, NAACL, 2010.

R. van der Goot and G. van Noord, Modeling input uncertainty in neural network dependency parsing, Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pp.4984-4991, 2018.

R. van der Goot, B. Plank, and M. Nissim, To normalize, or not to normalize: The impact of normalization on part-of-speech tagging, 2017.

S. Gopalakrishnan, Z. Marzi et al., Combating adversarial attacks using sparse representations, 2018.

B. Han and T. Baldwin, Lexical normalisation of short text messages: Makn sens a #twitter, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, vol.1, pp.368-378, 2011.

J. Hewitt and C. D. Manning, A Structural Probe for Finding Syntax in Word Representations, Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019.

G. Jawahar, B. Sagot, and D. Seddah, What does BERT learn about the structure of language?, Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL), 2019.
URL : https://hal.archives-ouvertes.fr/hal-02131630

D. Jurafsky, Speech & language processing, 2018.

D. P. Kingma and J. Ba, Adam: A method for stochastic optimization, 2014.

G. Lample and A. Conneau, Cross-lingual language model pretraining, 2019.

C. Li and Y. Liu, Improving text normalization using character-blocks based models and system combination, Proceedings of COLING 2012, pp.1587-1602, 2012.

P. Michel and G. Neubig, MTNT: A testbed for machine translation of noisy text, 2018.

S. Moon, L. Neves, and V. Carvalho, Multimodal named entity recognition for short social media posts, 2018.

M. E. Peters, M. Neumann, M. Iyyer, M. Gardner et al., Deep contextualized word representations, 2018.

P. Ruiz, M. Cuadros, and T. Etchegoyhen, Lexical normalization of Spanish tweets with rule-based components and language models, Procesamiento del Lenguaje Natural, p.8, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01099241

M. Schuster and K. Nakajima, Japanese and Korean voice search, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.5149-5152, 2012.

D. Seddah, B. Sagot, and M. Candito, The French Social Media Bank: a Treebank of Noisy User Generated Content, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00780898

D. Supranovich and V. Patsepnia, IHS_RD: Lexical normalization for English tweets, Proceedings of the Workshop on Noisy User-generated Text, pp.78-81, 2015.

I. Sutskever, O. Vinyals, and Q. Le, Sequence to sequence learning with neural networks, Advances in neural information processing systems, pp.3104-3112, 2014.

K. Xu, Y. Xia, and C. Lee, Tweet normalization with syllables, Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, vol.1, pp.920-928, 2015.

Y. Yang and J. Eisenstein, A log-linear model for unsupervised text normalization, Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, pp.61-72, 2013.