J. Anis, Parlez-vous texto ? Guide des nouveaux langages du réseau, Le Cherche Midi, 2001.

J. Anis, Communication électronique scripturale et formes langagières, Actes des Quatrièmes Rencontres Réseaux Humain / Réseaux Technologiques, 2003.

A. Aw, M. Zhang, J. Xiao, and J. Su, A phrase-based statistical model for SMS text normalization, Proceedings of the COLING/ACL on Main conference poster sessions, pp.33-40, 2006.
DOI : 10.3115/1273073.1273078

D. Balasuriya, N. Ringland, J. Nothman, T. Murphy, and J. R. Curran, Named entity recognition in Wikipedia, Proceedings of the 2009 Workshop on The People's Web Meets NLP Collaboratively Constructed Semantic Resources, People's Web '09, pp.10-18, 2009.
DOI : 10.3115/1699765.1699767

M. Baranes, Vers la mise en place d'un correcteur grammatical. Mémoire, 2011.

M. Baranes, Vers la correction automatique de textes bruités : Architecture générale et détermination de la langue d'un mot inconnu, Actes de la conférence conjointe JEP-TALN-RECITAL 2012, pp.95-108, 2012.

M. Baranes and B. Sagot, A Language-Independent Approach to Extracting Derivational Relations from an Inflectional Lexicon, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pp.2793-2799, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01002723

M. Baroni, S. Bernardini, A. Ferraresi, and E. Zanchetta, The WaCky wide web: a collection of very large linguistically processed web-crawled corpora, Language Resources and Evaluation, vol.10, issue.4, pp.209-226, 2009.
DOI : 10.1007/s10579-009-9081-4

R. Beaufort, Composition filtrée et marqueurs de règles de réécriture pour une distance d'édition flexible. Application à la correction des mots hors-vocabulaire, Traitement Automatique des Langues (T.A.L), pp.11-40, 2010.

R. Beaufort, S. Roekhaut, L. Cougnon, and C. Fairon, A Hybrid Rule/Model-Based Finite-State Framework for Normalizing SMS Messages, Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL'10), pp.770-779, 2010.

K. R. Beesley, Language identifier : A computer program for automatic natural-language identification of on-line text, Languages as Crossroads : Proceedings of the 29th Annual Conference of the American Translators Association'88, pp.47-54, 1988.

D. Bernhard, Apprentissage non supervisé de familles morphologiques : Comparaison de méthodes et aspects multilingues, Traitement Automatique des Langues, vol.51, issue.2, pp.11-39, 2010.

D. M. Bikel, R. Schwartz, and R. M. Weischedel, An algorithm that learns what's in a name, Machine Learning, pp.211-231, 1999.

C. R. Blair, A program for correcting spelling errors, Information and Control, vol.3, issue.1, pp.60-67, 1960.
DOI : 10.1016/S0019-9958(60)90272-2

H. Blancafort, G. Recourcé, J. Couto, B. Sagot, R. Stern et al., Traitement des inconnus : une approche systématique de l'incomplétude lexicale, Actes de la 17ème conférence sur le Traitement Automatique des Langues Naturelles, pp.1-6, 2010.

L. Bloomfield, A Set of Postulates for the Science of Language, Language, vol.2, issue.3, pp.153-164, 1926.
DOI : 10.2307/408741

P. Boissière, J. Bouraoui, F. Vella, A. Lagarrigue, M. Mojahid et al., Méthodologie d'annotation des erreurs en production écrite. principes et résultats préliminaires, Actes de la 14ème conférence sur le Traitement Automatique des Langues Naturelles, pp.529-538, 2007.

A. Borthwick, J. Sterling, E. Agichtein, and R. Grishman, NYU : Description of the MENE named entity system as used in MUC-7, Proceedings of the Seventh Message Understanding Conference (MUC-7), 1998.

V. Bouvier and P. Bellot, Amélioration d'un corpus de requêtes à l'aide d'une méthode non-supervisée, In CORIA, pp.373-382, 2013.

A. Boyd, Pronunciation modeling in spelling correction for writers of English as a foreign language, Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Student Research Workshop and Doctoral Consortium, NAACL '09, pp.31-36, 2009.
DOI : 10.3115/1620932.1620938

E. Brill and R. C. Moore, An improved error model for noisy channel spelling correction, Proceedings of the 38th Annual Meeting on Association for Computational Linguistics, ACL '00, pp.286-293, 2000.
DOI : 10.3115/1075218.1075255

É. Brunelle, Antidote : Correcteur, dictionnaire et plus. BULAG (Bulletin de linguistique général et appliquée), pp.25-31, 2004.

R. Bunescu and M. Pasca, Using encyclopedic knowledge for named entity disambiguation, Proceedings of the 11th Conference of the European Chapter of the Association for Computational Linguistics (EACL-06), pp.9-16, 2006.

G. Burnage, Celex : A guide for users, 1990.

T. Cabré and R. Nazar, Towards a new approach to the study of neology, Neology and Specialised Translation 4th Joint Seminar Organised by the CVC and Termisti, 2011.

A. Cahill, N. Madnani, J. Tetreault, and D. Napolitano, Robust systems for preposition error correction using Wikipedia revisions, Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics : Human Language Technologies, pp.507-517, 2013.

A. Carlson and I. Fette, Memory-based context-sensitive spelling correction at web scale, Sixth International Conference on Machine Learning and Applications (ICMLA 2007), pp.166-171, 2007.
DOI : 10.1109/ICMLA.2007.50

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.122.8324

B. Cartoni, Constance et variabilité de l'incomplétude lexicale, Actes de la 8ème Rencontre des Étudiants Chercheurs en Informatique pour le Traitement Automatique des Langues, pp.661-669, 2006.

B. Cartoni, De l'incomplétude lexicale en traduction automatique : vers une approche morphosémantique multilingue, Thèse de doctorat, 2008.

W. B. Cavnar and J. M. Trenkle, N-gram based text categorization, Proceedings of the 3rd Annual Symposium on Document Analysis and Information Retrieval (SDAIR-94), pp.161-175, 1994.

S. Charest, É. Brunelle, J. Fontaine, and B. Pelletier, Élaboration automatique d'un dictionnaire de cooccurrences grand public, Actes de la 14ème conférence sur le Traitement Automatique des Langues Naturelles, pp.283-292, 2007.

M. Choudhury, R. Saraf, V. Jain, S. Sarkar, and A. Basu, Investigation and modeling of the structure of texting language, Proceedings of the IJCAI Workshop on "Analytics for Noisy Unstructured Text Data", pp.63-70, 2007.
DOI : 10.1007/s10032-007-0054-0

G. Chrupała, Normalizing tweets with edit scripts and recurrent neural embeddings, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp.680-686, 2014.
DOI : 10.3115/v1/P14-2111

K. W. Church and W. A. Gale, Probability scoring for spelling correction, Statistics and Computing, vol.1, issue.2, pp.93-103, 1991.

V. Claveau and M.-C. L'Homme, Structuring terminology by analogy-based machine learning, Proceedings of the 7th International Conference on Terminology and Knowledge Engineering (TKE'05), 2005.

L. Clément and É. Villemonte de La Clergerie, MAF : a morphosyntactic annotation framework, Proceedings of the 2nd Language and Technology Conference (LTC'05), pp.90-94, 2005.

D. Contractor, T. A. Faruquie, and L. V. Subramaniam, Unsupervised cleansing of noisy text, Proceedings of the 23rd International Conference on Computational Linguistics (COLING'10), pp.189-196, 2010.

P. Cook and S. Stevenson, An unsupervised model for text message normalization, Proceedings of the Workshop on Computational Approaches to Linguistic Creativity, CALC '09, pp.71-78, 2009.
DOI : 10.3115/1642011.1642021

B. Courtois, Un système de dictionnaires électroniques pour les mots simples du français, Langue française, vol.87, issue.1, 1990.
DOI : 10.3406/lfr.1990.6323

M. Creutz and K. Lagus, Inducing the morphological lexicon of a natural language from unannotated text, Proceedings of the International and Interdisciplinary Conference on Adaptive Knowledge Representation and Reasoning (AKRR'05), pp.106-113, 2005.

S. Cucerzan and E. Brill, Spelling correction as an iterative process that exploits the collective knowledge of Web users, Proceedings of EMNLP 2004, pp.293-300, 2004.

G. Dal and F. Namer, Génération et analyse automatiques de ressources lexicales construites utilisables en recherche d'informations, Traitement Automatique des Langues, vol.41, issue.2, pp.423-446, 2000.

R. Dale and A. Kilgarriff, Helping Our Own : The HOO 2011 pilot shared task, Proceedings of the 13th European Workshop on Natural Language Generation, pp.242-249, 2011.

R. Dale, I. Anisimoff, and G. Narroway, HOO 2012 : A report on the preposition and determiner error correction shared task, Proceedings of the Seventh Workshop on Building Educational Applications Using NLP, pp.54-62, 2012.

F. Damerau, A technique for computer detection and correction of spelling errors, Communications of the ACM, vol.7, issue.3, pp.171-176, 1964.
DOI : 10.1145/363958.363994

H. Daumé III, Notes on CG and LM-BFGS optimization of logistic regression, 2004.

P. Denis and B. Sagot, Coupling an annotated corpus and a lexicon for state-of-the-art POS tagging, Language Resources and Evaluation, vol.20, issue.2, pp.721-736, 2012.
DOI : 10.1007/s10579-012-9193-0

URL : https://hal.archives-ouvertes.fr/inria-00614819

E. Denoual, Analogical translation of unknown words in a statistical machine translation framework, Proceedings of Machine Translation Summit XI, European Association for Machine Translation, pp.135-141, 2007.

J. Didier, O. Hambursin, P. Moreau, and M. Seron, Le français m'a tuer, Actes du colloque "L'orthographe française à l'épreuve du supérieur", 2005.

A. Dister and C. Fairon, Extension des ressources lexicales grâce à un corpus dynamique, Lexicometrica, 2004.

T. Dunning, Statistical Identification of Language, 1994.

C. Dutrey and A. Peradotto, Analyse de forums de discussion pour la relation clients : du Text Mining au Web Content Mining, Actes des 11èmes Journées internationales d'Analyse statistique des Données Textuelles (JADT'12), 2012.

A. Ekbal, E. Sourjikova, A. Frank, and S. P. Ponzetto, Assessing the challenge of fine-grained named entity recognition and classification, Proceedings of the 2010 Named Entities Workshop (NEWS'10), pp.93-101, 2010.

C. Fairon, J. Klein, and S. Paumier, Le langage SMS : révélateur d'1compétence, "Le français m'a tuer". Actes du colloque "L'orthographe française à l'épreuve du supérieur", 2006.

I. Falk, D. Bernhard, and C. Gérard, From Non Word to New Word : Automatically Identifying Neologisms in French Newspapers, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), 2014.
URL : https://hal.archives-ouvertes.fr/hal-00959079

N. Farra, N. Tomeh, A. Rozovskaya, and N. Habash, Generalized Character-Level Spelling Error Correction, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pp.161-167, 2014.
DOI : 10.3115/v1/P14-2027

S. Federici, S. Montemagni, and V. Pirrelli, Inferring semantic similarity from distributional evidence : an analogy-based approach to word sense disambiguation, Automatic Information Extraction and Building of Lexical Semantic Resources for NLP Applications, pp.90-97, 1997.

J. Foster, "cba to check the spelling" : Investigating parser performance on discussion forum posts, Human Language Technologies : The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, pp.381-384, 2010.

N. Friburger, Reconnaissance automatique des noms propres : application à la classification automatique de textes journalistiques, Thèse de doctorat, 2002.

J. Gao, X. Li, D. Micol, C. Quirk, and X. Sun, A large scale ranker-based system for search query spelling correction, Proceedings of the 23rd International Conference on Computational Linguistics (CoLing'10), pp.358-366, 2010.

C. Gdaniec, E. Manandise, and M. C. McCord, Derivational Morphology to the Rescue : How It Can Help Resolve Unfound Words in MT, Machine Translation Summit VIII, European Association for Machine Translation, 2001.

A. R. Golding and D. Roth, Applying Winnow to Context-Sensitive Spelling Correction, Proceedings of the 13th International Conference on Machine Learning (ICML'96), pp.182-190, 1996.

A. R. Golding and Y. Schabes, Combining Trigram-based and Feature-based Methods for Context-sensitive Spelling Correction, Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics (ACL'96), pp.71-78, 1996.

J. Goldsmith, Unsupervised Learning of the Morphology of a Natural Language, Computational Linguistics, vol.16, issue.1, pp.153-198, 2001.
DOI : 10.1016/0306-4573(78)90067-5

J. Gosme and Y. Lepage, Structure des trigrammes inconnus et lissage par analogie, Actes de la 18ème conférence sur le Traitement Automatique des Langues Naturelles (TALN 2011), 2011.

F. Gotti, P. Langlais, G. Lapalme, S. Charest, and É. Brunelle, Atténuation des surdétections d'un correcteur grammatical de qualité commerciale, TAL, vol.53, issue.3, pp.33-66, 2012.

S. Gouws, D. Hovy, and D. Metzler, Unsupervised mining of lexical variants from noisy text, Proceedings of the First workshop on Unsupervised Learning in NLP, pp.82-90, 2011.

G. Grefenstette, Comparing two language identification schemes, Proceedings of the 3rd International Conference on the Statistical Analysis of Textual Data (JADT'95), 1995.

É. Guimier-de-neef and S. Fessard, Évaluation d'un système de transcription de SMS, Proceedings of the 26th International Conference on Lexis and Grammar, 2007.

B. Han and T. Baldwin, Lexical normalisation of short text messages : Makn sens a #twitter, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics : Human Language Technologies (HLT'11), pp.368-378, 2011.

M. Haspelmath, The indeterminacy of word segmentation and the nature of morphology and syntax, Folia Linguistica, vol.45, issue.1, pp.31-80, 2011.
DOI : 10.1515/flin.2011.002

H. Hassan and A. Menezes, Social text normalization using contextual graph random Walks, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp.1577-1586, 2013.

N. Hathout, From Wordnet to CELEX : acquiring morphological links from dictionaries of synonyms, Proceedings of the Third International Conference on Language Resources and Evaluation (LREC'02), îles Canaries, 2002.

N. Hathout, Acquistion of the morphological structure of the lexicon based on lexical similarity and formal analogy, Proceedings of the 3rd Textgraphs Workshop on Graph-Based Algorithms for Natural Language Processing, TextGraphs '08, pp.1-8, 2008.
DOI : 10.3115/1627328.1627329

N. Hathout, Morphonette : a morphological network of French, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00485503

N. Hathout and F. Namer, Règles et paradigmes en morphologie informatique lexématique, Actes de la 18ème conférence sur le Traitement Automatique des Langues Naturelles, pp.215-220, 2011.

N. Hathout, F. Sajous, and B. Calderone, GLÀFF, a Large Versatile French Lexicon, Proceedings of the 9th International Conference on Language Resources and Evaluation (LREC'14), 2014.

K. Heafield, KenLM : faster and smaller language model queries, Proceedings of the EMNLP 2011 Sixth Workshop on Statistical Machine Translation, pp.187-197, 2011.

N. C. Ingle, A language identification table, The Incorporated Linguist, pp.98-101, 1976.

A. Islam and D. Inkpen, Real-word spelling correction using Google Web 1T 3-grams, Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing Volume 3, EMNLP '09, pp.1241-1249, 2009.
DOI : 10.3115/1699648.1699670

F. Issac, Cybernéologisme : Quelques outils informatiques pour l'identification et le traitement des néologismes sur le web, Langages, vol.183, issue.3, 2011.
DOI : 10.3917/lang.183.0089

URL : http://www.cairn.info/load_pdf.php?ID_ARTICLE=LANG_183_0089

C. Jacquet-Pfau, Correcteurs orthographiques et grammaticaux : quel(s) outil(s) pour quel rédacteur ?, Revue française de linguistique appliquée, pp.81-94, 2001.

S. Johnson, Solving the problem of language recognition, 1993.

M. Junczys-dowmunt and R. Grundkiewicz, The AMU System in the CoNLL-2014 Shared Task: Grammatical Error Correction by Data-Intensive and Feature-Rich Statistical Machine Translation, Proceedings of the Eighteenth Conference on Computational Natural Language Learning: Shared Task, pp.25-33, 2014.
DOI : 10.3115/v1/W14-1703

J. Kazama and K. Torisawa, Exploiting Wikipedia as external knowledge for named entity recognition, Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL'07), pp.698-707, 2007.

M. D. Kernighan, K. W. Church, and W. A. Gale, A Spelling Correction Program Based on a Noisy Channel Model, Proceedings of the 13th International Conference on Computational Linguistics, pp.205-210, 1990.

C. Kobus, F. Yvon, and G. Damnati, Transcrire les SMS comme on reconnaît la parole, Actes de la 15ème conférence sur le Traitement Automatique des Langues Naturelles, pp.128-138, 2008.

K. Kukich, Techniques for automatically correcting words in text, ACM Computing Surveys, vol.24, issue.4, pp.377-439, 1992.
DOI : 10.1145/146370.146380

L. Djamel-eddine, Problématique du mot comme unité linguistique, L'unité en sciences du langage, Actes des 9èmes journée scientifique du réseau thématique, pp.11-26, 2011.

J. D. Lafferty, A. McCallum, and F. C. N. Pereira, Conditional random fields : Probabilistic models for segmenting and labeling sequence data, Proceedings of the Eighteenth International Conference on Machine Learning (ICML'01), pp.282-289, 2001.

P. Langlais, Étude quantitative de liens entre l'analogie formelle et la morphologie constructionnelle, Actes de la 16è Conférence sur le Traitement Automatique des Langues Naturelles (TALN'09), 2009.

P. Langlais, Mapping source to target strings without alignment by analogical learning : A case study with transliteration, Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pp.684-689, 2013.

P. Langlais and A. Patry, Enrichissement d'un lexique bilingue par analogie, Actes de la 14e conférence sur le Traitement Automatique des Langues Naturelles, pp.101-110, 2007.

P. Langlais and A. Patry, Enrichissement d'un lexique bilingue par apprentissage analogique, pp.13-40, 2008.

P. Langlais and F. Yvon, Scaling up analogical learning, Proceedings of the 22nd International Conference on Computational Linguistics, pp.51-54, 2008.

P. Langlais, F. Yvon, and P. Zweigenbaum, Improvements in analogical learning, Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL '09, pp.487-495, 2009.
DOI : 10.3115/1609067.1609121

D. Laurent, Les vraies difficultés du français au XXIe siècle, Synapse Développement, 2012.

D. Laurent, S. Nègre, and P. Séguéla, Apport des cooccurrences à la correction et à l'analyse syntaxique, Actes de la 16ème conférence sur le Traitement Automatique des Langues Naturelles, 2009.

D. Laurent, S. Nègre, and P. Séguéla, L'analyseur syntaxique Cordial dans Passage, Actes de la 16ème conférence sur le Traitement Automatique des Langues Naturelles, 2009.

J. Lavallée and P. Langlais, Moranapho : un système multilingue d'analyse morphologique fondé sur l'analogie formelle, Traitement Automatique des Langues, vol.52, issue.2, pp.17-44, 2011.

J. Lee and S. Seneff, Correcting misuse of verb forms, Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics (ACL'08), pp.174-182, 2008.

Y. Lepage, Solving analogies on Words : An algorithm, Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics (COLING-ACL'98), pp.728-735, 1998.

Y. Lepage, Proportional analogy in written language data, Language Production, Cognition, and the Lexicon, pp.151-173, 2015.

Y. Lepage and E. Denoual, Purest ever example-based machine translation: Detailed presentation and assessment, Machine Translation, vol.23, issue.3-4, pp.251-282, 2005.
DOI : 10.1007/s10590-006-9010-x

URL : https://hal.archives-ouvertes.fr/hal-00260994

Y. Lepage, A. Lardilleux, J. Gosme, and J. Manguin, The GREYC machine translation system for the IWSLT 2008 evaluation campaign, Proceedings of the 5th International Workshop on Spoken Language Translation, 2008.

Y. Lepage, A. Lardilleux, and J. Gosme, The GREYC translation memory for the IWSLT 2009 evaluation campaign : one step beyond translation memory, The 6th International Workshop on Spoken Language Translation (IWSLT'09), pp.45-49, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00447076

V. I. Levenshtein, Binary Codes Capable of Correcting Deletions, Insertions and Reversals, Soviet Physics Doklady, vol.10, issue.8, p.707, 1966.

M. Li, Y. Zhang, M. Zhu, and M. Zhou, Exploring distributional similarity based models for query spelling correction, Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the ACL, ACL '06, pp.1025-1032, 2006.
DOI : 10.3115/1220175.1220304

F. Liu, F. Weng, B. Wang, and Y. Liu, Insertion, deletion, or substitution? Normalizing text messages without pre-categorization nor supervision, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics : Human Language Technologies (HLT'11), pp.71-76, 2011.

F. Liu, F. Weng, and X. Jiang, A broad-coverage normalization system for social media language, Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics (ACL'12), pp.1035-1044, 2012.

M. Lui and T. Baldwin, Accurate Language Identification of Twitter Messages, Proceedings of the 5th Workshop on Language Analysis for Social Media (LASM), pp.17-25, 2014.
DOI : 10.3115/v1/W14-1303

P. Magistry, Unsupervised Word Segmentation and Wordhood Assessment : The case for Mandarin Chinese, Thèse de doctorat, 2013.

L. Mangu and E. Brill, Automatic Rule Acquisition for Spelling Correction, Proceedings of ICML'97, pp.187-194, 1997.

A. Martinet, Syntaxe générale. Collection U, 1985.

B. Martins and M. J. Silva, Language identification in web pages, Proceedings of the 2005 ACM symposium on Applied computing, SAC '05, pp.764-768, 2005.
DOI : 10.1145/1066677.1066852

M. Mathieu-colas, Flexion des noms et des adjectifs composés : principes de codage, 2010.

D. Maurel, Les mots inconnus sont-ils des noms propres, pp.776-784, 2004.

D. Maurel, Prolexbase : a multilingual relational lexical database of proper names, Proceedings of the 6th edition of the Language Resources and Evaluation Conference (LREC'08), pp.334-338, 2008.
URL : https://hal.archives-ouvertes.fr/hal-01024056

A. Max and G. Wisniewski, Mining naturally-occurring corrections and paraphrases from Wikipedia's revision history, Proceedings of the 7th edition of the Language Resources and Evaluation Conference (LREC'10), pp.3143-3148, 2010.

A. McCallum and W. Li, Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons, Proceedings of the seventh conference on Natural language learning at HLT-NAACL 2003, pp.188-191, 2003.
DOI : 10.3115/1119176.1119206

A. McCallum, D. Freitag, and F. C. N. Pereira, Maximum entropy Markov models for information extraction and segmentation, Proceedings of the Seventeenth International Conference on Machine Learning (ICML'00), pp.591-598, 2000.

D. D. McDonald, Internal and external evidence in the identification and semantic categorization of proper names, Corpus Processing for Lexical Acquisition, pp.21-39, 1996.

A. Meillet, Linguistique historique et linguistique générale, Slatkine, 1921.

A. Mikheev, Automatic rule induction for unknown-word guessing, Computational Linguistics, vol.23, issue.3, pp.405-423, 1997.

A. Mikheev, Document centered approach to text normalization, Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR '00, pp.136-143, 2000.
DOI : 10.1145/345508.345564

A. Mikheev, M. Moens, and C. Grover, Named Entity recognition without gazetteers, Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics, pp.1-8, 1999.
DOI : 10.3115/977035.977037

URL : http://acl.ldc.upenn.edu/E/E99/E99-1001.pdf

S. Miller, J. Guinness, and A. Zamanian, Name tagging with word clusters and discriminative training, Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, pp.337-342, 2004.

R. Mitton, English Spelling and the computer, 1996.

R. Mitton, Ordering the suggestions of a spellchecker without using context, Natural Language Engineering, vol.21, issue.02, pp.173-192, 2009.
DOI : 10.1111/1467-8535.00174

R. Mitton, Fifty years of spellchecking, Writing Systems Research, vol.21, issue.1, pp.1-7, 2010.
DOI : 10.1093/wsr/wsq004

M. A. Molinero and B. Sagot, A morphological and syntactic wide-coverage lexicon for Spanish : The Leffe, Proceedings of the International Conference on Recent Advances in Natural Language Processing, pp.264-269, 2009.
URL : https://hal.archives-ouvertes.fr/inria-00616693

F. Moreau, V. Claveau, and P. Sébillot, Automatic Morphological Query Expansion Using Analogy-Based Machine Learning, Proceedings of 29th European Conference on IR Research (ECIR'07), pp.222-233, 2007.
DOI : 10.1007/978-3-540-71496-5_22

P. Nakov, Y. Bonev, G. Angelova, E. Cius, and W. von Hahn, Guessing morphological classes of unknown German nouns, Recent Advances in Natural Language Processing (RANLP'2003), pp.319-326, 2003.
DOI : 10.1075/cilt.260.39nak

F. Namer, FLEMM : un analyseur flexionnel du français à base de règles, pp.523-547, 2000.

R. Nazar and I. Renau, Google books n-gram corpus used as a grammar checker, Proceedings of the Second Workshop on Computational Linguistics and Writing (CLW 2012) : Linguistic and Cognitive Aspects of Document Creation and Document Engineering, EACL 2012, pp.27-34, 2012.

M. Ndiaye and A. Vandeventer-faltin, Correcteur orthographique adapté à l'apprentissage du français. BULAG (Bulletin de linguistique général et appliquée), pp.117-134, 2004.

S. Neuvel and S. A. Fulop, Unsupervised learning of morphology without morphemes, Proceedings of the ACL-02 workshop on Morphological and phonological learning, 2002.
DOI : 10.3115/1118647.1118651

B. New, Lexique 3 : Une nouvelle base de données lexicales, Actes de la 13ème conférence sur le Traitement Automatique des Langues Naturelles, pp.892-900, 2006.

B. New, C. Pallier, and L. Ferrand, Une base de données lexicales du français contemporain sur internet : Lexique™, L'année psychologique, pp.447-462, 2001.

B. New, C. Pallier, M. Brysbaert, and L. Ferrand, Lexique 2 : A new French lexical database, Behavior Research Methods, Instruments, & Computers, vol.103, issue.3, pp.516-524, 2004.
DOI : 10.3758/BF03195598

H. T. Ng, S. M. Wu, Y. Wu, C. Hadiwinoto, and J. R. Tetreault, The CoNLL-2013 shared task on grammatical error correction, Proceedings of the Seventeenth Conference on Computational Natural Language Learning : Shared Task, pp.1-12, 2013.

H. T. Ng, S. M. Wu, T. Briscoe, C. Hadiwinoto, R. H. Susanto et al., The CoNLL-2014 shared task on grammatical error correction, Proceedings of the Eighteenth Conference on Computational Natural Language Learning : Shared Task, pp.1-14, 2014.

J. Nothman, T. Murphy, and J. R. Curran, Analysing Wikipedia and gold-standard corpora for NER training, Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics (EACL'09), pp.612-620, 2009.

D. Nouvel, Reconnaissance des entités nommées par exploration de règles d'annotation, Thèse de doctorat, 2012.

K. Oflazer, Error-tolerant finite-state recognition with applications to morphological analysis and spelling correction, Computational Linguistics, vol.22, issue.1, pp.73-89, 1996.

Y. A. Park and R. Levy, Automated whole sentence grammar correction using a noisy channel model, Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics : Human Language Technologies (ACL-HLT'11), pp.934-944, 2011.

J. Pedler and R. Mitton, A large list of confusion sets for spellchecking assessed against a corpus of real-word errors, Proceedings of the Seventh Conference on International Language Resources and Evaluation (LREC'10), pp.755-762, 2010.

D. Pennell and Y. Liu, A character-level machine translation approach for normalization of SMS abbreviations, Fifth International Joint Conference on Natural Language Processing, pp.974-982, 2011.

M. Pironneau, É. Brunelle, and S. Charest, Pronoun anaphora resolution for automatic correction of grammatical errors (correction automatique par résolution d'anaphores pronominales) [in French], Proceedings of TALN 2014, pp.113-124, 2014.

A. Polguère, Lexicologie et sémantique lexicale Notions fondamentales, Nouvelle édition revue et augmentée. Les Presses de l, 2008.

J. J. Pollock and A. Zamora, Automatic spelling correction in scientific and scholarly text, Communications of the ACM, vol.27, issue.4, pp.358-368, 1984.

B. Pottier, Introduction à l'étude des structures grammaticales fondamentales, Publications linguistiques de la Faculté des Lettres et des Sciences Humaines de Nancy I, 1962.

J. Pruvost and J. Sablayrolles, Les néologismes, 2012.

Q. Chen, M. Li, and M. Zhou, Improving Query Spelling Correction Using Web Search Results, Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL'07), pp.181-189, 2007.

C. Raymond and J. Fayolle, Reconnaissance robuste d'entités nommées sur de la parole transcrite automatiquement, Conférence Traitement automatique des langues naturelles, 2010.

X. Ren and F. Perrault, The typology of unknown words, Proceedings of the 14th conference on Computational linguistics, pp.408-414, 1992.
DOI : 10.3115/992066.992132

A. Renouf, A word in time : First findings from the investigation of dynamic text, English Language Corpora : Design, Analysis and Exploitation, pp.279-288, 1993.

M. Reynaert, Multilingual text induced spelling correction, Proceedings of the Workshop on Multilingual Linguistic Resources, MLR '04, pp.117-117, 2004.
DOI : 10.3115/1706238.1706256

R. Rhouma and P. Langlais, Fourteen light tasks for comparing analogical learning and phrased-based machine translation, Proceedings of the 25th International Conference on Computational Linguistics (COLING-2014), pp.444-454, 2014.

C. Ringlstetter, K. U. Schulz, and S. Mihov, Orthographic Errors in Web Pages: Toward Cleaner Web Corpora, Computational Linguistics, vol.19, issue.1, pp.295-340, 2006.
DOI : 10.1007/s100320050018

L. Romary, S. Salmon-alt, and G. Francopoulo, Standards going concrete, Proceedings of the Workshop on Enhancing and Using Electronic Dictionaries, ElectricDict '04, pp.22-28, 2004.
DOI : 10.3115/1610042.1610047

URL : https://hal.archives-ouvertes.fr/inria-00100195

A. Rozovskaya, Automated methods for text correction, Thèse de doctorat, 2013.

A. Rozovskaya and D. Roth, Generating Confusion Sets for Context-Sensitive Error Correction, Proceedings of the 2010 Conference on Empirical Methods on Natural Language Processing (EMNLP'10), pp.961-970, 2010.

A. Rozovskaya, D. Roth, and V. Srikumar, Correcting Grammatical Verb Errors, Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics, pp.358-367, 2014.
DOI : 10.3115/v1/E14-1038

J. Sablayrolles, Néologie et dictionnaire(s) comme corpus d'exclusion, Néologie et terminologie dans les dictionnaires, Lexica, pp.19-36, 2008.

I. A. Sag, T. Baldwin, F. Bond, A. Copestake, and D. Flickinger, Multi-word expressions : A pain in the neck for NLP, Proceedings of the 3rd International Conference on Intelligent Text Processing and Computational Linguistics (CICLing'2002), pp.1-15, 2002.

B. Sagot, The Lefff, a freely available and large-coverage morphological and syntactic lexicon for French, Proceedings of the Seventh international conference on Language Resources and Evaluation, pp.2744-2751, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00521242

B. Sagot, DeLex, a freely-avaible, large-scale and linguistically grounded morphological lexicon for German, Proceedings of the 9th international conference on Language Resources and Evaluation, pp.2778-2784, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01022288

B. Sagot, Informatiser le lexique : Modélisation, développement et utilisation de lexiques morphologiques, syntaxiques et sémantiques. Habilitation à diriger des recherches en mathématiques et en informatique, 2016.

B. Sagot and P. Boullier, SxPipe 2 : architecture pour le traitement présyntaxique de corpus bruts, Traitement Automatique des Langues, vol.49, issue.2, pp.155-188, 2008.
DOI : 10.1075/lis.30.07sag

B. Sagot and K. Gábor, Détection et correction automatique d'entités nommées dans des corpus OCRisés, Traitement Automatique du Langage Naturel 2014, pp.437-442, 2014.

B. Sagot and R. Stern, Aleda, a free large-scale entity database for French, Proceedings of the 8th edition of the Language Resources and Evaluation Conference (LREC'12), pp.1273-1276, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00699300

B. Sagot, D. Nouvel, V. Mouilleron, and M. Baranes, Extension dynamique de lexiques morphologiques pour le français à partir d'un flux textuel, Actes de la 20ème conférence sur le Traitement Automatique des Langues Naturelles, pp.407-420, 2013.

F. Sajous, N. Hathout, and B. Calderone, Ne jetons pas le Wiktionnaire avec l'oripeau du Web ! Etudes et réalisations fondées sur le dictionnaire collaboratif, Actes du 4e Congrès Mondial de Linguistique Française, pp.663-680, 2014.
DOI : 10.1051/shsconf/20140801216

F. de Saussure, Cours de Linguistique Générale, 1916.

Y. Scherrer and B. Sagot, A language-independent and fully unsupervised approach to lexicon induction and part-of-speech tagging for closely related languages, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14), pp.502-508, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01022298

D. Seddah, B. Sagot, M. Candito, V. Mouilleron, and V. Combet, The French Social Media Bank : a treebank of noisy user generated content, Proceedings of the 24th International Conference on Computational Linguistics, pp.2441-2457, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00780895

H. Seo, J. Lee, S. Kim, K. Lee, S. Kang et al., A meta learning approach to grammatical error correction, Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, ACL'12, pp.328-332, 2012.

P. Sibun and J. C. Reynar, Language identification : Examining the issues, Proceedings of the 5th Symposium on Document Analysis and Information Retrieval (SDAIR-96), pp.125-135, 1996.

M. Silberztein, Le dictionnaire électronique des mots composés, Langue française, vol.87, issue.1, pp.71-83, 1990.
DOI : 10.3406/lfr.1990.6328

M. Silberztein, Dictionnaires électroniques et analyse automatique de textes : Le système Intex, 1993.

R. Sproat, A. W. Black, S. F. Chen, S. Kumar, M. Ostendorf et al., Normalization of non-standard words, Computer Speech & Language, vol.15, issue.3, pp.287-333, 2001.
DOI : 10.1006/csla.2001.0169

H. Stehouwer and M. van Zaanen, Language models for contextual error detection and correction, Proceedings of the EACL 2009 Workshop on Computational Linguistic Aspects of Grammatical Inference, CLAGI '09, pp.41-48, 2009.
DOI : 10.3115/1705475.1705482

R. Stern and B. Sagot, Détection et résolution d'entités nommées dans des dépêches d'agence, Actes de la 17e conférence sur le Traitement Automatique des Langues Naturelles, 2010.

N. Stroppa and F. Yvon, An analogical learner for morphological analysis, Proceedings of the Ninth Conference on Computational Natural Language Learning, CONLL '05, pp.120-127, 2005.
DOI : 10.3115/1706543.1706565

N. Stroppa and F. Yvon, Du quatrième de proportion comme principe inductif : une proposition et son application à l'apprentissage de la morphologie, Traitement Automatique des Langues, vol.47, issue.1, pp.33-59, 2006.

P. Suignard and S. Kerroua, Utilisation de contextes pour la correction automatique ou semi-automatique de réclamations clients, Actes de la 20ème conférence sur le Traitement Automatique des Langues Naturelles, pp.699-706, 2013.

R. Susanto, P. Phandi, and H. T. Ng, System Combination for Grammatical Error Correction, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.951-962, 2014.
DOI : 10.3115/v1/D14-1102

L. Tanguy and N. Hathout, Webaffix : un outil d'acquisition morphologique dérivationnelle à partir du Web, Actes de la 9ème conférence sur le Traitement Automatique des Langues Naturelles, pp.245-254, 2002.

A. Tatossian, Clavardage et orthographe. Correspondance, Centre collégial de développement de matériel didactique (CCDMD), pp.26-29, 2011.

J. Tournier, Introduction descriptive à la lexicogénétique de l'anglais contemporain, 1985.

J. Tournier, Précis de lexicologie anglaise, réédition. Ellipses, 2004.

K. Toutanova and R. C. Moore, Pronunciation modeling for improved spelling correction, Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, ACL '02, pp.144-151, 2002.
DOI : 10.3115/1073083.1073109

S. Vienney and C. Melian, La correction automatique du langage des nouvelles formes de communication écrite, Bulag, vol.29, pp.179-192, 2004.

J. Véronis, Computerized correction of phonographic errors, Computers and the Humanities, vol.21, issue.1, pp.43-56, 1988.
DOI : 10.1007/BF00056348

J. Véronis and É. Guimier-de-neef, Compréhension automatique des langues et interaction, Chapitre : Le traitement des nouvelles formes de communication écrite, Hermès Science, pp.227-248, 2006.

G. Walther and B. Sagot, Problèmes d'intégration morphologique d'emprunts d'origine anglaise en français, 30th International Conference on Lexis and Grammar, 2011.

C. Whitelaw, B. Hutchinson, G. Y. Chung, and G. Ellis, Using the web for language independent spellchecking and autocorrection, Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing Volume 2, EMNLP '09, pp.890-899, 2009.
DOI : 10.3115/1699571.1699629

G. Wisniewski, A. Max, and F. Yvon, Recueil et analyse d'un corpus écologique de corrections orthographiques extrait des révisions de Wikipédia, Actes de la 17ème conférence sur le Traitement Automatique des Langues Naturelles, 2010.

W. Xu, J. Tetreault, M. Chodorow, R. Grishman, and L. Zhao, Exploiting Syntactic and Distributional Information for Spelling Correction with Web-Scale N-gram Models, Proceedings of the 2011 Conference on Empirical Methods on Natural Language Processing (EMNLP'11), pp.1291-1300, 2011.

Y. Yang and J. Eisenstein, A log-linear model for unsupervised text normalization, Proceedings of Empirical Methods for Natural Language Processing (EMNLP'13), pp.61-72, 2013.

F. Yvon, Pronouncing unknown words using multi-dimensional analogies, Proceedings of the Conference of the European Speech Communication Association (EuroSpeech), pp.199-202, 1999.

F. Yvon, Paradigmatic cascades : A linguistically sound model of pronunciation by analogy, Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics, pp.428-435, 1997.

F. Yvon, spellChecker : un système de correction automatique fondé sur des automates probabilistes. Livrable du Projet TRACE (ANR-09-CORD-023), 2011. Accessible à l'URL http

M. Zampieri, Using bag-of-words to distinguish similar languages: How efficient are they?, 2013 IEEE 14th International Symposium on Computational Intelligence and Informatics (CINTI), pp.37-41, 2013.
DOI : 10.1109/CINTI.2013.6705230

T. Zesch, Detecting malapropisms using measures of contextual fitness, TAL, vol.53, issue.3, pp.11-31, 2012.

P. Zumthor, Introduction aux problèmes de l'archaïsme. Cahiers de l'Association internationale des études françaises, pp.11-26, 1967.