A. Abeillé, L. Clément, and F. Toussenel, Building a Treebank for French, 2003.
DOI : 10.1007/978-94-010-0201-1_10

B. Alex, C. Grover, E. Klein, and R. Tobin, Digitised historical text: Does it have to be mediOCRe?, Proceedings of KONVENS 2012, pp.401-409, 2012.

R. Beaufort and C. Mancas-Thillou, A Weighted Finite-State Framework for Correcting Errors in Natural Scene OCR, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Vol 2, pp.889-893, 2007.
DOI : 10.1109/ICDAR.2007.4377043

K. Bontcheva, M. Dimitrov, D. Maynard, V. Tablan, and H. Cunningham, Shallow methods for named entity coreference resolution, Proceedings of the TALN 2002 Conference, 2002.

D. Boswell, Language models for spelling correction, CSE, p.256, 2004.

E. Brill and R. C. Moore, An improved error model for noisy channel spelling correction, Proceedings of the 38th Annual Meeting on Association for Computational Linguistics, ACL '00, pp.286-293, 2000.
DOI : 10.3115/1075218.1075255

N. Chinchor, MUC-7 named entity task definition, Seventh Message Understanding Conference (MUC-7), 1998.

S. Cucerzan and E. Brill, Spelling correction as an iterative process that exploits the collective knowledge of web users, Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, pp.293-300, 2004.

P. Denis and B. Sagot, Exploitation d'une ressource lexicale pour la construction d'un étiqueteur morphosyntaxique état-de-l'art du français, Traitement Automatique des Langues Naturelles, 2010.

P. Denis and B. Sagot, Coupling an annotated corpus and a lexicon for state-of-the-art POS tagging, Language Resources and Evaluation, vol.20, issue.2, pp.721-736, 2012.
DOI : 10.1007/s10579-012-9193-0
URL : https://hal.archives-ouvertes.fr/inria-00614819

J. Francom and M. Hulden, Diacritic error detection and restoration via part-of-speech tags, Proceedings of the 6th Language and Technology Conference, 2013.

N. Friburger and D. Maurel, Finite-state transducer cascades to extract named entities in texts, Theoretical Computer Science, vol.313, pp.94-104, 2004.

A. Golding and Y. Schabes, Combining trigram-based and feature-based methods for context-sensitive spelling correction, Proceedings of the 34th annual meeting on Association for Computational Linguistics, pp.71-78, 1996.
DOI : 10.3115/981863.981873

M. Kernighan, K. Church, and W. Gale, A spelling correction program based on a noisy channel model, Proceedings of the 13th conference on Computational linguistics, pp.205-210, 1990.
DOI : 10.3115/997939.997975

S. Klein and M. Kopel, A voting system for automatic OCR correction, Proceedings of the Workshop On Information Retrieval and OCR: From Converting Content to Grasping Meaning, pp.1-21, 2002.

O. Kolak and P. Resnik, OCR error correction using a noisy channel model, Proceedings of the second international conference on Human Language Technology Research, pp.257-262, 2002.
DOI : 10.3115/1289189.1289208

O. Kolak and P. Resnik, OCR postprocessing for low density languages, Proceedings of the HLT-EMNLP Conference, pp.867-874, 2005.

W. Lund and E. Ringger, Improving optical character recognition through efficient multiple system alignment, Proceedings of the 2009 joint international conference on Digital libraries, JCDL '09, pp.231-240, 2009.
DOI : 10.1145/1555400.1555437

D. Maynard, V. Tablan, C. Ursu, H. Cunningham, and Y. Wilks, Named entity recognition from diverse text types, Proceedings of the Recent Advances in Natural Language Processing Conference, pp.257-274, 2001.

D. Maynard, V. Tablan, H. Cunningham, C. Ursu, H. Saggion et al., Architectural elements of language engineering robustness, Natural Language Engineering, vol.8, issue.2-3, pp.257-274, 2002.
DOI : 10.1017/S1351324902002930

E. Mays, F. Damerau, and R. Mercer, Context based spelling correction, Information Processing and Management, pp.517-522, 1991.
DOI : 10.1016/0306-4573(91)90066-u

M. Reynaert, Multilingual text induced spelling correction, Proceedings of the Workshop on Multilingual Linguistic Ressources, MLR '04, pp.117-117, 2004.
DOI : 10.3115/1706238.1706256

B. Sagot and P. Boullier, SxPipe 2: architecture pour le traitement pré-syntaxique de corpus bruts, Traitement Automatique des Langues, vol.49, issue.2, pp.155-188, 2008.
DOI : 10.1075/lis.30.07sag

B. Sagot, The Lefff, a freely available and large-coverage morphological and syntactic lexicon for French, Proceedings of LREC 2010, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00521242

E. F. Tjong Kim Sang and F. De Meulder, Introduction to the CoNLL-2003 shared task: Language-independent named entity recognition, Proceedings of Computational Natural Language Learning, pp.142-147, 2003.

J. Schaback, Multi-level feature extraction for spelling correction, IJCAI Workshop on Analytics for Noisy Unstructured Text Data, pp.79-86, 2007.

C. Shannon, A Mathematical Theory of Communication, Bell System Technical Journal, vol.27, issue.3, pp.379-423, 1948.
DOI : 10.1002/j.1538-7305.1948.tb01338.x

C. Strohmaier, C. Ringlstetter, K. Schulz, and S. Mihov, Lexical postcorrection of OCR-results: the web as a dynamic secondary dictionary?, Proceedings of the Seventh International Conference on Document Analysis and Recognition (ICDAR'03), pp.1133-1137, 2003.

X. Tong and D. Evans, A statistical approach to automatic OCR error correction in context, Proceedings of the Fourth Workshop on Very Large Corpora, pp.88-100, 1996.