Building a Treebank for French, 2003. ,
DOI : 10.1007/978-94-010-0201-1_10
Digitised historical text: Does it have to be mediOCRe?, Proceedings of KONVENS 2012, pp.401-409, 2012. ,
A Weighted Finite-State Framework for Correcting Errors in Natural Scene OCR, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Vol 2, pp.889-893, 2007. ,
DOI : 10.1109/ICDAR.2007.4377043
Shallow methods for named entity coreference resolution, Proceedings of the TALN 2002 Confer- ence, 2002. ,
Language models for spelling correction, CSE, p.256, 2004. ,
An improved error model for noisy channel spelling correction, Proceedings of the 38th Annual Meeting on Association for Computational Linguistics , ACL '00, pp.286-293, 2000. ,
DOI : 10.3115/1075218.1075255
Muc-7 named entity task definition, Seventh Message Understanding Conference (MUC-7), 1998. ,
Spelling correction as an iterative process that exploits the collective knowledge of web users, Proceedings of Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, pp.293-300, 2004. ,
Exploitation d'une ressource lexicale pour la construction d'unétiqueteurun´unétiqueteur morphosyntaxiqué etat-de-l'art du français, Traitement Automatique des Langues Naturelles, 2010. ,
Coupling an annotated corpus and a lexicon for state-of-the-art POS tagging, Language Resources and Evaluation, vol.20, issue.2, pp.721-736, 2012. ,
DOI : 10.1007/s10579-012-9193-0
URL : https://hal.archives-ouvertes.fr/inria-00614819
Diacritic error detection and restoration via part-of-speech tags, Proceedings of the 6th Language and Technology Conference, 2013. ,
Finitestate transducer cascades to extract named entities in texts, Theoretical Computer Science, vol.313, pp.94-104, 2004. ,
Combining Trigram-based and feature-based methods for context-sensitive spelling correction, Proceedings of the 34th annual meeting on Association for Computational Linguistics -, pp.71-78, 1996. ,
DOI : 10.3115/981863.981873
A spelling correction program based on a noisy channel model, Proceedings of the 13th conference on Computational linguistics -, pp.205-210, 1990. ,
DOI : 10.3115/997939.997975
A voting system for automatic OCR correction, Proceedings of the Workshop On Information Retrieval and OCR: From Converting Content to Grasping Meaning, pp.1-21, 2002. ,
OCR error correction using a noisy channel model, Proceedings of the second international conference on Human Language Technology Research -, pp.257-262, 2002. ,
DOI : 10.3115/1289189.1289208
OCR postprocessing for low density languages, Proceedings of the HLT-EMNLP Conference, pp.867-874, 2005. ,
Improving optical character recognition through efficient multiple system alignment, Proceedings of the 2009 joint international conference on Digital libraries, JCDL '09, pp.231-240, 2009. ,
DOI : 10.1145/1555400.1555437
Named entity recognition from diverse text types, Proceedings of the Recent Advances in Natural Language Processing Conference, pp.257-274, 2001. ,
Architectural elements of language engineering robustness, Natural Language Engineering, vol.8, issue.2-3, pp.257-274, 2002. ,
DOI : 10.1017/S1351324902002930
Context based spelling correction. Information Processing and Management, pp.517-522, 1991. ,
DOI : 10.1016/0306-4573(91)90066-u
Multilingual text induced spelling correction, Proceedings of the Workshop on Multilingual Linguistic Ressources, MLR '04, pp.117-117, 2004. ,
DOI : 10.3115/1706238.1706256
SxPipe 2: architecture pour le traitement pré-syntaxique de corpus bruts, Traitement Automatique des Langues, vol.49, issue.2, pp.155-188, 2008. ,
DOI : 10.1075/lis.30.07sag
The lefff, a freely available and large-coverage morphological and syntactic lexicon for french, Proceedings of LREC 2010, 2010. ,
URL : https://hal.archives-ouvertes.fr/inria-00521242
Introduction to the conll-2003 shared task: Languageindependent named entity recognition, Proceedings of Computational Natural Language Learning, pp.142-147, 2003. ,
Multi-level feature extraction for spelling correction, IJCAI Workshop on Analytics for Noisy Unstructured Text Data, pp.79-86, 2007. ,
A Mathematical Theory of Communication, Bell System Technical Journal, vol.27, issue.3, pp.379-423, 1948. ,
DOI : 10.1002/j.1538-7305.1948.tb01338.x
Lexical postcorrection of OCR-results: the web as a dynamic secondary dictionary?, Proceedings of the Seventh International Conference on Document Analysis and Recognition (ICDAR'03), p.11331137, 2003. ,
A statistical approach to automatic OCR error correction in context, Proceedings of the Fourth Workshop on Very large Corpora, pp.88-100, 1996. ,