Mining Parsing Results for Lexical Correction: Toward a Complete Correction Process of Wide-Coverage Lexicons

Abstract : The coverage of a parser depends mostly on the quality of the underlying grammar and lexicon. The development of a lexicon both complete and accurate is an intricate and demanding task. We introduce a automatic process for detecting missing, incomplete and erroneous entries in a morphological and syntactic lexicon, and for suggesting corrections hypotheses for these entries. The detection of dubious lexical entries is tackled by two different techniques; the first one is based on a specific statistical model, the other one benefits from information provided by a part-of-speech tagger. The generation of correction hypotheses for dubious lexical entries is achieved by studying which modifications could improve the successful parse rate of sentences in which they occur. This process brings together various techniques based on taggers, parsers and statistical models. We report on its application for improving a large-coverage morphological and syntacic French lexicon, the Lefff.
Document type :
Conference papers
Complete list of metadatas

https://hal.inria.fr/hal-00793052
Contributor : Brigitte Briot <>
Submitted on : Thursday, February 21, 2013 - 2:46:29 PM
Last modification on : Thursday, August 29, 2019 - 2:24:09 PM

Links full text

Identifiers

Citation

Lionel Nicolas, Benoît Sagot, Miguel Molinero, Jacques Farré, Éric Villemonte de la Clergerie. Mining Parsing Results for Lexical Correction: Toward a Complete Correction Process of Wide-Coverage Lexicons. LTC 2007 - Third Language and Technology Conference, Oct 2007, Poznan, Poland. pp.178-191, ⟨10.1007/978-3-642-04235-5_16⟩. ⟨hal-00793052⟩

Share

Metrics

Record views

268