Skip to Main content Skip to Navigation
Journal articles

Améliorer un lexique syntaxique à l'aide des tables du lexique-grammaire — Constructions impersonnelles et expressions verbales figées

Benoît Sagot 1 Laurence Danlos 2
1 SIGNES - Linguistic signs, grammar and meaning: computational logic for natural language
INRIA Futurs, Université Sciences et Technologies - Bordeaux 1, École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB), Université Bordeaux Montaigne, CNRS - Centre National de la Recherche Scientifique : UMR5800
Abstract : We intend to develop a large-coverage morphological and syntactic lexicon for French which can be directly used in Natural Langage Processing (NLP) applications, in particular in those involving deep parsing, regardless of the underlying grammatical framework. This lexicon, named Lefff (Lexique des Formes Fléchies du Français — Lexicon of French inFlected Forms), has been under development since 2004. At the beginning, this lexicon contained only verbal morphological information, mostly automatically induced from corpora. It now covers all parts of speech, and is progressively enriched with syntactic information. In this paper, we show how we used the lexicon-grammar tables, whose development has been initiated by M. Gross, to enrich the Lefff. These tables are a valuable starting point. However, it is necessary to achieve both a linguistic and formal modeling work, in order to exploit their content in a NLP lexicon such as the Lefff. We illustrate this approach on two kinds of non-standard verbal and adjectival entries : impersonal structures and verbal idiomatic expressions.
Document type :
Journal articles
Complete list of metadatas

Cited literature [8 references]  Display  Hide  Download

https://hal.inria.fr/inria-00515460
Contributor : Benoît Sagot <>
Submitted on : Tuesday, September 7, 2010 - 9:10:32 AM
Last modification on : Thursday, June 18, 2020 - 11:28:05 AM
Document(s) archivé(s) le : Wednesday, December 8, 2010 - 2:34:05 AM

File

CCental-SagotDanlos06.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00515460, version 1

Citation

Benoît Sagot, Laurence Danlos. Améliorer un lexique syntaxique à l'aide des tables du lexique-grammaire — Constructions impersonnelles et expressions verbales figées. Cahiers du Cental, Presses universitaires de Louvain, 2008, Description linguistique pour le traitement automatique du français, 5, pp.107-126. ⟨inria-00515460⟩

Share

Metrics

Record views

463

Files downloads

634