A Lexicon of French Quotation Verbs for Automatic Quotation Extraction - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2010

A Lexicon of French Quotation Verbs for Automatic Quotation Extraction

Résumé

Quotation extraction is an important information extraction task, especially when dealing with news wires. Quotations can be found in various configurations. In this paper, we focus on direct quotations introduced by a parenthetical clause, headed by a “quotation verb”. Our study is based on a large French news wire corpus from the Agence France-Presse. We introduce and motivate an analysis at the discursive level of such quotations, which differs from the syntactic analyses generally proposed. We show how we enriched the Lefff syntactic lexicon so that it provides an account for quotation verbs heading a quotation parenthetical, especially those extracted from a news wire corpus. We also sketch how these lexical entries can be extended to the discursive level in order to model quotations introduced in a parenthetical clause in a complete way.
Fichier principal
Vignette du fichier
lrec10vcit.pdf (93.27 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00515461 , version 1 (07-09-2010)

Identifiants

  • HAL Id : inria-00515461 , version 1

Citer

Benoît Sagot, Laurence Danlos, Rosa Stern. A Lexicon of French Quotation Verbs for Automatic Quotation Extraction. 7th international conference on Language Resources and Evaluation - LREC 2010, May 2010, Valetta, Malta. ⟨inria-00515461⟩
223 Consultations
273 Téléchargements

Partager

Gmail Facebook X LinkedIn More