A Lexicon of French Quotation Verbs for Automatic Quotation Extraction

Abstract: Quotation extraction is an important information extraction task, especially when dealing with news wires. Quotations can be found in various configurations. In this paper, we focus on direct quotations introduced by a parenthetical clause headed by a “quotation verb”. Our study is based on a large French news wire corpus from the Agence France-Presse. We introduce and motivate a discourse-level analysis of such quotations, which differs from the syntactic analyses generally proposed. We show how we enriched the Lefff syntactic lexicon so that it accounts for quotation verbs heading a quotation parenthetical, especially those extracted from a news wire corpus. We also sketch how these lexical entries can be extended to the discourse level in order to fully model quotations introduced by a parenthetical clause.
Document type: Conference paper
7th International Conference on Language Resources and Evaluation - LREC 2010, May 2010, Valletta, Malta. 2010

Cited literature: 15 references

https://hal.inria.fr/inria-00515461
Contributor: Benoît Sagot
Submitted on: Tuesday, 7 September 2010 - 09:14:10
Last modified on: Saturday, 9 June 2018 - 10:30:06
Document(s) archived on: Tuesday, 23 October 2012 - 15:40:55

File

lrec10vcit.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: inria-00515461, version 1

Citation

Benoît Sagot, Laurence Danlos, Rosa Stern. A Lexicon of French Quotation Verbs for Automatic Quotation Extraction. 7th International Conference on Language Resources and Evaluation - LREC 2010, May 2010, Valletta, Malta. 2010. 〈inria-00515461〉

Metrics

Record views: 339

File downloads: 219