A Lexicon of French Quotation Verbs for Automatic Quotation Extraction

Abstract: Quotation extraction is an important information extraction task, especially when dealing with news wires. Quotations can be found in various configurations. In this paper, we focus on direct quotations introduced by a parenthetical clause headed by a “quotation verb”. Our study is based on a large French news wire corpus from the Agence France-Presse. We introduce and motivate a discourse-level analysis of such quotations, which differs from the syntactic analyses generally proposed. We show how we enriched the Lefff syntactic lexicon so that it accounts for quotation verbs heading a quotation parenthetical, especially those extracted from a news wire corpus. We also sketch how these lexical entries can be extended to the discourse level in order to fully model quotations introduced by a parenthetical clause.

https://hal.inria.fr/inria-00515461
Contributor: Benoît Sagot
Submitted on: Tuesday, September 7, 2010 - 9:14:10 AM
Last modification on: Friday, January 4, 2019 - 5:33:24 PM
Long-term archiving on: Tuesday, October 23, 2012 - 3:40:55 PM

File

lrec10vcit.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: inria-00515461, version 1

Citation

Benoît Sagot, Laurence Danlos, Rosa Stern. A Lexicon of French Quotation Verbs for Automatic Quotation Extraction. 7th International Conference on Language Resources and Evaluation - LREC 2010, May 2010, Valletta, Malta. ⟨inria-00515461⟩