Foreebank: Syntactic Analysis of Customer Support Forums

Abstract : We present a new treebank of English and French technical forum content which has been annotated for grammatical errors and phrase structure. This double annotation allows us to empirically measure the effect of errors on parsing performance. While it is slightly easier to parse the corrected versions of the forum sentences, the errors are not the main factor in making this kind of text hard to parse.
Document type :
Conference papers
Complete list of metadatas

Cited literature [32 references]  Display  Hide  Download

https://hal.inria.fr/hal-01188170
Contributor : Corentin Ribeyre <>
Submitted on : Friday, August 28, 2015 - 3:11:52 PM
Last modification on : Friday, June 14, 2019 - 3:36:02 PM
Long-term archiving on : Sunday, November 29, 2015 - 10:33:53 AM

File

EMNLP2015-1.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01188170, version 1

Citation

Rasoul Kaljahi, Jennifer Foster, Johann Roturier, Corentin Ribeyre, Teresa Lynn, et al.. Foreebank: Syntactic Analysis of Customer Support Forums. Conference on Empirical Methods in Natural Language Processing (EMNLP), Sep 2015, Lisboa, Portugal. ⟨hal-01188170⟩

Share

Metrics

Record views

473

Files downloads

490