Foreebank: Syntactic Analysis of Customer Support Forums - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2015

Foreebank: Syntactic Analysis of Customer Support Forums

Résumé

We present a new treebank of English and French technical forum content which has been annotated for grammatical errors and phrase structure. This double annotation allows us to empirically measure the effect of errors on parsing performance. While it is slightly easier to parse the corrected versions of the forum sentences, the errors are not the main factor in making this kind of text hard to parse.
Fichier principal
Vignette du fichier
EMNLP2015-1.pdf (161.29 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01188170 , version 1 (28-08-2015)

Identifiants

  • HAL Id : hal-01188170 , version 1

Citer

Rasoul Kaljahi, Jennifer Foster, Johann Roturier, Corentin Ribeyre, Teresa Lynn, et al.. Foreebank: Syntactic Analysis of Customer Support Forums. Conference on Empirical Methods in Natural Language Processing (EMNLP), Sep 2015, Lisboa, Portugal. ⟨hal-01188170⟩
377 Consultations
353 Téléchargements

Partager

Gmail Facebook X LinkedIn More