Hard Time Parsing Questions: Building a QuestionBank for French - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2016

Hard Time Parsing Questions: Building a QuestionBank for French

Résumé

We present the French Question Bank, a treebank of 2600 questions. We show that classical parsing model performance drop while the inclusion of this data set is highly beneficial without harming the parsing of non-question data. when facing out-of-domain data with strong structural divergences. Two thirds being aligned with the English QuestionBank (Judge et al., 2006) and being freely available, this treebank will prove useful to build robust NLP systems.
Fichier principal
Vignette du fichier
lrec2016_QuestionBank.pdf (88.34 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01457184 , version 1 (08-02-2017)
hal-01457184 , version 2 (09-05-2017)

Identifiants

  • HAL Id : hal-01457184 , version 2

Citer

Djamé Seddah, Marie Candito. Hard Time Parsing Questions: Building a QuestionBank for French. Tenth International Conference on Language Resources and Evaluation (LREC 2016), May 2016, Portorož, Slovenia. ⟨hal-01457184v2⟩
368 Consultations
252 Téléchargements

Partager

Gmail Facebook X LinkedIn More