Conference paper, Year: 2017

Analysing Data-To-Text Generation Benchmarks

Abstract

A generation system can only be as good as the data it is trained on. In this short paper, we propose a methodology for analysing data-to-text corpora used for training micro-planners, i.e., systems which, given some input, must produce a text verbalising exactly this input. We apply this methodology to three existing benchmarks and elicit a set of criteria for the creation of a data-to-text benchmark which could better support the development, evaluation and comparison of linguistically sophisticated data-to-text generators.
Main file
d2tDatasetAnalysis.pdf (138.62 KB)
Origin: Publisher files allowed on an open archive

Dates and versions

hal-01623832, version 1 (25-10-2017)

Identifiers

  • HAL Id: hal-01623832, version 1

Cite

Laura Perez-Beltrachini, Claire Gardent. Analysing Data-To-Text Generation Benchmarks. The 10th International Natural Language Generation conference, Sep 2017, Santiago de Compostela, Spain. ⟨hal-01623832⟩