C. Gardent, A. Shimorina, S. Narayan, and L. Perez-beltrachini, The WebNLG Challenge: Generating Text from RDF Data, Proceedings of the 10th International Conference on Natural Language Generation, pp.124-133, 2017.
URL : https://hal.archives-ouvertes.fr/hal-02461197

C. Gardent, A. Shimorina, S. Narayan, and L. Perez-beltrachini, Creating Training Corpora for NLG Micro-Planners, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), vol.1, 2017.

D. Hovy, T. Berg-kirkpatrick, A. Vaswani, and E. H. Hovy, Learning whom to trust with mace, HLT-NAACL, pp.1120-1130, 2013.

A. Stent, M. Marge, and M. Singhai, Evaluating Evaluation Methods for Generation in the Presence of Variation, Computational Linguistics and Intelligent Text Processing, pp.341-351, 2005.

D. Elliott and F. Keller, Comparing Automatic Evaluation Measures for Image Description, Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), vol.452, p.457, 2014.

J. Novikova, O. Du?ek, A. Cercas-curry, and V. Rieser, Why We Need New Evaluation Metrics for NLG, Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pp.2241-2252, 2017.

C. Callison-burch, P. Koehn, C. Monz, and J. Schroeder, Findings of the 2009 workshop on statistical machine translation, Proceedings of the Fourth Workshop on Statistical Machine Translation - StatMT '09, pp.1-28, 2009.

R. Bernardi, R. Cakici, D. Elliott, A. Erdem, E. Erdem et al., Automatic Description Generation from Images: A Survey of Models, Datasets, and Evaluation Measures, Journal of Artificial Intelligence Research, vol.55, pp.409-442, 2016.