An End-to-End Evaluation of Two Situated Dialog Systems.

Lina Maria Rojas Barahona 1 Alejandra Lorenzo 1 Claire Gardent 1
1 SYNALP - Natural Language Processing : representations, inference and semantics
LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : We present and evaluate two state-of-the art dialogue systems developed to support dialog with French speaking virtual characters in the context of a serious game: one hybrid statistical/symbolic and one purely statistical. We conducted a quantitative evaluation where we compare the accuracy of the interpreter and of the dialog manager used by each system; a user based evaluation based on 22 subjects using both the statistical and the hybrid system; and a corpus based evaluation where we examine such criteria as dialog coherence, dialog success, interpretation and generation errors in the corpus of Human-System interactions collected during the user-based evaluation. We show that although the statistical approach is slightly more robust, the hybrid strategy seems to be better at guiding the player through the game.
Type de document :
Communication dans un congrès
Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Jul 2012, Seoul, North Korea. pp.10-19, 2012
Liste complète des métadonnées

Littérature citée [25 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00726723
Contributeur : Lina Maria Rojas Barahona <>
Soumis le : vendredi 31 août 2012 - 10:26:02
Dernière modification le : mardi 24 avril 2018 - 13:37:22
Document(s) archivé(s) le : samedi 1 décembre 2012 - 03:30:39

Fichier

emospeech_sigdial.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00726723, version 1

Collections

Citation

Lina Maria Rojas Barahona, Alejandra Lorenzo, Claire Gardent. An End-to-End Evaluation of Two Situated Dialog Systems.. Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue, Jul 2012, Seoul, North Korea. pp.10-19, 2012. 〈hal-00726723〉

Partager

Métriques

Consultations de la notice

429

Téléchargements de fichiers

300