Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents

Lina Maria Rojas Barahona 1 Alejandra Lorenzo 1 Claire Gardent 1
1 SYNALP - Natural Language Processing : representations, inference and semantics
LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : We describe the acquisition of a dialog corpus for French based on multi-task human-machine interactions in a serious game setting. We present a tool for data collection that is configurable for multiple games; describe the data collected using this tool and the annotation schema used to annotate it; and report on the results obtained when training a classifier on the annotated data to associate each player turn with a dialog move usable by a rule based dialog manager. The collected data consists of approximately 1250 dialogs, 10454 utterances and 168509 words and will be made freely available to academic and nonprofit research.
Type de document :
Communication dans un congrès
Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis. The eighth international conference on Language Resources and Evaluation (LREC), May 2012, Istanbul, Turkey. pp.1428-1435, 2012
Liste complète des métadonnées

https://hal.inria.fr/hal-00726721
Contributeur : Lina Maria Rojas Barahona <>
Soumis le : vendredi 31 août 2012 - 10:17:56
Dernière modification le : jeudi 11 janvier 2018 - 06:23:43
Document(s) archivé(s) le : samedi 1 décembre 2012 - 03:30:31

Fichier

emospeech-lrec12.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00726721, version 1

Collections

Citation

Lina Maria Rojas Barahona, Alejandra Lorenzo, Claire Gardent. Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents. Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis. The eighth international conference on Language Resources and Evaluation (LREC), May 2012, Istanbul, Turkey. pp.1428-1435, 2012. 〈hal-00726721〉

Partager

Métriques

Consultations de la notice

387

Téléchargements de fichiers

144