Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents

Résumé

We describe the acquisition of a dialog corpus for French based on multi-task human-machine interactions in a serious game setting. We present a tool for data collection that is configurable for multiple games; describe the data collected using this tool and the annotation schema used to annotate it; and report on the results obtained when training a classifier on the annotated data to associate each player turn with a dialog move usable by a rule based dialog manager. The collected data consists of approximately 1250 dialogs, 10454 utterances and 168509 words and will be made freely available to academic and nonprofit research.
Fichier principal
Vignette du fichier
emospeech-lrec12.pdf (313.43 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00726721 , version 1 (31-08-2012)

Identifiants

  • HAL Id : hal-00726721 , version 1

Citer

Lina Maria Rojas Barahona, Alejandra Lorenzo, Claire Gardent. Building and Exploiting a Corpus of Dialog Interactions between French Speaking Virtual and Human Agents. The eighth international conference on Language Resources and Evaluation (LREC), European Language Resources Association (ELRA), May 2012, Istanbul, Turkey. pp.1428-1435. ⟨hal-00726721⟩
375 Consultations
167 Téléchargements

Partager

Gmail Facebook X LinkedIn More