Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora - Archive ouverte HAL Access content directly
Conference Papers Year : 2012

Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora

(1) , (2) , (3) , (4) , (5) , (4) , (6, 7) , (1, 8) , (5)
1
2
3
4
5
6
7
8
Fabrice Lefèvre

Abstract

The PORTMEDIA project is intended to develop new corpora for the evaluation of spoken language understanding systems. The newly collected data are in the field of human-machine dialogue systems for tourist information in French in line with the MEDIA corpus. Transcriptions and semantic annotations, obtained by low-cost procedures, are provided to allow a thorough evaluation of the systems' capabilities in terms of robustness and portability across languages and domains. A new test set with some adaptation data is prepared for each case: in Italian as an example of a new language, for ticket reservation as an example of a new domain. Finally the work is complemented by the proposition of a new high level semantic annotation scheme well-suited to dialogue data.
Fichier principal
Vignette du fichier
751_Paper.pdf (151.1 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-00683433 , version 1 (28-03-2012)

Identifiers

  • HAL Id : hal-00683433 , version 1

Cite

Fabrice Lefèvre, Djamel Mostefa, Laurent Besacier, Yannick Estève, Matthieu Quignard, et al.. Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora. The International Conference on Language Resources and Evaluation, May 2012, Istanbul, Turkey. ⟨hal-00683433⟩
924 View
475 Download

Share

Gmail Facebook Twitter LinkedIn More