Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora

Abstract : The PORTMEDIA project is intended to develop new corpora for the evaluation of spoken language understanding systems. The newly collected data are in the field of human-machine dialogue systems for tourist information in French in line with the MEDIA corpus. Transcriptions and semantic annotations, obtained by low-cost procedures, are provided to allow a thorough evaluation of the systems' capabilities in terms of robustness and portability across languages and domains. A new test set with some adaptation data is prepared for each case: in Italian as an example of a new language, for ticket reservation as an example of a new domain. Finally the work is complemented by the proposition of a new high level semantic annotation scheme well-suited to dialogue data.
Document type :
Conference papers
Complete list of metadatas

Cited literature [20 references]  Display  Hide  Download

https://hal.inria.fr/hal-00683433
Contributor : Lina Maria Rojas Barahona <>
Submitted on : Wednesday, March 28, 2012 - 4:53:42 PM
Last modification on : Wednesday, May 15, 2019 - 10:12:03 AM
Long-term archiving on : Monday, November 26, 2012 - 12:21:09 PM

File

751_Paper.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00683433, version 1

Citation

Fabrice Lefèvre, Djamel Mostefa, Laurent Besacier, Yannick Estève, Matthieu Quignard, et al.. Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora. The International Conference on Language Resources and Evaluation, May 2012, Istanbul, Turkey. ⟨hal-00683433⟩

Share

Metrics

Record views

1361

Files downloads

672