On the Use of Machine Translation for Spoken Language Understanding Portability

Christophe Servan 1 Nathalie Camelin 1 Christian Raymond 2 Frédéric Béchet 1 Renato De Mori 1
2 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : Across language portability of a spoken language understanding system (SLU) deals with the possibility of reusing with moderate effort in a new language knowledge and data acquired for another language. The approach proposed in this paper is motivated by the availability of the fairly large MEDIA corpus carefully transcribed in French and semantically annotated in terms of constituents. A method is proposed for manually translating a portion of the training set for training an automatic machine translation (MT) system to be used for translating the remaining data. As the source language is annotated in terms of concept tags, a solution is presented for automatically transferring these tags to the translated corpus. Experimental results are presented on the accuracy of the translation expressed with the BLEU score as function of the size of the training corpus. It is shown that the process leads to comparable concept error rates in the two languages making the proposed approach suitable for SLU portability across languages.
Type de document :
Communication dans un congrès
IEEE International Conference on Acoustics, Speech, and Signal Processing, Mar 2010, Dallas, Texas, United States. pp.5330 - 5333, 2010, 〈http://ieeexplore.ieee.org/iel5/5487364/5494886/05494960.pdf〉. 〈10.1109/ICASSP.2010.5494960〉
Liste complète des métadonnées

https://hal.inria.fr/inria-00523967
Contributeur : Patrick Gros <>
Soumis le : mercredi 6 octobre 2010 - 17:00:26
Dernière modification le : mercredi 21 février 2018 - 01:53:50

Lien texte intégral

Identifiants

Citation

Christophe Servan, Nathalie Camelin, Christian Raymond, Frédéric Béchet, Renato De Mori. On the Use of Machine Translation for Spoken Language Understanding Portability. IEEE International Conference on Acoustics, Speech, and Signal Processing, Mar 2010, Dallas, Texas, United States. pp.5330 - 5333, 2010, 〈http://ieeexplore.ieee.org/iel5/5487364/5494886/05494960.pdf〉. 〈10.1109/ICASSP.2010.5494960〉. 〈inria-00523967〉

Partager

Métriques

Consultations de la notice

256