
On the Use of Machine Translation for Spoken Language Understanding Portability

Christophe Servan 1 Nathalie Camelin 1 Christian Raymond 2 Frédéric Béchet 1 Renato de Mori 1
2 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : Cross-language portability of a spoken language understanding (SLU) system concerns the possibility of reusing, with moderate effort, knowledge and data acquired for one language in a new language. The approach proposed in this paper is motivated by the availability of the fairly large MEDIA corpus, carefully transcribed in French and semantically annotated in terms of constituents. A method is proposed in which a portion of the training set is manually translated and used to train a machine translation (MT) system, which then translates the remaining data. As the source-language data is annotated with concept tags, a solution is presented for automatically transferring these tags to the translated corpus. Experimental results are presented on translation accuracy, expressed with the BLEU score, as a function of the size of the training corpus. It is shown that the process leads to comparable concept error rates in the two languages, making the proposed approach suitable for SLU portability across languages.
Document type : Conference papers
Contributor : Patrick Gros
Submitted on : Wednesday, October 6, 2010 - 5:00:26 PM
Last modification on : Thursday, November 25, 2021 - 3:12:06 PM



Christophe Servan, Nathalie Camelin, Christian Raymond, Frédéric Béchet, Renato de Mori. On the Use of Machine Translation for Spoken Language Understanding Portability. IEEE International Conference on Acoustics, Speech, and Signal Processing, IEEE, Mar 2010, Dallas, Texas, United States. pp.5330 - 5333, ⟨10.1109/ICASSP.2010.5494960⟩. ⟨inria-00523967⟩


