Acoustic Model Merging Using Acoustic Models from Multilingual Speakers for Automatic Speech Recognition

Abstract : Many studies have explored the use of existing multilingual speech corpora to build an acoustic model for a target language. Work on multilingual acoustic modeling often uses multilingual acoustic models to create an initial model, which is usually suboptimal for decoding speech in the target language. Some target-language speech is then used to adapt and improve the initial model. In this paper, however, we investigate multilingual acoustic modeling as a way to enhance an acoustic model of the target language for an automatic speech recognition system. The proposed approach uses context-dependent acoustic model merging with a source language to adapt the acoustic model of a target language. The source- and target-language speech is spoken by speakers from the same country. Our experiments on Malay and English automatic speech recognition show relative improvements in word error rate (WER) ranging from 2% to about 10% when the multilingual acoustic model is employed.
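
The abstract reports its gains as relative WER improvement. As a minimal sketch of how such a figure is usually computed, assuming the common definition (baseline WER minus new WER, divided by baseline WER), the function name and the numbers below are hypothetical and are not taken from the paper:

```python
def relative_wer_improvement(baseline_wer: float, new_wer: float) -> float:
    """Return the relative WER reduction as a percentage of the baseline WER."""
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return 100.0 * (baseline_wer - new_wer) / baseline_wer


# Hypothetical figures for illustration only; these are not results from the paper.
baseline = 20.0  # WER (%) of a model trained on the target language alone
merged = 18.0    # WER (%) after merging in source-language acoustic models
improvement = relative_wer_improvement(baseline, merged)
print(f"{improvement:.1f}% relative improvement")
```

Under this definition, a drop from 20% to 18% WER is a 2-point absolute gain but a 10% relative gain, which is why the two kinds of figures should not be compared directly.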
Document type :
Conference papers

https://hal.inria.fr/hal-01020180
Contributor : Laurent Besacier
Submitted on : Tuesday, July 8, 2014 - 9:17:37 AM
Last modification on : Tuesday, February 12, 2019 - 1:31:31 AM
Document(s) archived on : Wednesday, October 8, 2014 - 11:05:24 AM

File

merging-ialp2014.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01020180, version 1

Citation

Tien Ping Tan, Laurent Besacier, Benjamin Lecouteux. Acoustic Model Merging Using Acoustic Models from Multilingual Speakers for Automatic Speech Recognition. International Conference on Asian Language Processing (IALP), Oct 2014, Kuching, Sarawak, Malaysia. ⟨hal-01020180⟩
