Skip to Main content Skip to Navigation
Conference papers

Fully Automated Non-Native Speech Recognition Using Confusion-Based Acoustic Model Integration

Ghazi Bouselmi 1 Dominique Fohr 1 Irina Illina 1 Jean-Paul Haton 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : This paper presents a fully automated approach for the recognition of non-native speech based on acoustic model modification. For a native language (L1) and a spoken language (L2), pronunciation variants of the phones of L2 are automatically extracted from an existing non-native database as a confusion matrix with sequences of phones of L1. This is done using L1's and L2's ASR systems. This confusion concept deals with the problem of non existence of match between some L2 and L1 phones. The confusion matrix is then used to modify the acoustic models (HMMs) of L2 phones by integrating corresponding L1 phone models as alternative HMM paths. In this way, no lexicon modification is carried. The modified ASR system achieved an improvement between 32% and 40% (relative, L1=French and L2=English) in WER on the French non-native database used for testing.
Document type :
Conference papers
Complete list of metadata

Cited literature [6 references]  Display  Hide  Download
Contributor : Bouselmi Ghazi Connect in order to contact the contributor
Submitted on : Monday, November 6, 2006 - 3:39:19 PM
Last modification on : Friday, February 26, 2021 - 3:28:05 PM
Long-term archiving on: : Tuesday, April 6, 2010 - 7:08:05 PM


  • HAL Id : inria-00111920, version 1



Ghazi Bouselmi, Dominique Fohr, Irina Illina, Jean-Paul Haton. Fully Automated Non-Native Speech Recognition Using Confusion-Based Acoustic Model Integration. Interspeech'2005 - Eurospeech — 9th European Conference on Speech Communication and Technology, Sep 2005, Lisbonne, Portugal. pp.1369-1372. ⟨inria-00111920⟩



Record views


Files downloads