Fully Automated Non-Native Speech Recognition Using Confusion-Based Acoustic Model Integration

Ghazi Bouselmi; Dominique Fohr; Irina Illina; Jean-Paul Haton

Communication Dans Un Congrès Année : 2005

Fully Automated Non-Native Speech Recognition Using Confusion-Based Acoustic Model Integration

(1) , (1) , (1) , (1)

Ghazi Bouselmi

Fonction : Auteur
PersonId : 836336

Analysis, perception and recognition of speech

Dominique Fohr

Fonction : Auteur
PersonId : 15652
IdHAL : dominique-fohr
IdRef : 031092942

Analysis, perception and recognition of speech

Irina Illina

Fonction : Auteur
PersonId : 15663
IdHAL : irina-illina
IdRef : 120731746

Analysis, perception and recognition of speech

Jean-Paul Haton

Fonction : Auteur
PersonId : 830987

Analysis, perception and recognition of speech

Résumé

This paper presents a fully automated approach for the recognition of non-native speech based on acoustic model modification. For a native language (L1) and a spoken language (L2), pronunciation variants of the phones of L2 are automatically extracted from an existing non-native database as a confusion matrix with sequences of phones of L1. This is done using L1's and L2's ASR systems. This confusion concept deals with the problem of non existence of match between some L2 and L1 phones. The confusion matrix is then used to modify the acoustic models (HMMs) of L2 phones by integrating corresponding L1 phone models as alternative HMM paths. In this way, no lexicon modification is carried. The modified ASR system achieved an improvement between 32% and 40% (relative, L1=French and L2=English) in WER on the French non-native database used for testing.

Mots clés

non native speech HMM structure modification

Domaines

Informatique et langage [cs.CL]

Fichier principal

eurospeech2005.pdf (158.34 Ko)

Bouselmi Ghazi : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00111920

Soumis le : lundi 6 novembre 2006-15:39:19

Dernière modification le : jeudi 15 février 2024-03:31:41

Archivage à long terme le : mardi 6 avril 2010-19:08:05

Dates et versions

inria-00111920 , version 1 (06-11-2006)

Identifiants

HAL Id : inria-00111920 , version 1

Citer

Ghazi Bouselmi, Dominique Fohr, Irina Illina, Jean-Paul Haton. Fully Automated Non-Native Speech Recognition Using Confusion-Based Acoustic Model Integration. Interspeech'2005 - Eurospeech — 9th European Conference on Speech Communication and Technology, Sep 2005, Lisbonne, Portugal. pp.1369-1372. ⟨inria-00111920⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INRIA IRISA UNIV-LORRAINE INRIA2 LORIA UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

165 Consultations

253 Téléchargements

Fully Automated Non-Native Speech Recognition Using Confusion-Based Acoustic Model Integration

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager