Exploiting confidence measures for missing data speech recognition

Christophe Cerisara

Communication Dans Un Congrès Année : 2008

Exploiting confidence measures for missing data speech recognition

(1)

Christophe Cerisara

Fonction : Auteur
PersonId : 2353
IdHAL : christophe-cerisara
IdRef : 102700168

Analysis, perception and recognition of speech

Résumé

Automatic speech recognition in highly non-stationary noise, for instance with a competing speaker or background music, is an extremely challenging and still unsolved problem. Missing data recognition is a robust approach that is well adapted to this kind of noise. A standard missing data technique consists in marginalizing out, from the observation likelihoods computed during decoding, the contribution of the spectro-temporal fragments that are dominated by noise. However, such an approach can hardly be applied to advanced parameterization domains that do not separate speech from noise frequencies, such as the cepstrum or ETSI AFE. We propose in the work to extend this technique to such parameterization domains, and not only to spectrographic-like front-ends as it was the case before. This is realized by masking the observations that favor erroneous decoding paths, instead of masking the features that are dominated by noise. These new missing data "masks" are now estimated based on speech recognition confidence measures, which can be considered as indicators of the reliability of decoding paths. A first version of this robust algorithm is evaluated on the French broadcast news ESTER corpus.

Domaines

Intelligence artificielle [cs.AI] Interface homme-machine [cs.HC]

Fichier principal

acoustics08.pdf (116.03 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Christophe Cerisara : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00330726

Soumis le : mercredi 10 décembre 2008-10:45:24

Dernière modification le : vendredi 24 mars 2023-14:52:51

Archivage à long terme le : lundi 7 juin 2010-18:27:15

Dates et versions

inria-00330726 , version 1 (10-12-2008)

Identifiants

HAL Id : inria-00330726 , version 1

Citer

Christophe Cerisara. Exploiting confidence measures for missing data speech recognition. Proceedings on Acoustics'08, Jul 2008, Paris, France. ⟨inria-00330726⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA

130 Consultations

190 Téléchargements

Exploiting confidence measures for missing data speech recognition

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager