A Bayesian network for time-frequency speech modeling and recognition

Khalid Daoudi; Dominique Fohr; Christophe Antoine

Communication Dans Un Congrès Année : 2001

A Bayesian network for time-frequency speech modeling and recognition

(1) , (1) , (1)

Khalid Daoudi

Fonction : Auteur
PersonId : 1329075
ORCID : 0000-0003-3536-1060
IdRef : 115483500

Analysis, perception and recognition of speech

Dominique Fohr

Fonction : Auteur
PersonId : 15652
IdHAL : dominique-fohr
IdRef : 031092942

Analysis, perception and recognition of speech

Christophe Antoine

Fonction : Auteur
PersonId : 1035714
IdHAL : christophe-antoine

Analysis, perception and recognition of speech

Résumé

In this paper, we propose a new speech model which is a Bayesian network (BN) built in the time-frequency domain. Contrarily to HMMs, this BN provides a good modeling of the frequency dynamics, particularly the asynchrony between sub-bands. The experiments we carried out show that, consequently, speech is modeled with higher fidelity. Moreover, our new model allows to perform multi-band speech recognition without {\it all} the drawbacks of the usual multi-band approach where each sub-band is independently modeled by a HMM. This makes our model very suited to the case where speech is corrupted by a band-limited noise. We present experiments on an isolated digit recognition task, in clean and noisy conditions. The results we obtain show that the BNs framework is very promising in the field of speech modeling and recognition.

Mots clés

bayesian networks reconnaissance de la parole speech recognition réseaux bayésiens

Domaines

Autre [cs.OH]

Publications Loria : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00100524

Soumis le : mardi 26 septembre 2006-14:46:28

Dernière modification le : jeudi 1 février 2024-10:05:27

Dates et versions

inria-00100524 , version 1 (26-09-2006)

Identifiants

HAL Id : inria-00100524 , version 1

Citer

Khalid Daoudi, Dominique Fohr, Christophe Antoine. A Bayesian network for time-frequency speech modeling and recognition. International Conference on Artificial Intelligence and Soft Computing, May 2001, Cancun, Mexico, 5 p. ⟨inria-00100524⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INRIA IRISA UNIV-LORRAINE INRIA2 LORIA UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

123 Consultations

0 Téléchargements

A Bayesian network for time-frequency speech modeling and recognition

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager