A Bayesian network for time-frequency speech modeling and recognition - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2001

A Bayesian network for time-frequency speech modeling and recognition

Résumé

In this paper, we propose a new speech model which is a Bayesian network (BN) built in the time-frequency domain. Contrarily to HMMs, this BN provides a good modeling of the frequency dynamics, particularly the asynchrony between sub-bands. The experiments we carried out show that, consequently, speech is modeled with higher fidelity. Moreover, our new model allows to perform multi-band speech recognition without {\it all} the drawbacks of the usual multi-band approach where each sub-band is independently modeled by a HMM. This makes our model very suited to the case where speech is corrupted by a band-limited noise. We present experiments on an isolated digit recognition task, in clean and noisy conditions. The results we obtain show that the BNs framework is very promising in the field of speech modeling and recognition.

Domaines

Autre [cs.OH]
Fichier non déposé

Dates et versions

inria-00100524 , version 1 (26-09-2006)

Identifiants

  • HAL Id : inria-00100524 , version 1

Citer

Khalid Daoudi, Dominique Fohr, Christophe Antoine. A Bayesian network for time-frequency speech modeling and recognition. International Conference on Artificial Intelligence and Soft Computing, May 2001, Cancun, Mexico, 5 p. ⟨inria-00100524⟩
123 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More