HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

A Bayesian network for time-frequency speech modeling and recognition

Khalid Daoudi 1 Dominique Fohr 1 Christophe Antoine 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : In this paper, we propose a new speech model which is a Bayesian network (BN) built in the time-frequency domain. Contrarily to HMMs, this BN provides a good modeling of the frequency dynamics, particularly the asynchrony between sub-bands. The experiments we carried out show that, consequently, speech is modeled with higher fidelity. Moreover, our new model allows to perform multi-band speech recognition without {\it all} the drawbacks of the usual multi-band approach where each sub-band is independently modeled by a HMM. This makes our model very suited to the case where speech is corrupted by a band-limited noise. We present experiments on an isolated digit recognition task, in clean and noisy conditions. The results we obtain show that the BNs framework is very promising in the field of speech modeling and recognition.
Document type :
Conference papers
Complete list of metadata

Contributor : Publications Loria Connect in order to contact the contributor
Submitted on : Tuesday, September 26, 2006 - 2:46:28 PM
Last modification on : Friday, February 26, 2021 - 3:28:05 PM


  • HAL Id : inria-00100524, version 1



Khalid Daoudi, Dominique Fohr, Christophe Antoine. A Bayesian network for time-frequency speech modeling and recognition. International Conference on Artificial Intelligence and Soft Computing, May 2001, Cancun, Mexico, 5 p. ⟨inria-00100524⟩



Record views