Skip to Main content Skip to Navigation
Journal articles

Dynamic Bayesian Networks for multi-band automatic speech recognition

Khalid Daoudi 1 Dominique Fohr 1 Christophe Antoine 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : This paper presents a new approach to multi-band automatic speech recognition which has the advantage to overcome many limitations of classical muti-band systems. The principle of this new approach is to build a speech model in the time-frequency domain using the formalism of dynamic Bayesian networks. In contrast to classical multi-band modeling, this formalism leads to a probabilistic speech model which allows communications between the different sub-bands and, consequently, no recombination step is required in recognition. We develop efficient learning and decoding algorithms both for isolated and continuous speech recognition. We present illustrative experiments on isolated and connected digit recognition tasks. These experiments show that the this new approach is very promising in the field of noisy speech recognition.
Document type :
Journal articles
Complete list of metadata

https://hal.inria.fr/inria-00099530
Contributor : Publications Loria Connect in order to contact the contributor
Submitted on : Friday, November 13, 2020 - 1:16:29 PM
Last modification on : Friday, February 26, 2021 - 3:28:06 PM
Long-term archiving on: : Sunday, February 14, 2021 - 6:54:18 PM

File

00099530.pdf
Files produced by the author(s)

Identifiers

Collections

`

Citation

Khalid Daoudi, Dominique Fohr, Christophe Antoine. Dynamic Bayesian Networks for multi-band automatic speech recognition. Computer Speech and Language, Elsevier, 2003, 17 (2-3), pp.263-285. ⟨10.1016/S0885-2308(03)00011-1⟩. ⟨inria-00099530⟩

Share

Metrics

Record views

305

Files downloads

193