Réseaux Bayésiens Dynamiques pour la Reconnaissance Multi-Bandes de la Parole

Khalid Daoudi 1 Dominique Fohr 1 Christophe Antoine 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : This paper presents a new approach to multi-band automatic speech recognition which has the advantage to overcome many limitations of classical muti-band systems. The principle of this new approach is to build a speech model in the time-frequency domain using the formalism of Bayesian networks. Contrarily to classical multi-band modeling, this formalism leads to a probabilistic speech model which allows communications between the different sub-bands and, consequently, no recombination step is required in recognition. We develop efficient learning and decoding algorithms and present illustrative experiments on a connected digit recognition task. The experiments show that the Bayesian network's approach is very promising in the field of noisy speech recognition.
Document type :
Conference papers
Complete list of metadatas

Cited literature [15 references]  Display  Hide  Download

https://hal.inria.fr/inria-00099452
Contributor : Publications Loria <>
Submitted on : Tuesday, September 26, 2006 - 9:09:58 AM
Last modification on : Thursday, January 11, 2018 - 6:19:57 AM
Long-term archiving on: Wednesday, March 29, 2017 - 12:40:29 PM

Identifiers

  • HAL Id : inria-00099452, version 1

Collections

Citation

Khalid Daoudi, Dominique Fohr, Christophe Antoine. Réseaux Bayésiens Dynamiques pour la Reconnaissance Multi-Bandes de la Parole. XXIVe Journées d'Etudes sur la Parole - JEP'2002, Equipe Parole - LORIA, Jun 2002, Nancy, France, 4 p. ⟨inria-00099452⟩

Share

Metrics

Record views

235

Files downloads

214