HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

Hidden Factor Dynamic Bayesian Networks for Speech Recognition

Filipp Korkmazsky 1 Murat Deviren 1 Dominique Fohr 1 Irina Illina 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : This paper presents a novel approach to modeling speech data by Dynamic Bayesian Networks. Instead of defining a specific set of factors that affect speech signals the factors are modeled implicitly by speech data clustering. Different data clusters correspond to different subsets of the factor values. These subsets are represented by the corresponding factor states. The factor states along with the phone states represent 2 hidden layers in the Hidden Factor Dynamic Bayesian Network (HFDBN). In this study we proved that Hidden Factor Dynamic Bayesian Networks provide a better speech recognition performance than HMMs of equal complexity. Speech recognition experiments were conducted on the speech data recorded in a moving car and demonstrated advantage of using HFDBN over HMM for clean and noisy speech data recognition.
Document type :
Conference papers
Complete list of metadata

Contributor : Publications Loria Connect in order to contact the contributor
Submitted on : Tuesday, September 26, 2006 - 10:13:41 AM
Last modification on : Friday, February 26, 2021 - 3:28:05 PM


  • HAL Id : inria-00100045, version 1



Filipp Korkmazsky, Murat Deviren, Dominique Fohr, Irina Illina. Hidden Factor Dynamic Bayesian Networks for Speech Recognition. 8th International Conference on Spoken Language Processing - ICSLP'2004, 2004, Jeju, Corée du Sud, 4 p. ⟨inria-00100045⟩



Record views