Conference papers

Missing data mask models with global frequency and temporal constraints

Sébastien Demange 1 Christophe Cerisara 1 Jean-Paul Haton 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : Missing data recognition has been developed to increase noise robustness in automatic speech recognition. Many factors, including the speech decoding process itself, must be considered to locate the masks. In this work, we consider Bayesian models of the masks, in which every spectral feature is classified as reliable or masked independently of the rest of the signal. This classification strategy can produce small, unrelated "spots", whereas experiments suggest that oracle reliable and unreliable features tend to cluster into time-frequency blocks. We call this undesired effect the "checkerboard" effect. In this paper, we propose a new Bayesian missing data classifier that integrates frequency and temporal constraints in order to reduce, or avoid, this "checkerboard" effect. The proposed classifier is evaluated on the Aurora2 connected digit corpus. Integrating such constraints into the missing data classification leads to significant improvements in recognition accuracy.
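The "checkerboard" effect described in the abstract can be illustrated with a toy example. The sketch below is not the classifier proposed in the paper, whose details are not given in this record. It only contrasts labelling each time-frequency cell on its own with labelling it from the mean over a 3x3 neighbourhood, used here as a crude stand-in for frequency and temporal constraints. All function names, the 0.5 threshold, the 3x3 window and the posterior values are assumptions made for illustration.

```python
# Toy illustration only: NOT the classifier from the paper.
# Rows are frequency bands, columns are time frames; each value is a
# hypothetical posterior probability that the cell is reliable.
POSTERIORS = [
    [0.9, 0.9, 0.9, 0.1, 0.1, 0.1],
    [0.9, 0.4, 0.9, 0.1, 0.6, 0.1],  # two outlier cells: 0.4 and 0.6
    [0.9, 0.9, 0.9, 0.1, 0.1, 0.1],
    [0.9, 0.9, 0.9, 0.1, 0.1, 0.1],
]


def independent_mask(posteriors, threshold=0.5):
    """Label each cell on its own: 1 = reliable, 0 = masked."""
    return [[1 if p > threshold else 0 for p in row] for row in posteriors]


def neighbourhood_mask(posteriors, threshold=0.5):
    """Label each cell from the mean posterior of its 3x3 neighbourhood.

    The window is clipped at the grid borders. This averaging is an
    assumed, simplistic way of coupling neighbouring cells.
    """
    n_rows, n_cols = len(posteriors), len(posteriors[0])
    mask = []
    for i in range(n_rows):
        row = []
        for j in range(n_cols):
            window = [
                posteriors[a][b]
                for a in range(max(0, i - 1), min(n_rows, i + 2))
                for b in range(max(0, j - 1), min(n_cols, j + 2))
            ]
            row.append(1 if sum(window) / len(window) > threshold else 0)
        mask.append(row)
    return mask


def count_isolated(mask):
    """Count cells whose 4-connected neighbours all carry the other label."""
    n_rows, n_cols = len(mask), len(mask[0])
    isolated = 0
    for i in range(n_rows):
        for j in range(n_cols):
            neighbours = [
                mask[a][b]
                for a, b in ((i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1))
                if 0 <= a < n_rows and 0 <= b < n_cols
            ]
            if neighbours and all(n != mask[i][j] for n in neighbours):
                isolated += 1
    return isolated


spotty = independent_mask(POSTERIORS)
smooth = neighbourhood_mask(POSTERIORS)
print(count_isolated(spotty), count_isolated(smooth))
```

On this hand-made grid, independent labelling leaves the two outlier cells as isolated spots inside otherwise uniform blocks, while the neighbourhood-based labelling yields two clean blocks.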
Document type :
Conference papers

Cited literature [9 references]

Contributor : Sébastien Demange
Submitted on : Wednesday, October 4, 2006 - 4:56:44 PM
Last modification on : Friday, February 26, 2021 - 3:28:05 PM
Long-term archiving on : Tuesday, April 6, 2010 - 6:12:38 PM


  • HAL Id : inria-00103574, version 1



Sébastien Demange, Christophe Cerisara, Jean-Paul Haton. Missing data mask models with global frequency and temporal constraints. Ninth International Conference on Spoken Language Processing - Interspeech 2006 - ICSLP, Sep 2006, Pittsburgh, Pennsylvania/USA. ⟨inria-00103574⟩


