Uncertainty propagation for noise robust speaker recognition: the case of NIST-SRE

Dayana Ribas 1, Emmanuel Vincent 2, José Ramon Calvo 1
2 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : Uncertainty propagation is an established approach to handling noisy and reverberant conditions in automatic speech recognition (ASR), but it has been little studied for speaker recognition so far. Yu et al. recently proposed to propagate uncertainty to the Baum-Welch (BW) statistics without changing the posterior probability of each mixture component. They obtained good results on a small dataset (YOHO) but little improvement on the NIST-SRE dataset, despite the use of oracle uncertainty estimates. In this paper, we propose to modify the computation of the posterior probability of each mixture component in order to obtain unbiased BW statistics. We show that our approach improves the accuracy of BW statistics on the Wall Street Journal (WSJ) corpus, but again yields little or no improvement on NIST-SRE. We provide a theoretical explanation for this result, which opens the way for more efficient exploitation of uncertainty on NIST-SRE and other large datasets in the future.
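To make the idea in the abstract concrete, the following is a minimal sketch of uncertainty-aware Baum-Welch statistics for a diagonal-covariance GMM. It is not taken from the paper: it assumes a textbook Gaussian uncertainty model in which each observed feature equals the clean feature plus zero-mean Gaussian error with a known per-frame diagonal variance. The function name `bw_stats_with_uncertainty`, its arguments, and the `uncertain_posteriors` flag are hypothetical names introduced for illustration. Setting the flag to `False` loosely mimics leaving the component posteriors unchanged, while `True` also adds the uncertainty variance when computing them.

```python
import numpy as np

def bw_stats_with_uncertainty(y, var_unc, weights, means, variances,
                              uncertain_posteriors=True):
    """Zeroth- and first-order BW statistics under Gaussian feature uncertainty.

    Illustrative assumptions (not from the paper):
      y         : (T, D) observed (e.g. enhanced) features
      var_unc   : (T, D) diagonal uncertainty variance of each frame
      weights   : (C,)   GMM mixture weights
      means     : (C, D) GMM means
      variances : (C, D) GMM diagonal variances
    """
    diff = y[:, None, :] - means[None, :, :]                # (T, C, D)

    # Variance used for the component likelihoods: the model variance alone,
    # or the model variance plus the frame's uncertainty variance.
    extra = var_unc[:, None, :] if uncertain_posteriors else 0.0
    total_var = variances[None, :, :] + extra

    log_lik = -0.5 * np.sum(np.log(2.0 * np.pi * total_var)
                            + diff ** 2 / total_var, axis=2)  # (T, C)

    # Component posteriors, normalised in the log domain for stability.
    log_post = np.log(weights)[None, :] + log_lik
    log_post -= log_post.max(axis=1, keepdims=True)
    post = np.exp(log_post)
    post /= post.sum(axis=1, keepdims=True)

    # Expected clean feature given the observation and the component
    # (a Wiener-like shrinkage of the observation towards the component mean).
    gain = variances[None, :, :] / (variances[None, :, :] + var_unc[:, None, :])
    x_hat = means[None, :, :] + gain * diff                 # (T, C, D)

    N = post.sum(axis=0)                                    # zeroth order, (C,)
    F = np.einsum("tc,tcd->cd", post, x_hat)                # first order, (C, D)
    return N, F
```

With zero uncertainty both settings of the flag reduce to conventional BW statistics. With very large uncertainty the posteriors fall back to the mixture weights and the expected features fall back to the component means.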
Document type :
Conference papers

Cited literature [32 references]

https://hal.inria.fr/hal-01158775
Contributor : Emmanuel Vincent
Submitted on : Wednesday, August 5, 2015 - 5:15:09 PM
Last modification on : Monday, May 20, 2019 - 3:04:03 PM
Long-term archiving on: Wednesday, April 26, 2017 - 9:50:55 AM

File

UPivector_2015.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01158775, version 3

Citation

Dayana Ribas, Emmanuel Vincent, José Ramon Calvo. Uncertainty propagation for noise robust speaker recognition: the case of NIST-SRE. Interspeech 2015, Sep 2015, Dresden, Germany. pp.5. ⟨hal-01158775v3⟩
