An improved uncertainty propagation method for robust i-vector based speaker recognition

Dayana Ribas 1 Emmanuel Vincent 2
2 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : The performance of automatic speaker recognition systems degrades when facing distorted speech data containing additive noise and/or reverberation. Statistical uncertainty propagation has been introduced as a promising paradigm to address this challenge. So far, different uncertainty propagation methods have been proposed to compensate noise and reverberation in i-vectors in the context of speaker recognition. They have achieved promising results on small datasets such as YOHO and Wall Street Journal, but little or no improvement on the larger, highly variable NIST Speaker Recognition Evaluation (SRE) corpus. In this paper, we propose a complete uncertainty propagation method, whereby we model the effect of uncertainty both in the computation of unbiased Baum-Welch statistics and in the derivation of the posterior expectation of the i-vector. We conduct experiments on the NIST-SRE corpus mixed with real domestic noise and reverberation from the CHiME-2 corpus and preprocessed by multichannel speech enhancement. The proposed method improves the equal error rate (EER) by 4% relative compared to a conventional i-vector based speaker verification baseline. This is to be compared with previous methods which degrade performance.
Document type :
Conference papers
Complete list of metadatas

https://hal.inria.fr/hal-02010199
Contributor : Emmanuel Vincent <>
Submitted on : Tuesday, February 19, 2019 - 9:53:37 AM
Last modification on : Wednesday, April 3, 2019 - 1:22:58 AM
Long-term archiving on : Monday, May 20, 2019 - 1:05:17 PM

Files

ribas_ICASSP19.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02010199, version 2
  • ARXIV : 1902.05761

Collections

Citation

Dayana Ribas, Emmanuel Vincent. An improved uncertainty propagation method for robust i-vector based speaker recognition. ICASSP 2019 - 44th International Conference on Acoustics, Speech, and Signal Processing, May 2019, Brighton, United Kingdom. ⟨hal-02010199v2⟩

Share

Metrics

Record views

60

Files downloads

167