Variational Bayesian Inference for Source Separation and Robust Feature Extraction

Kamil Adiloğlu; Emmanuel Vincent

doi:10.1109/TASLP.2016.2583794

Article Dans Une Revue IEEE Transactions on Audio, Speech and Language Processing Année : 2016

Variational Bayesian Inference for Source Separation and Robust Feature Extraction

(1) , (2)

1
2

Kamil Adiloğlu

Fonction : Auteur
PersonId : 952373

Hörtech gGmbH

Emmanuel Vincent

Fonction : Auteur
PersonId : 1256
IdHAL : emmanuelv
ORCID : 0000-0002-0183-7289
IdRef : 089360176

Speech Modeling for Facilitating Oral-Based Communication

Résumé

We consider the task of separating and classifying individual sound sources mixed together. The main challenge is to achieve robust classification despite residual distortion of the separated source signals. A promising paradigm is to estimate the uncertainty about the separated source signals and to propagate it through the subsequent feature extraction and classification stages. We argue that variational Bayesian (VB) inference offers a mathematically rigorous way of deriving uncertainty estimators, which contrasts with state-of-the-art estimators based on heuristics or on maximum likelihood (ML) estimation. We propose a general VB source separation algorithm, which makes it possible to jointly exploit spatial and spectral models of the sources. This algorithm achieves 6% and 5% relative error reduction compared to ML uncertainty estimation on the CHiME noise-robust speaker identification and speech recognition benchmarks, respectively, and it opens the way for more complex VB approximations of uncertainty.

Dans cet article, nous considérons le problème de l'extraction des descripteurs de chaque source dans un enregistrement audio multi-sources à l'aide d'un algorithme général de séparation de sources. La difficulté consiste à estimer l'incertitude sur les sources et à la propager aux descripteurs, afin de les estimer de façon robuste en dépit des erreurs de séparation. Les méthodes de l'état de l'art estiment l'incertitude de façon heuristique, tandis que nous proposons d'intégrer sur les paramètres de l'algorithme de séparation de sources. Nous décrivons dans ce but une méthode d'inférence variationnelle bayésienne pour l'estimation de la distribution a posteriori des sources et nous calculons ensuite l'espérance des descripteurs par propagation de l'incertitude selon la méthode d'identification des moments. Nous évaluons la précision des descripteurs en terme d'erreur quadratique moyenne et conduisons des expériences de reconnaissance du locuteur afin d'observer la performance qui en découle pour un problème réel. Dans les deux cas, la méthode proposée donne les meilleurs résultats.

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP]

Fichier principal

double.pdf (417.5 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Kamil Adiloglu : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00726146

Soumis le : vendredi 8 juillet 2016-09:15:56

Dernière modification le : jeudi 1 février 2024-10:03:33

Dates et versions

hal-00726146 , version 1 (29-08-2012)

hal-00726146 , version 2 (08-07-2016)

Identifiants

HAL Id : hal-00726146 , version 2
DOI : 10.1109/TASLP.2016.2583794

Citer

Kamil Adiloğlu, Emmanuel Vincent. Variational Bayesian Inference for Source Separation and Robust Feature Extraction. IEEE Transactions on Audio, Speech and Language Processing, 2016, ⟨10.1109/TASLP.2016.2583794⟩. ⟨hal-00726146v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INRIA IRISA GRID5000 UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES SILECS UR1-MATH-NUM

482 Consultations

1115 Téléchargements

Variational Bayesian Inference for Source Separation and Robust Feature Extraction

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager