Under-determined reverberant audio source separation using a full-rank spatial covariance model

Ngoc Duong 1 Emmanuel Vincent 1 Rémi Gribonval 1
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : This article addresses the modeling of reverberant recording environments in the context of under-determined convolutive blind source separation. We model the contribution of each source to all mixture channels in the time-frequency domain as a zero-mean Gaussian random variable whose covari- ance encodes the spatial characteristics of the source. We then consider four specific covariance models, including a full-rank unconstrained model. We derive a family of iterative expectation- maximization (EM) algorithms to estimate the parameters of each model and propose suitable procedures adapted from the state- of-the-art to initialize the parameters and to align the order of the estimated sources across all frequency bins. Experimental results over reverberant synthetic mixtures and live recordings of speech data show the effectiveness of the proposed approach.
Type de document :
Article dans une revue
IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2010, 18 (7), pp.1830--1840. 〈10.1109/TASL.2010.2050716〉
Liste complète des métadonnées

Littérature citée [24 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00541865
Contributeur : Rémi Gribonval <>
Soumis le : jeudi 27 janvier 2011 - 21:48:52
Dernière modification le : mercredi 16 mai 2018 - 11:23:03
Document(s) archivé(s) le : jeudi 28 avril 2011 - 02:30:59

Fichier

duong_TASLP10.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Citation

Ngoc Duong, Emmanuel Vincent, Rémi Gribonval. Under-determined reverberant audio source separation using a full-rank spatial covariance model. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2010, 18 (7), pp.1830--1840. 〈10.1109/TASL.2010.2050716〉. 〈inria-00541865〉

Partager

Métriques

Consultations de la notice

609

Téléchargements de fichiers

630