Under-determined reverberant audio source separation using a full-rank spatial covariance model

Ngoc Q. K. Duong; Emmanuel Vincent; Rémi Gribonval

doi:10.1109/TASL.2010.2050716

Journal Articles IEEE Transactions on Audio, Speech and Language Processing Year : 2010

Under-determined reverberant audio source separation using a full-rank spatial covariance model

(1) , (1) , (1)

Ngoc Q. K. Duong

Function : Author
PersonId : 864978

Speech and sound data modeling and processing

Emmanuel Vincent

Function : Author
PersonId : 1256
IdHAL : emmanuelv
ORCID : 0000-0002-0183-7289
IdRef : 089360176

Speech and sound data modeling and processing

Rémi Gribonval

Function : Author
PersonId : 1255
IdHAL : remi-gribonval
ORCID : 0000-0002-9450-8125
IdRef : 113181590

Speech and sound data modeling and processing

Abstract

This article addresses the modeling of reverberant recording environments in the context of under-determined convolutive blind source separation. We model the contribution of each source to all mixture channels in the time-frequency domain as a zero-mean Gaussian random variable whose covari- ance encodes the spatial characteristics of the source. We then consider four specific covariance models, including a full-rank unconstrained model. We derive a family of iterative expectation- maximization (EM) algorithms to estimate the parameters of each model and propose suitable procedures adapted from the state- of-the-art to initialize the parameters and to align the order of the estimated sources across all frequency bins. Experimental results over reverberant synthetic mixtures and live recordings of speech data show the effectiveness of the proposed approach.

Domains

Signal and Image Processing Signal and Image processing

Fichier principal

duong_TASLP10.pdf (806.02 Ko)

Origin : Files produced by the author(s)

Rémi Gribonval : Connect in order to contact the contributor

https://inria.hal.science/inria-00541865

Submitted on : Thursday, January 27, 2011-9:48:52 PM

Last modification on : Friday, March 24, 2023-2:52:53 PM

Long-term archiving on: Thursday, April 28, 2011-2:30:59 AM

Dates and versions

inria-00541865 , version 1 (27-01-2011)

Identifiers

HAL Id : inria-00541865 , version 1
DOI : 10.1109/TASL.2010.2050716

Cite

Ngoc Q. K. Duong, Emmanuel Vincent, Rémi Gribonval. Under-determined reverberant audio source separation using a full-rank spatial covariance model. IEEE Transactions on Audio, Speech and Language Processing, 2010, 18 (7), pp.1830--1840. ⟨10.1109/TASL.2010.2050716⟩. ⟨inria-00541865⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA IRISA-D5 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE ANR UR1-MATH-NUM

437 View

1006 Download

Under-determined reverberant audio source separation using a full-rank spatial covariance model

Abstract

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Altmetric

Share