Skip to Main content Skip to Navigation
Conference papers

An acoustically-motivated spatial prior for under-determined reverberant source separation

Ngoc Duong 1 Emmanuel Vincent 1 Rémi Gribonval 1
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : We consider the task of under-determined reverberant audio source separation. We model the contribution of each source to all mixture channels in the time-frequency domain as a zero-mean Gaussian random vector with full-rank spatial covariance matrix. We introduce an inverse Wishart prior over the covariance matrices, whose mean is given by the theory of statistical room acoustics and whose variance is learned from training data. We then derive an Expectation-Maximization (EM) algorithm to estimate the model parameters in the Maximum A Posteriori (MAP) sense given prior knowledge about the microphone spacing and the source positions. This algorithm provides a principled solution to the well-known permutation problem and achieves better separation performance than other algorithms exploiting the same prior knowledge.
Complete list of metadatas

Cited literature [7 references]  Display  Hide  Download

https://hal.inria.fr/inria-00566868
Contributor : Ngoc Duong <>
Submitted on : Sunday, February 20, 2011 - 7:57:38 PM
Last modification on : Thursday, March 21, 2019 - 2:20:42 PM
Document(s) archivé(s) le : Saturday, May 21, 2011 - 2:38:42 AM

File

icassp2011.pdf
Files produced by the author(s)

Identifiers

Citation

Ngoc Duong, Emmanuel Vincent, Rémi Gribonval. An acoustically-motivated spatial prior for under-determined reverberant source separation. Acoustics, Speech and Signal Processing, IEEE Conference on (ICASSP'11), May 2011, Prague, Czech Republic. ⟨10.1109/ICASSP.2011.5946315⟩. ⟨inria-00566868⟩

Share

Metrics

Record views

742

Files downloads

349