Skip to Main content Skip to Navigation
New interface
Conference papers

An EM Algorithm for Audio Source Separation Based on the Convolutive Transfer Function

Xiaofei Li 1 Laurent Girin 2, 1 Radu Horaud 1 
1 PERCEPTION - Interpretation and Modelling of Images and Videos
Inria Grenoble - Rhône-Alpes, Grenoble INP - Institut polytechnique de Grenoble - Grenoble Institute of Technology, LJK - Laboratoire Jean Kuntzmann
Abstract : This paper addresses the problem of audio source separation from (possibly under-determined) multichannel convolutive mixtures. We propose a separation method based on the convolutive transfer function (CTF) in the short-time Fourier transform domain. For strongly reverberant signals, the CTF is a much more appropriate model than the widely-used multiplicative transfer function approximation. An Expectation-Maximization (EM) algorithm is proposed to jointly estimate the model parameters, including the CTF coefficients of the mixing filters, and infer the sources. Experiments show that the proposed method provides very satisfactory performance on highly reverberant speech mixtures.
Complete list of metadata

Cited literature [17 references]  Display  Hide  Download
Contributor : Perception team Connect in order to contact the contributor
Submitted on : Tuesday, July 25, 2017 - 6:12:12 PM
Last modification on : Wednesday, May 4, 2022 - 12:12:05 PM


Files produced by the author(s)



Xiaofei Li, Laurent Girin, Radu Horaud. An EM Algorithm for Audio Source Separation Based on the Convolutive Transfer Function. WASPAA 2017 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2017, New Paltz, NY, United States. pp.56-60, ⟨10.1109/WASPAA.2017.8169994⟩. ⟨hal-01568818⟩



Record views


Files downloads