Conference papers

An EM Algorithm for Audio Source Separation Based on the Convolutive Transfer Function

Xiaofei Li (1), Laurent Girin (2, 1), Radu Horaud (1)
(1) PERCEPTION [2016-2019] - Interpretation and Modelling of Images and Videos [2016-2019]
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, Grenoble INP [2007-2019] - Institut polytechnique de Grenoble - Grenoble Institute of Technology [2007-2019]
Abstract: This paper addresses the problem of audio source separation from (possibly under-determined) multichannel convolutive mixtures. We propose a separation method based on the convolutive transfer function (CTF) in the short-time Fourier transform domain. For strongly reverberant signals, the CTF is a much more appropriate model than the widely used multiplicative transfer function approximation. An Expectation-Maximization (EM) algorithm is proposed to jointly estimate the model parameters, including the CTF coefficients of the mixing filters, and infer the sources. Experiments show that the proposed method provides very satisfactory performance on highly reverberant speech mixtures.
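
For readers unfamiliar with the two models contrasted in the abstract, the following is a minimal sketch for a single source-to-microphone path, written in generic notation that is an assumption of this note and not taken from the record: p is the STFT frame index, k the frequency bin, s(p,k) the source coefficient, x(p,k) the microphone coefficient, e(p,k) the noise, and Q the (hypothetical) CTF length in frames. Cross-band terms are neglected.

```latex
% Multiplicative transfer function (MTF) approximation:
% one complex gain per frequency bin, adequate when the room
% impulse response is short compared with the STFT window.
x(p,k) \approx a(k)\, s(p,k) + e(p,k)

% Convolutive transfer function (CTF) model:
% a frame-wise convolution in each frequency bin, so that
% reverberation spanning several frames can be represented.
x(p,k) \approx \sum_{p'=0}^{Q-1} a(p',k)\, s(p-p',k) + e(p,k)
```

With Q = 1 the CTF expression reduces to the MTF one, which is why the CTF is the more general choice when reverberation extends over several frames.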

Cited literature: 17 references
Contributor: Team Perception
Submitted on: Tuesday, July 25, 2017 - 6:12:12 PM
Last modification on: Wednesday, October 7, 2020 - 11:24:19 AM





Xiaofei Li, Laurent Girin, Radu Horaud. An EM Algorithm for Audio Source Separation Based on the Convolutive Transfer Function. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct 2017, New Paltz, NY, United States. pp.56-60, ⟨10.1109/WASPAA.2017.8169994⟩. ⟨hal-01568818⟩


