# Audio Source Separation Based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization

1 PERCEPTION - Interpretation and Modelling of Images and Videos
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
2 GIPSA-CRISSP - CRISSP
GIPSA-DPC - Département Parole et Cognition, GIPSA-PPC - GIPSA Pôle Parole et Cognition
Abstract : This paper addresses the problem of under-determined convolutive audio source separation in a semi-oracle configuration where the mixing filters are assumed to be known. We propose a separation procedure based on the convolutive transfer function (CTF), which is a more appropriate model for strongly reverberant signals than the widely-used multi-plicative transfer function approximation. In the short-time Fourier transform domain, source signals are estimated by minimizing the mixture fitting cost using Lasso optimization, with a $l_1$-norm regularization to exploit the spectral sparsity of source signals. Experiments show that the proposed method achieves satisfactory performance on highly reverberant speech mixtures, with a much lower computational cost compared to time-domain dual techniques.
Keywords :
Document type :
Conference papers
Domain :

Cited literature [18 references]

https://hal.inria.fr/hal-01430754
Contributor : Team Perception <>
Submitted on : Tuesday, January 10, 2017 - 11:18:18 AM
Last modification on : Monday, January 13, 2020 - 11:02:28 AM
Long-term archiving on: Tuesday, April 11, 2017 - 2:11:10 PM

### File

ctf_ss.pdf
Files produced by the author(s)

### Citation

Xiaofei Li, Laurent Girin, Radu Horaud. Audio Source Separation Based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2017), Mar 2017, New Orleans, United States. pp.541-545, ⟨10.1109/ICASSP.2017.7952214⟩. ⟨hal-01430754⟩

Record views