Skip to Main content Skip to Navigation
Conference papers

A diagonal plus low-rank covariance model for computationally efficient source separation

Antoine Liutkus 1 Kazuyoshi Yoshii 2
1 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper presents an accelerated version of positive semidef-inite tensor factorization (PSDTF) for blind source separation. PSDTF works better than nonnegative matrix factoriza-tion (NMF) by dropping the arguable assumption that audio signals can be whitened in the frequency domain by using short-term Fourier transform (STFT). Indeed, this assumption only holds true in an ideal situation where each frame is infinitely long and the target signal is completely stationary in each frame. PSDTF thus deals with full covariance matrices over frequency bins instead of forcing them to be diagonal as in NMF. Although PSDTF significantly outperforms NMF in terms of separation performance, it suffers from a heavy computational cost due to the repeated inversion of big covariance matrices. To solve this problem, we propose an intermediate model based on diagonal plus low-rank covariance matrices and derive the expectation-maximization (EM) algorithm for efficiently updating the parameters of PSDTF. Experimental results showed that our method can dramatically reduce the complexity of PSDTF by several orders of magnitude without a significant decrease in separation performance. Index Terms— Blind source separation, nonnegative matrix factorization, positive semidefinite tensor factorization, low-rank approximation.
Document type :
Conference papers
Complete list of metadata

Cited literature [17 references]  Display  Hide  Download
Contributor : Antoine Liutkus Connect in order to contact the contributor
Submitted on : Friday, September 1, 2017 - 9:08:57 PM
Last modification on : Wednesday, November 3, 2021 - 7:57:43 AM
Long-term archiving on: : Saturday, December 2, 2017 - 2:26:37 PM


Files produced by the author(s)


  • HAL Id : hal-01580733, version 1



Antoine Liutkus, Kazuyoshi Yoshii. A diagonal plus low-rank covariance model for computationally efficient source separation. IEEE international workshop on machine learning for signal processing (MLSP), Sep 2017, Tokyo, Japan. ⟨hal-01580733⟩



Les métriques sont temporairement indisponibles