A diagonal plus low-rank covariance model for computationally efficient source separation

Antoine Liutkus 1, 2 Kazuyoshi Yoshii 3
2 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper presents an accelerated version of positive semidef-inite tensor factorization (PSDTF) for blind source separation. PSDTF works better than nonnegative matrix factoriza-tion (NMF) by dropping the arguable assumption that audio signals can be whitened in the frequency domain by using short-term Fourier transform (STFT). Indeed, this assumption only holds true in an ideal situation where each frame is infinitely long and the target signal is completely stationary in each frame. PSDTF thus deals with full covariance matrices over frequency bins instead of forcing them to be diagonal as in NMF. Although PSDTF significantly outperforms NMF in terms of separation performance, it suffers from a heavy computational cost due to the repeated inversion of big covariance matrices. To solve this problem, we propose an intermediate model based on diagonal plus low-rank covariance matrices and derive the expectation-maximization (EM) algorithm for efficiently updating the parameters of PSDTF. Experimental results showed that our method can dramatically reduce the complexity of PSDTF by several orders of magnitude without a significant decrease in separation performance. Index Terms— Blind source separation, nonnegative matrix factorization, positive semidefinite tensor factorization, low-rank approximation.
Type de document :
Communication dans un congrès
IEEE international workshop on machine learning for signal processing (MLSP), Sep 2017, Tokyo, Japan. 2017
Liste complète des métadonnées

Littérature citée [17 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01580733
Contributeur : Antoine Liutkus <>
Soumis le : vendredi 1 septembre 2017 - 21:08:57
Dernière modification le : jeudi 22 février 2018 - 08:46:14
Document(s) archivé(s) le : samedi 2 décembre 2017 - 14:26:37

Fichier

mlsp-2017-liutkus.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01580733, version 1

Citation

Antoine Liutkus, Kazuyoshi Yoshii. A diagonal plus low-rank covariance model for computationally efficient source separation. IEEE international workshop on machine learning for signal processing (MLSP), Sep 2017, Tokyo, Japan. 2017. 〈hal-01580733〉

Partager

Métriques

Consultations de la notice

203

Téléchargements de fichiers

247