Alpha-stable low-rank plus residual decomposition for speech enhancement

In this study, we propose a novel probabilistic model for separating clean speech signals from noisy mixtures by decomposing the mixture spectrograms into a structured speech part and a more flexible residual part. The main novelty in our model is that it uses a family of heavy-tailed distributions, so called the α-stable distributions, for modeling the residual signal. We develop an expectation-maximization algorithm for parameter estimation and a Monte Carlo scheme for posterior estimation of the clean speech. Our experiments show that the proposed method outperforms relevant factorization-based algorithms by a significant margin.

Mots clés

Audio source separation Monte Carlo Expectation-Maximization Alpha-stable distributions Speech enhancement

Domaines

Traitement du signal et de l'image [eess.SP]

Fichier principal

2017102794510_839706_2832.pdf (310.34 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Antoine Liutkus : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01714909

Soumis le : jeudi 22 février 2018-07:51:25

Dernière modification le : jeudi 1 février 2024-10:05:48

Archivage à long terme le : mercredi 23 mai 2018-12:11:22

Dates et versions

hal-01714909 , version 1 (22-02-2018)

Identifiants

HAL Id : hal-01714909 , version 1
DOI : 10.1109/ICASSP.2018.8461539

Citer

Umut Şimşekli, Halil Erdogan, Simon Leglaive, Antoine Liutkus, Roland Badeau, et al.. Alpha-stable low-rank plus residual decomposition for speech enhancement. ICASSP: International Conference on Acoustics, Speech, and Signal Processing, Apr 2018, Calgary, Canada. pp.651-655, ⟨10.1109/ICASSP.2018.8461539⟩. ⟨hal-01714909⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM UNIV-RENNES1 CNRS INRIA IRISA PARISTECH ZENITH LIRMM INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC MIPS UNIV-MONTPELLIER UNIV-RENNES LTCI IDS S2A ANR UR1-MATH-NUM

496 Consultations

435 Téléchargements