Speech enhancement with variational autoencoders and alpha-stable distributions

This paper focuses on single-channel semi-supervised speech enhancement. We learn a speaker-independent deep generative speech model using the framework of variational autoencoders. The noise model remains unsupervised because we do not assume prior knowledge of the noisy recording environment. In this context, our contribution is to propose a noise model based on alpha-stable distributions, instead of the more conventional Gaussian non-negative matrix factorization approach found in previous studies. We develop a Monte Carlo expectation-maximization algorithm for estimating the model parameters at test time. Experimental results show the superiority of the proposed approach both in terms of perceptual quality and intelligibility of the enhanced speech signal.
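
To give a feel for why a heavy-tailed noise model differs from a Gaussian one, the following is a minimal illustrative sketch, not the paper's noise model or its estimation algorithm. It draws real-valued, standard symmetric alpha-stable samples with the Chambers-Mallows-Stuck method and compares how often large values occur. The function names `sample_sas` and `tail_fraction`, the values of alpha, the sample size, and the threshold are all assumptions chosen for illustration.

```python
import numpy as np


def sample_sas(alpha, size, rng):
    """Draw standard symmetric alpha-stable samples (0 < alpha <= 2)
    with the Chambers-Mallows-Stuck method."""
    if not 0.0 < alpha <= 2.0:
        raise ValueError("alpha must lie in (0, 2]")
    v = rng.uniform(-np.pi / 2, np.pi / 2, size)  # uniform angle
    w = rng.exponential(1.0, size)                # unit-mean exponential
    return (np.sin(alpha * v) / np.cos(v) ** (1.0 / alpha)
            * (np.cos((1.0 - alpha) * v) / w) ** ((1.0 - alpha) / alpha))


def tail_fraction(x, threshold=6.0):
    """Fraction of samples whose magnitude exceeds the threshold."""
    return float(np.mean(np.abs(x) > threshold))


# Illustrative settings only (not taken from the paper).
rng = np.random.default_rng(0)
gaussian_like = sample_sas(2.0, 200_000, rng)  # alpha = 2: Gaussian with variance 2
heavy_tailed = sample_sas(1.5, 200_000, rng)   # alpha < 2: heavy tails

print("tail fraction, alpha=2.0:", tail_fraction(gaussian_like))
print("tail fraction, alpha=1.5:", tail_fraction(heavy_tailed))
```

With alpha = 2 the samples are Gaussian, whereas with alpha = 1.5 large-magnitude values occur far more often, which is the kind of impulsive behaviour a heavy-tailed model can accommodate.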

Keywords

Monte Carlo expectation-maximization; Variational autoencoders; Alpha-stable distribution; Speech enhancement

Domains

Signal and image processing [eess.SP]; Neural networks [cs.NE]

Main file

LSLGH-icassp2019.pdf (535.25 KB)

Origin: Files produced by the author(s)

Contributor: Simon Leglaive

https://inria.hal.science/hal-02005106

Submitted on: Friday, February 8, 2019, 11:25:30

Last modified on: Thursday, April 4, 2024, 21:20:51

Long-term archiving on: Thursday, May 9, 2019, 13:46:30

Dates and versions

hal-02005106, version 1 (08-02-2019)

License

Attribution

Identifiers

HAL Id: hal-02005106, version 1
arXiv: 1902.03926
DOI: 10.1109/ICASSP.2019.8682546

Cite

Simon Leglaive, Umut Şimşekli, Antoine Liutkus, Laurent Girin, Radu Horaud. Speech enhancement with variational autoencoders and alpha-stable distributions. ICASSP 2019 - 44th IEEE International Conference on Acoustics, Speech and Signal Processing, May 2019, Brighton, United Kingdom. pp.541-545, ⟨10.1109/ICASSP.2019.8682546⟩. ⟨hal-02005106⟩
