Speech enhancement based on nonnegative matrix factorization with mixed group sparsity constraint

Hien-Thanh Duong; Quoc-Cuong Nguyen; Cong-Phuong Nguyen; Thanh-Huan Tran; Ngoc Q. K. Duong

doi:10.1145/2833258.2833276

Communication Dans Un Congrès Année : 2015

Speech enhancement based on nonnegative matrix factorization with mixed group sparsity constraint

(1, 2) , (2, 3) , (3) , (4) , (5)

1
2
3
4
5

Hien-Thanh Duong

Fonction : Auteur

Hanoi University of Mining and Geology

International Research Institute MICA

Quoc-Cuong Nguyen

Fonction : Auteur

International Research Institute MICA

Hanoi University of Science and Technology

Cong-Phuong Nguyen

Fonction : Auteur

Hanoi University of Science and Technology

Thanh-Huan Tran

Fonction : Auteur

Hanoi University of Industry

Ngoc Q. K. Duong

Fonction : Auteur

Technicolor R & I [Cesson Sévigné]

Résumé

This paper addresses a challenging single-channel speech enhancement problem in real-world environment where speech signal is corrupted by high level background noise. While most state-of-the-art algorithms tries to estimate noise spectral power and filter it from the observed one to obtain enhanced speech, the paper discloses another approach inspired from audio source separation technique. In the considered method, generic spectral characteristics of speech and noise are first learned from various training signals by non-negative matrix factorization (NMF). They are then used to guide the similar factorization of the observed power spectrogram into speech part and noise part. Additionally, we propose to combine two existing group sparsity-inducing penalties in the optimization process and adapt the corresponding algorithm for parameter estimation based on mul-tiplicative update (MU) rule. Experiment results over different settings confirm the effectiveness of the proposed approach .

Mots clés

Speech enhancement audio source separation nonnegative matrix factorization multiplicative update spectral model group sparsity

Domaines

Machine Learning [stat.ML] Interface homme-machine [cs.HC] Traitement du signal et de l'image [eess.SP]

Fichier principal

paper_review.pdf (439.52 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Ngoc Duong : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01288291

Soumis le : mardi 15 mars 2016-11:06:10

Dernière modification le : jeudi 4 avril 2024-18:20:20

Archivage à long terme le : jeudi 16 juin 2016-10:34:33

Dates et versions

hal-01288291 , version 1 (15-03-2016)

Identifiants

HAL Id : hal-01288291 , version 1
DOI : 10.1145/2833258.2833276

Citer

Hien-Thanh Duong, Quoc-Cuong Nguyen, Cong-Phuong Nguyen, Thanh-Huan Tran, Ngoc Q. K. Duong. Speech enhancement based on nonnegative matrix factorization with mixed group sparsity constraint. 6th ACM International Symposium on Information and Communication Technology, Dec 2015, Hanoi, Vietnam. ⟨10.1145/2833258.2833276⟩. ⟨hal-01288291⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS

61 Consultations

433 Téléchargements

Speech enhancement based on nonnegative matrix factorization with mixed group sparsity constraint

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager