Audio Source Separation Based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization

Xiaofei Li; Laurent Girin; Radu Horaud

doi:10.1109/ICASSP.2017.7952214

Communication Dans Un Congrès Année : 2017

Audio Source Separation Based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization

(1) , (2, 1) , (1)

1
2

Xiaofei Li

Fonction : Auteur

Interpretation and Modelling of Images and Videos

Laurent Girin

Fonction : Auteur
PersonId : 3682
IdHAL : laurent-girin
ORCID : 0000-0002-9214-8760
IdRef : 088998037

GIPSA - Cognitive Robotics, Interactive Systems, & Speech Processing

Interpretation and Modelling of Images and Videos

Radu Horaud

Fonction : Auteur
PersonId : 16183
IdHAL : radu-horaud
ORCID : 0000-0001-5232-024X
IdRef : 032302495

Interpretation and Modelling of Images and Videos

Résumé

This paper addresses the problem of under-determined convolutive audio source separation in a semi-oracle configuration where the mixing filters are assumed to be known. We propose a separation procedure based on the convolutive transfer function (CTF), which is a more appropriate model for strongly reverberant signals than the widely-used multi-plicative transfer function approximation. In the short-time Fourier transform domain, source signals are estimated by minimizing the mixture fitting cost using Lasso optimization, with a $l_1$-norm regularization to exploit the spectral sparsity of source signals. Experiments show that the proposed method achieves satisfactory performance on highly reverberant speech mixtures, with a much lower computational cost compared to time-domain dual techniques.

Mots clés

convolutive transfer function $l_1$-norm regularization Source separation

Domaines

Acoustique [physics.class-ph] Son [cs.SD] Traitement du signal et de l'image [eess.SP]

Fichier principal

ctf_ss.pdf (251.25 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Perception team : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01430754

Soumis le : mardi 10 janvier 2017-11:18:18

Dernière modification le : jeudi 4 avril 2024-20:52:39

Archivage à long terme le : mardi 11 avril 2017-14:11:10

Dates et versions

hal-01430754 , version 1 (10-01-2017)

Identifiants

HAL Id : hal-01430754 , version 1
DOI : 10.1109/ICASSP.2017.7952214

Citer

Xiaofei Li, Laurent Girin, Radu Horaud. Audio Source Separation Based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization. ICASSP 2017 - IEEE International Conference on Acoustics, Speech and Signal Processing, Mar 2017, New Orleans, United States. pp.541-545, ⟨10.1109/ICASSP.2017.7952214⟩. ⟨hal-01430754⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 UGA CNRS INRIA IRISA GIPSA GIPSA-DPC LJK LJK_GI LJK_GI_PERCEPTION GIPSA-CRISSP INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

674 Consultations

742 Téléchargements

Audio Source Separation Based on Convolutive Transfer Function and Frequency-Domain Lasso Optimization

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager