Extension of sparse, adaptive signal decompositions to semi-blind audio source separation

Andrew Nesbit; Emmanuel Vincent; Mark D. Plumbley

Conference paper, Year: 2009


Andrew Nesbit

Role: Author

Centre for Digital Music

Emmanuel Vincent

Role: Author
PersonId : 1256
IdHAL : emmanuelv
ORCID : 0000-0002-0183-7289
IdRef : 089360176

Speech and sound data modeling and processing

Mark D. Plumbley

Role: Author

Centre for Digital Music

Abstract

We apply sparse, fast and flexible adaptive lapped orthogonal transforms to underdetermined audio source separation using the time-frequency masking framework. This framework normally requires the sources to overlap as little as possible in the time-frequency plane. In this work, we apply our adaptive transform schemes to the semi-blind case, in which the mixing system is already known but the sources are unknown. By assuming that exactly two sources are active at each time-frequency index, we determine both the adaptive transforms and the estimated source coefficients using l1-norm minimisation. We show average performance of 12-13 dB SDR on speech and music mixtures, and show that the adaptive transform scheme offers improvements on the order of several tenths of a dB over transforms with constant block length. Comparison with previously studied upper bounds suggests that the potential for future improvements is significant.
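As an illustration of the coefficient-estimation step only, the following is a minimal sketch of l1-norm minimisation under the two-active-sources assumption. It is not the authors' implementation. It assumes a two-channel (stereo) mixture, real-valued transform coefficients, and a known real mixing matrix `A` whose columns are pairwise non-collinear; none of these details are stated in the abstract. The function name `l1_two_source_estimate` and the array layout are hypothetical. The adaptive choice of transform is not covered here.

```python
import itertools

import numpy as np


def l1_two_source_estimate(A, X):
    """Hypothetical helper: per-coefficient l1-minimal two-source estimate.

    A : (2, N) known mixing matrix (assumed stereo, real-valued).
    X : (2, T) mixture coefficients, one column per time-frequency index.
    Returns S : (N, T) with at most two nonzero entries per column,
    chosen so that A @ S == X and the l1 norm of each column is minimal
    over all source pairs.
    """
    n_src = A.shape[1]
    n_coef = X.shape[1]
    S = np.zeros((n_src, n_coef))
    best_cost = np.full(n_coef, np.inf)

    # Try every pair of sources; keep, per column, the pair with smallest l1 norm.
    for j, k in itertools.combinations(range(n_src), 2):
        A_pair = A[:, [j, k]]              # 2x2 sub-matrix for this pair
        C = np.linalg.solve(A_pair, X)     # exact coefficients for this pair
        cost = np.abs(C).sum(axis=0)       # l1 norm per column
        better = cost < best_cost
        S[:, better] = 0.0                 # clear any previous pair's values
        S[j, better] = C[0, better]
        S[k, better] = C[1, better]
        best_cost[better] = cost[better]
    return S


# Small demonstration on synthetic data (three sources, two channels).
rng = np.random.default_rng(0)
A_demo = np.array([[1.0, 0.0, 2 ** -0.5],
                   [0.0, 1.0, 2 ** -0.5]])
X_demo = rng.standard_normal((2, 8))
S_demo = l1_two_source_estimate(A_demo, X_demo)
print("max reconstruction error:", np.abs(A_demo @ S_demo - X_demo).max())
```

With two channels, inverting each 2x2 sub-matrix and keeping the pair with the smallest l1 norm is a simple exhaustive way to obtain an l1-minimal solution; the number of pairs grows quadratically with the number of sources, which is acceptable for a handful of sources.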

Domains

Signal and Image Processing [eess.SP]

Main file

nesbit_ICA09.pdf (159.18 KB)

Origin: Publisher files allowed on an open archive

https://inria.hal.science/inria-00544153

Submitted on: Tuesday, 7 December 2010, 13:57:19

Last modified on: Friday, 24 March 2023, 14:52:53

Long-term archiving on: Tuesday, 8 March 2011, 04:18:50

Dates and versions

inria-00544153 , version 1 (07-12-2010)

Identifiers

HAL Id : inria-00544153 , version 1

Cite

Andrew Nesbit, Emmanuel Vincent, Mark D. Plumbley. Extension of sparse, adaptive signal decompositions to semi-blind audio source separation. 8th Int. Conf. on Independent Component Analysis and Signal Separation (ICA), Mar 2009, Paraty, Brazil. pp. 605-612. ⟨inria-00544153⟩

Collections

EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA IRISA-D5 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE UR1-MATH-NUM
