Benchmarking flexible adaptive time-frequency transforms for underdetermined audio source separation

Andrew Nesbit; Emmanuel Vincent; Mark D. Plumbley

Conference paper, Year: 2009

Andrew Nesbit

Role: Author

Centre for Digital Music

Emmanuel Vincent

Role: Author
PersonId : 1256
IdHAL : emmanuelv
ORCID : 0000-0002-0183-7289
IdRef : 089360176

Speech and sound data modeling and processing

Mark D. Plumbley

Role: Author

Centre for Digital Music

Abstract

We have implemented several fast and flexible adaptive lapped orthogonal transform (LOT) schemes for underdetermined audio source separation. This problem is generally addressed by time-frequency masking, which requires the sources to be disjoint in the time-frequency domain. We have already shown that disjointness can be increased via adaptive dyadic LOTs. Taking inspiration from the windowing schemes used in many audio coding frameworks, we improve on earlier results in two ways. Firstly, we consider non-dyadic LOTs, which better match time-varying signal structures. Secondly, we allow a greater range of overlapping window profiles to decrease window boundary artifacts. The new scheme is benchmarked through oracle evaluations and is shown to decrease computation time by over an order of magnitude compared with very general schemes, whilst maintaining high separation performance and flexible signal adaptivity. As the results demonstrate, this work may find practical applications in high-fidelity audio source separation.
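To illustrate what an oracle evaluation of time-frequency masking involves, here is a minimal sketch under several simplifying assumptions that are not taken from the paper: a single-channel mixture of two synthetic sources, a fixed block-wise orthonormal DCT standing in for the adaptive LOTs, and a binary mask that assigns each coefficient to the source with the largest magnitude. All function names (`dct_matrix`, `oracle_binary_masking`, `sdr_db`) are hypothetical and chosen only for this example.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II matrix of size n x n (rows are basis vectors)."""
    k = np.arange(n)[:, None]
    t = np.arange(n)[None, :]
    mat = np.sqrt(2.0 / n) * np.cos(np.pi * (t + 0.5) * k / n)
    mat[0, :] /= np.sqrt(2.0)
    return mat

def block_transform(x, n):
    """Split x into non-overlapping frames of length n and transform each."""
    return x.reshape(-1, n) @ dct_matrix(n).T

def block_inverse(coeffs, n):
    """Invert block_transform; the matrix is orthonormal, so inverse = transpose."""
    return (coeffs @ dct_matrix(n)).reshape(-1)

def oracle_binary_masking(sources, n):
    """Oracle binary masking of a single-channel mixture.

    sources: array of shape (num_sources, num_samples), with num_samples a
    multiple of n. The true sources are used to build the masks, which is
    what makes this an oracle (upper-bound) estimate, not a blind method.
    """
    coeffs = np.stack([block_transform(s, n) for s in sources])
    mix = coeffs.sum(axis=0)  # the transform is linear
    dominant = np.argmax(np.abs(coeffs), axis=0)
    return np.stack([
        block_inverse(np.where(dominant == j, mix, 0.0), n)
        for j in range(len(sources))
    ])

def sdr_db(reference, estimate):
    """Signal-to-distortion ratio in dB (simple energy-ratio definition)."""
    err = reference - estimate
    return 10.0 * np.log10(np.sum(reference**2) / (np.sum(err**2) + 1e-12))

# Assumed toy signals: two sinusoids sampled at 16 kHz.
t = np.arange(4096) / 16000.0
sources = np.stack([np.sin(2 * np.pi * 440 * t), np.sin(2 * np.pi * 3000 * t)])
estimates = oracle_binary_masking(sources, n=256)
for ref, est in zip(sources, estimates):
    print(f"SDR: {sdr_db(ref, est):.1f} dB")
```

Because the masks partition the coefficients, the estimates sum back to the mixture exactly. The more disjoint the sources are in the chosen transform domain, the less each masked estimate is corrupted by the other source, which is why the choice of transform matters for this kind of separation.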

Domains

Signal and image processing [eess.SP]

Main file

nesbit_ICASSP09.pdf (102.58 KB)

Origin: Publisher files allowed on an open archive

https://inria.hal.science/inria-00544160

Submitted on: Tuesday, 7 December 2010, 14:04:27

Last modified on: Friday, 24 March 2023, 14:52:53

Long-term archiving on: Monday, 5 November 2012, 12:31:18

Dates and versions

inria-00544160 , version 1 (07-12-2010)

Identifiers

HAL Id : inria-00544160 , version 1

Cite

Andrew Nesbit, Emmanuel Vincent, Mark D. Plumbley. Benchmarking flexible adaptive time-frequency transforms for underdetermined audio source separation. 2009 IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Apr 2009, Taipei, Taiwan. pp. 37-40. ⟨inria-00544160⟩

Collections

EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA IRISA-D5 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE UR1-MATH-NUM

170 views

162 downloads
