Benchmarking flexible adaptive time-frequency transforms for underdetermined audio source separation

Andrew Nesbit 1 Emmanuel Vincent 2 Mark Plumbley 1
2 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : We have implemented several fast and flexible adaptive lapped orthogonal transform (LOT) schemes for underdetermined audio source separation. This is generally addressed by time-frequency masking, requiring the sources to be disjoint in the time-frequency domain. We have already shown that disjointness can be increased via adaptive dyadic LOTs. By taking inspiration from the windowing schemes used in many audio coding frameworks, we improve on earlier results in two ways. Firstly, we consider non-dyadic LOTs which match the time-varying signal structures better. Secondly, we allow for a greater range of overlapping window profiles to decrease window boundary artifacts. This new scheme is benchmarked through oracle evaluations, and is shown to decrease computation time by over an order of magnitude compared to using very general schemes, whilst maintaining high separation performance and flexible signal adaptivity. As the results demonstrate, this work may find practical applications in high fidelity audio source separation.
Type de document :
Communication dans un congrès
2009 IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Apr 2009, Taipei, Taiwan. pp.37--40, 2009
Liste complète des métadonnées

Littérature citée [10 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00544160
Contributeur : Emmanuel Vincent <>
Soumis le : mardi 7 décembre 2010 - 14:04:27
Dernière modification le : mercredi 16 mai 2018 - 11:23:03
Document(s) archivé(s) le : lundi 5 novembre 2012 - 12:31:18

Fichier

nesbit_ICASSP09.pdf
Fichiers éditeurs autorisés sur une archive ouverte

Identifiants

  • HAL Id : inria-00544160, version 1

Citation

Andrew Nesbit, Emmanuel Vincent, Mark Plumbley. Benchmarking flexible adaptive time-frequency transforms for underdetermined audio source separation. 2009 IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Apr 2009, Taipei, Taiwan. pp.37--40, 2009. 〈inria-00544160〉

Partager

Métriques

Consultations de la notice

343

Téléchargements de fichiers

103