Kernel Spectrogram models for source separation

Abstract : In this study, we introduce a new framework called Kernel Additive Modelling for audio spectrograms that can be used for multichannel source separation. It assumes that the spectrogram of a source at any time-frequency bin is close to its value in a neighbourhood indicated by a source-specific proximity kernel. The rationale for this model is to easily account for features like periodicity, stability over time or frequency, self-similarity, etc. In many cases, such local dynamics are indeed much more natural to assess than any global model such as a tensor factorization. This framework permits one to use different proximity kernels for different sources and to estimate them blindly using their mixtures only. Estimation is performed using a variant of the kernel backfitting algorithm that allows for multichannel mixtures and permits parallelization. Experimental results on the separation of vocals from musical backgrounds demonstrate the efficiency of the approach.
Type de document :
Communication dans un congrès
HSCMA, May 2014, Nancy, France. 2014
Liste complète des métadonnées

Littérature citée [28 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00959384
Contributeur : Antoine Liutkus <>
Soumis le : lundi 16 février 2015 - 00:28:14
Dernière modification le : vendredi 16 novembre 2018 - 02:13:17
Document(s) archivé(s) le : dimanche 16 avril 2017 - 08:35:29

Fichier

KAM_HSCMAv2.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00959384, version 4

Citation

Antoine Liutkus, Zafar Rafii, Bryan Pardo, Derry Fitzgerald, Laurent Daudet. Kernel Spectrogram models for source separation. HSCMA, May 2014, Nancy, France. 2014. 〈hal-00959384v4〉

Partager

Métriques

Consultations de la notice

656

Téléchargements de fichiers

229