Kernel Spectrogram models for source separation

Antoine Liutkus; Zafar Rafii; Bryan Pardo; Derry Fitzgerald; Laurent Daudet

Communication Dans Un Congrès Année : 2014

Kernel Spectrogram models for source separation

(1) , (2) , (2) , (3) , (4)

1
2
3
4

Antoine Liutkus

Fonction : Auteur
PersonId : 2740
IdHAL : antoine-liutkus
ORCID : 0000-0002-3458-6498
IdRef : 167600419

Analysis, perception and recognition of speech

Zafar Rafii

Fonction : Auteur

Northwestern University [Evanston]

Bryan Pardo

Fonction : Auteur

Northwestern University [Evanston]

Derry Fitzgerald

Fonction : Auteur

NIMBUS Centre [Cork]

Laurent Daudet

Fonction : Auteur

Institut Langevin - Ondes et Images (UMR7587)

Résumé

In this study, we introduce a new framework called Kernel Additive Modelling for audio spectrograms that can be used for multichannel source separation. It assumes that the spectrogram of a source at any time-frequency bin is close to its value in a neighbourhood indicated by a source-specific proximity kernel. The rationale for this model is to easily account for features like periodicity, stability over time or frequency, self-similarity, etc. In many cases, such local dynamics are indeed much more natural to assess than any global model such as a tensor factorization. This framework permits one to use different proximity kernels for different sources and to estimate them blindly using their mixtures only. Estimation is performed using a variant of the kernel backfitting algorithm that allows for multichannel mixtures and permits parallelization. Experimental results on the separation of vocals from musical backgrounds demonstrate the efficiency of the approach.

Mots clés

audio source separation spatial filtering spectrogram models

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP]

Fichier principal

KAM_HSCMAv2.pdf (237.5 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Antoine Liutkus : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00959384

Soumis le : lundi 16 février 2015-00:28:14

Dernière modification le : vendredi 19 avril 2024-16:18:59

Archivage à long terme le : dimanche 16 avril 2017-08:35:29

Dates et versions

hal-00959384 , version 1 (14-03-2014)

hal-00959384 , version 2 (15-03-2014)

hal-00959384 , version 3 (21-03-2014)

hal-00959384 , version 4 (16-02-2015)

Identifiants

HAL Id : hal-00959384 , version 4

Citer

Antoine Liutkus, Zafar Rafii, Bryan Pardo, Derry Fitzgerald, Laurent Daudet. Kernel Spectrogram models for source separation. HSCMA, May 2014, Nancy, France. ⟨hal-00959384v4⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ESPCI CNRS INRIA PARISTECH IL LANGEVIN UNIV-LORRAINE INRIA2 LORIA PSL SORBONNE-UNIVERSITE SU-SCIENCES UP-SCIENCES ESPCI-PSL

681 Consultations

1154 Téléchargements

Kernel Spectrogram models for source separation

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager