Scalable audio separation with light kernel additive modelling

Antoine Liutkus; Derry Fitzgerald; Zafar Rafii

Conference paper, Year: 2015

Antoine Liutkus

Role: Author
PersonId : 2740
IdHAL : antoine-liutkus
ORCID : 0000-0002-3458-6498
IdRef : 167600419

Speech Modeling for Facilitating Oral-Based Communication

Analysis, perception and recognition of speech

Derry Fitzgerald

Role: Author

NIMBUS Centre [Cork]

Zafar Rafii

Role: Author

Northwestern University [Evanston]

Abstract

Recently, Kernel Additive Modelling (KAM) was proposed as a unified framework for multichannel audio source separation. Its main feature is the use of kernel models to describe the spectrograms of the sources locally. Such kernels can capture source features such as repetition, stability over time and/or frequency, and self-similarity. KAM notably subsumes many popular and effective state-of-the-art methods, including REPET and harmonic/percussive separation with median filters. In its initial form, however, it has an important drawback: its memory usage scales poorly with the number of sources. Indeed, KAM requires storing the full-resolution spectrogram of each source, which may become prohibitive for full-length tracks or many sources. In this paper, we show how KAM can be combined with a fast algorithm that compresses its parameters to address this scalability issue, thus enabling its use on small platforms or mobile devices.
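To give a rough sense of the memory issue described in the abstract, the following sketch compares storing one full-resolution spectrogram per source with storing a compressed representation. Every number here is an illustrative assumption rather than a figure from the paper: a 4-minute single-channel track at 44.1 kHz, a 4096-sample analysis window with a hop of 1024 samples, 4 sources, and 32-bit values. The rank-20 factorization is a hypothetical stand-in for "compressed parameters" and is not claimed to be the authors' algorithm.

```python
def dense_bytes(n_sources, n_freq, n_frames, bytes_per_value=4):
    """Bytes needed to keep one full F x T spectrogram per source."""
    return n_sources * n_freq * n_frames * bytes_per_value


def low_rank_bytes(n_sources, n_freq, n_frames, rank, bytes_per_value=4):
    """Bytes needed if each spectrogram is stored as two factors,
    (F x rank) and (rank x T). Hypothetical compression scheme,
    used only to illustrate the scaling."""
    return n_sources * rank * (n_freq + n_frames) * bytes_per_value


if __name__ == "__main__":
    # Assumed analysis settings (not taken from the paper).
    sample_rate = 44100
    duration_s = 240
    window = 4096
    hop = 1024

    n_freq = window // 2 + 1                      # one-sided spectrum bins
    n_frames = (duration_s * sample_rate) // hop  # approximate frame count

    full = dense_bytes(4, n_freq, n_frames)
    compact = low_rank_bytes(4, n_freq, n_frames, rank=20)
    print(f"full-resolution storage: {full / 1e6:.1f} MB")
    print(f"rank-20 storage:         {compact / 1e6:.1f} MB")
```

Under these assumptions the full-resolution storage is about 339 MB, against about 4 MB for the rank-20 factors. The dense cost grows with the product of frequency bins and frames for every added source, which is the scaling problem the abstract refers to.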

Keywords

sound source separation; randomized algorithms; Kernel Additive Modelling

Domains

Signal and image processing [eess.SP]; Information retrieval [cs.IR]

Main file

ICASSP-lightKAM.pdf (218.04 KB)

Origin: Files produced by the author(s)


https://inria.hal.science/hal-01114890

Submitted on: Tuesday, 10 February 2015, 13:50:10

Last modified on: Monday, 11 September 2023, 17:41:19

Dates and versions

hal-01114890 , version 1 (10-02-2015)

hal-01114890 , version 2 (10-02-2015)

Identifiers

HAL Id : hal-01114890 , version 2

Cite

Antoine Liutkus, Derry Fitzgerald, Zafar Rafii. Scalable audio separation with light kernel additive modelling. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, Apr 2015, Brisbane, Australia. ⟨hal-01114890v2⟩


Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD

1097 views

851 downloads
