On-the-fly audio source separation

Abstract: This paper addresses the challenging task of single-channel audio source separation. We introduce a novel concept of on-the-fly audio source separation which greatly simplifies the user's interaction with the system compared to state-of-the-art user-guided approaches. In the proposed framework, the user is only asked to listen to an audio mixture and type some keywords (e.g. "dog barking", "wind", etc.) describing the sound sources to be separated. These keywords are then used as text queries to search for audio examples from the internet to guide the separation process. In particular, we propose several approaches to efficiently exploit these retrieved examples, including an approach based on a generic spectral model with group sparsity-inducing constraints. Finally, we demonstrate the effectiveness of the proposed framework with mixtures containing various types of sounds.
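The abstract describes guiding separation with spectral models learned from retrieved audio examples. As a rough illustrative sketch only (not the authors' exact algorithm, which involves a generic spectral model with group sparsity-inducing constraints), the following NumPy code shows the basic example-guided NMF pipeline such frameworks build on: learn nonnegative spectral bases from each retrieved example, decompose the mixture spectrogram over the concatenated fixed bases, and recover each source with Wiener-style masks. All dimensions, variable names, and the synthetic "examples" are invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

def nmf(V, K, n_iter=100, W_fixed=None):
    """Multiplicative-update NMF: V ~= W @ H with nonnegative factors.
    If W_fixed is given, only the activations H are estimated."""
    F, N = V.shape
    W = (rng.random((F, K)) + 1e-3) if W_fixed is None else W_fixed
    H = rng.random((K, N)) + 1e-3
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + 1e-9)
        if W_fixed is None:
            W *= (V @ H.T) / (W @ H @ H.T + 1e-9)
    return W, H

# Hypothetical retrieved examples: magnitude spectrograms of two sound types
F, N = 64, 80
ex1 = rng.random((F, N))  # stand-in for, e.g., a "dog barking" example
ex2 = rng.random((F, N))  # stand-in for, e.g., a "wind" example

# Learn a small spectral dictionary per source from its example
W1, _ = nmf(ex1, K=5)
W2, _ = nmf(ex2, K=5)

# Decompose the mixture over the concatenated, fixed dictionaries
mix = 0.6 * ex1 + 0.4 * ex2
W = np.concatenate([W1, W2], axis=1)
_, H = nmf(mix, K=10, W_fixed=W)

# Wiener-style masks distribute the mixture energy between the sources
V1 = W1 @ H[:5]
V2 = W2 @ H[5:]
S1 = mix * V1 / (V1 + V2 + 1e-9)
S2 = mix * V2 / (V1 + V2 + 1e-9)
```

In an actual system the spectrograms would come from an STFT of real audio, and the group-sparsity constraints of the paper would additionally select which retrieved examples (or which basis groups) are relevant to the mixture.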

Cited literature: 19 references

https://hal.inria.fr/hal-01023221
Contributor: Alexey Ozerov
Submitted on: Friday, July 11, 2014 - 4:10:54 PM
Last modification on: Monday, July 14, 2014 - 8:52:53 AM
Document(s) archived on: Saturday, October 11, 2014 - 1:05:10 PM

File

ElBadawy_et_al_2014.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-01023221, version 1

Citation

Dalia El Badawy, Ngoc Q. K. Duong, Alexey Ozerov. On-the-fly audio source separation. 24th IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2014), Sep 2014, Reims, France. ⟨hal-01023221⟩

Metrics

Record views: 368
File downloads: 953