Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation

Alexey Ozerov; Cédric Févotte; Raphaël Blouet; Jean-Louis Durrieu

Conference paper, Year: 2011

Alexey Ozerov

Role: Author
PersonId : 888401

Speech and sound data modeling and processing

Cédric Févotte

Role: Author
PersonId : 184864
IdHAL : cedric-fevotte
ORCID : 0000-0003-3801-5534
IdRef : 083298460

Laboratoire Traitement et Communication de l'Information

Raphaël Blouet

Role: Author

Yacast

Jean-Louis Durrieu

Role: Author
PersonId : 890630

Laboratoire de Traitement du signal [EPFL] / Signal Processing Laboratories

Abstract

Separating multiple tracks from professionally produced music recordings (PPMRs) is still a challenging problem. We address this task with a user-guided approach in which the separation system is provided segmental information indicating the time activations of the particular instruments to separate. This information may typically be retrieved from manual annotation. We use a so-called multichannel nonnegative tensor factorization (NTF) model, in which the original sources are observed through a multichannel convolutive mixture and in which the source power spectrograms are jointly modeled by a 3-valence (time/frequency/source) tensor. Our user-guided separation method produced competitive results at the 2010 Signal Separation Evaluation Campaign, with sufficient quality for real-world music editing applications.
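The tensor model and the role of the time annotations can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes a single-channel mixture (no convolutive mixing), a Euclidean cost with standard multiplicative updates, and a fixed binary assignment of components to sources. The names `guided_ntf`, `soft_masks`, `X`, `Q`, `A`, `W` and `H` are hypothetical. Source power spectrograms are gathered in a tensor `P[f, n, j] = sum_k W[f, k] * H[n, k] * Q[j, k]`, and the user annotation `A` forces the activations of a source to zero in frames where that source is marked silent.

```python
import numpy as np

def guided_ntf(X, Q, A, n_iter=200, eps=1e-12, seed=0):
    """Illustrative single-channel sketch (an assumption, not the paper's method).

    X: (F, N) nonnegative mixture power spectrogram.
    Q: (J, K) binary matrix, Q[j, k] = 1 if component k belongs to source j
       (assumed: each component belongs to exactly one source).
    A: (J, N) binary annotation, A[j, n] = 1 if source j is active in frame n.
    Returns W (F, K), H (N, K) and the source tensor P (F, N, J).
    """
    rng = np.random.default_rng(seed)
    F, N = X.shape
    K = Q.shape[1]
    # A component may be active in a frame only if its source is annotated active.
    H_mask = (A.T @ Q > 0).astype(float)          # shape (N, K)
    W = rng.random((F, K)) + eps
    H = (rng.random((N, K)) + eps) * H_mask
    for _ in range(n_iter):
        # Multiplicative updates for the Euclidean cost ||X - W H^T||^2.
        # Entries of H that start at zero stay at zero, so the annotation holds.
        W *= (X @ H) / ((W @ H.T) @ H + eps)
        H *= (X.T @ W) / ((W @ H.T).T @ W + eps)
    # 3-way (frequency/time/source) tensor of modeled source power spectrograms.
    P = np.einsum("fk,nk,jk->fnj", W, H, Q)
    return W, H, P

def soft_masks(P, eps=1e-12):
    """Per-source time-frequency masks: each source's share of the modeled power."""
    return P / (P.sum(axis=2, keepdims=True) + eps)
```

Under these assumptions, multiplying the mixture's time-frequency representation by `soft_masks(P)[:, :, j]` gives a rough estimate of source `j`. Encoding the annotation as structural zeros is a convenient choice here because multiplicative updates preserve zeros without any extra projection step.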

Domains

Signal and image processing [eess.SP]

Main file

Ozerov_et_al_icassp11.pdf (237.36 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/inria-00564851

Submitted on: Thursday 10 February 2011, 11:37:23

Last modified on: Monday 9 October 2023, 12:49:40

Long-term archiving on: Wednesday 11 May 2011, 02:52:35

Dates and versions

inria-00564851, version 1 (10-02-2011)

Identifiers

HAL Id: inria-00564851, version 1

Cite

Alexey Ozerov, Cédric Févotte, Raphaël Blouet, Jean-Louis Durrieu. Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'11), May 2011, Prague, Czech Republic. ⟨inria-00564851⟩

Collections

INSTITUT-TELECOM EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA PARISTECH IRISA-D5 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE LTCI ANR UR1-MATH-NUM

378 views

692 downloads
