Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation

Abstract : Separating multiple tracks from professionally produced music recordings (PPMRs) is still a challenging problem. We address this task with a user-guided approach in which the separation system is provided segmental information indicating the time activations of the particular instruments to separate. This information may typically be retrieved from manual annotation. We use a so-called multichannel nonnegative tensor factorization (NTF) model, in which the original sources are observed through a multichannel convolutive mixture and in which the source power spectrograms are jointly modeled by a 3-valence (time/frequency/source) tensor. Our user-guided separation method produced competitive results at the 2010 Signal Separation Evaluation Campaign, with sufficient quality for real-world music editing applications.
Type de document :
Communication dans un congrès
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'11), May 2011, Prague, Czech Republic. 2011
Liste complète des métadonnées

Littérature citée [13 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00564851
Contributeur : Alexey Ozerov <>
Soumis le : jeudi 10 février 2011 - 11:37:23
Dernière modification le : mercredi 11 avril 2018 - 01:54:12
Document(s) archivé(s) le : mercredi 11 mai 2011 - 02:52:35

Fichier

Ozerov_et_al_icassp11.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00564851, version 1

Citation

Alexey Ozerov, Cédric Févotte, Raphaël Blouet, Jean-Louis Durrieu. Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'11), May 2011, Prague, Czech Republic. 2011. 〈inria-00564851〉

Partager

Métriques

Consultations de la notice

546

Téléchargements de fichiers

440