Text-informed audio source separation using nonnegative matrix partial co-factorization

Abstract : We consider a single-channel source separation problem consisting in separating speech from nonstationary background such as music. We introduce a novel approach called text-informed separation, where the source separation process is guided by the corresponding textual information. First, given the text, we propose to produce a speech example via either a speech synthesizer or a human. We then use this example to guide source separation and, for that purpose, we introduce a new variant of the nonnegative matrix partial co-factorization (NMPCF) model based on a so called excitation-filter-channel speech model. The proposed NMPCF model allows sharing the linguistic information between the example speech and the speech in the mixture. We then derive the corresponding multiplicative update (MU) rules for the parameter estimation. Experimental results over different types of mixtures and speech examples show the effectiveness of the proposed approach.
Type de document :
Communication dans un congrès
IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2013), Sep 2013, Southampton, United Kingdom. 2013
Liste complète des métadonnées

Littérature citée [24 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00870066
Contributeur : Alexey Ozerov <>
Soumis le : vendredi 4 octobre 2013 - 18:00:38
Dernière modification le : mardi 31 octobre 2017 - 08:52:02
Document(s) archivé(s) le : dimanche 5 janvier 2014 - 08:30:18

Fichier

MLSP2013_FINAL_VERSION.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00870066, version 1

Citation

Luc Le Magoarou, Alexey Ozerov, Ngoc Duong. Text-informed audio source separation using nonnegative matrix partial co-factorization. IEEE International Workshop on Machine Learning for Signal Processing (MLSP 2013), Sep 2013, Southampton, United Kingdom. 2013. 〈hal-00870066〉

Partager

Métriques

Consultations de la notice

460

Téléchargements de fichiers

481