Multi-channel audio source separation using multiple deformed references

Nathan Souviraà-Labastie 1 Anaik Olivero 1 Emmanuel Vincent 2 Frédéric Bimbot 1
1 PANAMA - Parcimonie et Nouveaux Algorithmes pour le Signal et la Modélisation Audio
Inria Rennes – Bretagne Atlantique , IRISA-D5 - SIGNAUX ET IMAGES NUMÉRIQUES, ROBOTIQUE
2 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : We present a general multi-channel source separation framework where additional audio references are available for one (or more) source(s) of a given mixture.Each audio reference is another mixture which is supposed to contain at least one source similar to one of the target sources.Deformations between the sources of interest and their references are modeled in a linear manner using a generic formulation.This is done by adding transformation matrices to an excitation-filter model, hence affecting different axes, namely frequency, dictionary component or time. A nonnegative matrix co-factorization algorithm and a generalized expectation-maximization algorithm are used to estimate the parameters of the model.Different model parameterizations and different combinations of algorithms are tested on music plus voice mixtures guided by music and/or voice references and on professionally-produced music recordings guided by cover references.Our algorithms improve the signal-to-distortion ratio (SDR) of the sources with the lowest intensity by 9 to 15 decibels (dB) with respect to original mixtures.
Type de document :
Article dans une revue
IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2015, 23 (11), pp.1775-1787
Liste complète des métadonnées

Littérature citée [40 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01070298
Contributeur : Nathan Souviraà-Labastie <>
Soumis le : mercredi 25 novembre 2015 - 17:52:48
Dernière modification le : mercredi 16 mai 2018 - 11:24:07
Document(s) archivé(s) le : samedi 29 avril 2017 - 00:47:18

Fichier

multi_ss_replicas_taslp_jrnl_h...
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01070298, version 4

Citation

Nathan Souviraà-Labastie, Anaik Olivero, Emmanuel Vincent, Frédéric Bimbot. Multi-channel audio source separation using multiple deformed references. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2015, 23 (11), pp.1775-1787. 〈hal-01070298v4〉

Partager

Métriques

Consultations de la notice

546

Téléchargements de fichiers

450