Audio source separation using multiple deformed references

Nathan Souviraà-Labastie 1 Anaik Olivero 1 Emmanuel Vincent 2 Frédéric Bimbot 1
1 PANAMA - Parcimonie et Nouveaux Algorithmes pour le Signal et la Modélisation Audio
Inria Rennes – Bretagne Atlantique , IRISA-D5 - SIGNAUX ET IMAGES NUMÉRIQUES, ROBOTIQUE
2 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper deals with audio source separation guided by multiple audio references. We present a general framework where additional audio references for one or more sources of a given mixture are available. Each audio reference is another mixture which is supposed to contain at least one source similar to one of the target sources. Deformations between the sources of interest and their references are modeled in a general manner. A nonnegative matrix co-factorization algorithm is used, which allows information to be shared between the considered mixtures. We run our algorithm on music plus voice mixtures with music and/or voice references. Applied to movie and TV series data, our algorithm improves the signal-to-distortion ratio (SDR) of the sources with the lowest intensity by 9 to 12 decibels with respect to the original mixture.
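To illustrate what sharing information between mixtures can mean in a co-factorization, here is a minimal sketch. It is not the algorithm of the paper: it assumes a Euclidean cost, a single spectral dictionary `W` shared by a mixture spectrogram and one reference spectrogram, and no deformation model at all. The function name `cofactorize` and all parameter names are hypothetical.

```python
import numpy as np

def cofactorize(V_mix, V_ref, rank=4, n_iter=200, eps=1e-9, seed=0):
    """Jointly factorize V_mix ~ W @ H_mix and V_ref ~ W @ H_ref.

    Illustrative assumption: the two nonnegative spectrograms share the
    dictionary W, and the cost is the summed squared error (not the
    model or the cost used in the paper).
    """
    rng = np.random.default_rng(seed)
    n_freq = V_mix.shape[0]
    W = rng.random((n_freq, rank)) + eps
    H_mix = rng.random((rank, V_mix.shape[1])) + eps
    H_ref = rng.random((rank, V_ref.shape[1])) + eps
    costs = []
    for _ in range(n_iter):
        # Standard multiplicative updates for each activation matrix.
        H_mix *= (W.T @ V_mix) / (W.T @ W @ H_mix + eps)
        H_ref *= (W.T @ V_ref) / (W.T @ W @ H_ref + eps)
        # The shared dictionary accumulates evidence from both matrices.
        num = V_mix @ H_mix.T + V_ref @ H_ref.T
        den = W @ (H_mix @ H_mix.T + H_ref @ H_ref.T) + eps
        W *= num / den
        costs.append(np.sum((V_mix - W @ H_mix) ** 2)
                     + np.sum((V_ref - W @ H_ref) ** 2))
    return W, H_mix, H_ref, costs
```

Because `W` is updated from both spectrograms, spectral patterns learned from the reference constrain the decomposition of the mixture, which is the intuition behind reference-guided separation.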
Document type :
Conference papers

https://hal.inria.fr/hal-01017571
Contributor : Nathan Souviraà-Labastie
Submitted on : Wednesday, July 2, 2014 - 5:10:40 PM
Last modification on : Saturday, March 30, 2019 - 1:26:18 AM
Long-term archiving on : Tuesday, October 13, 2015 - 3:06:11 PM

File

souviraa_eusipco14.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01017571, version 1

Citation

Nathan Souviraà-Labastie, Anaik Olivero, Emmanuel Vincent, Frédéric Bimbot. Audio source separation using multiple deformed references. Eusipco, Sep 2014, Lisboa, Portugal. ⟨hal-01017571⟩
