28620 articles – 22133 Notices  [english version]

inria-00541436, version 1

Nonnegative matrix factorization and spatial covariance model for under-determined reverberant audio source separation

Simon Arberet () a1, Alexey Ozerov () b2, Ngoc Q. K. Duong () b2, Emmanuel Vincent () b2, Rémi Gribonval () b2, Frédéric Bimbot () c2, Pierre Vandergheynst () 1

Information Sciences Signal Processing and their Applications (ISSPA), 2010 10th International Conference on (2010) 1--4

Résumé : We address the problem of blind audio source separation in the under-determined and convolutive case. The contribution of each source to the mixture channels in the time-frequency domain is modeled by a zero-mean Gaussian random vector with a full rank covariance matrix composed of two terms: a variance which represents the spectral properties of the source and which is modeled by a nonnegative matrix factorization (NMF) model and another full rank covariance matrix which encodes the spatial properties of the source contribution in the mixture. We address the estimation of these parameters by maximizing the likelihood of the mixture using an expectation-maximization (EM) algorithm. Theoretical propositions are corroborated by experimental studies on stereo reverberant music mixtures.

  • a –  EPFL
  • b –  INRIA
  • c –  CNRS
  • 1 :  LTS2 - EPFL
  • École Polytechnique Fédérale de Lausanne
  • 2 :  METISS (INRIA - IRISA)
  • CNRS : UMR6074 – INRIA – Institut National des Sciences Appliquées (INSA) - Rennes – Université de Rennes 1
  • Domaine : Informatique/Traitement du signal et de l'image
    Sciences de l'ingénieur/Traitement du signal et de l'image
 
  • inria-00541436, version 1
  • oai:hal.inria.fr:inria-00541436
  • Contributeur : 
  • Soumis le : Samedi 5 Février 2011, 12:53:44
  • Dernière modification le : Dimanche 6 Février 2011, 20:28:58