21734 articles – 15570 references  [version française]

hal-00717366, version 1

Semi-supervised {NMF} with time-frequency annotations for single-channel source separation

Augustin Lefèvre (, http://www.di.ens.fr/~lefevrea) 1, Francis Bach () 12, Cédric Févotte 3

ISMIR 2012 : 13th International Society for Music Information Retrieval Conference (2012) ??

Abstract: We formulate a novel extension of nonnegative matrix factorization (NMF) to take into account partial information on source-specific activity in the spectrogram. This information comes in the form of masking coefficients, such as those found in an ideal binary mask. We show that state-of-the-art results in source separation may be achieved with only a limited amount of correct annotation, and furthermore our algorithm is robust to incorrect annotations. Since in practice ideal annotations are not observed, we propose several supervision scenarios to estimate the ideal mask- ing coefficients. First, manual annotations by a trained user on a dedicated graphical user interface are shown to provide satisfactory performance although they are prone to errors. Second, we investigate simple learning strate- gies to predict the Wiener coefficients based on local information around a given time-frequency bin of the spec- trogram. Results on single-channel source separation show that time-frequency annotations allow to disambiguate the source separation problem, and learned annotations open the way for a completely unsupervised learning procedure for source separation with no human intervention.

  • 1:  SIERRA (INRIA Paris - Rocquencourt)
  • INRIA : PARIS - ROCQUENCOURT – Ecole normale supérieure de Paris - ENS Paris – CNRS : UMR8548
  • 2:  Laboratoire d'informatique de l'école normale supérieure (LIENS)
  • CNRS : UMR8548 – Ecole normale supérieure de Paris - ENS Paris
  • 3:  Laboratoire Traitement et Communication de l'Information [Paris] (LTCI)
  • Télécom ParisTech – CNRS : UMR5141
  • Domain : Mathematics/Statistics
    Statistics/Statistics Theory
  • Keywords : nonnegative matrix factorization – inpainting – matrix completion – single channel source separation – blind source separation – unsupervised learning
 
  • hal-00717366, version 1
  • oai:hal.archives-ouvertes.fr:hal-00717366
  • From: 
  • Submitted on: Thursday, 12 July 2012 16:45:21
  • Updated on: Monday, 16 July 2012 13:29:36