Single-channel audio source separation with NMF: divergences, constraints and algorithms

Abstract : Spectral decomposition by nonnegative matrix factorisation (NMF) has become state-of-the-art practice in many audio signal processing tasks, such as source separation, enhancement or transcription. This chapter reviews the fundamentals of NMF-based audio decomposition, in unsupervised and informed settings. We formulate NMF as an optimisation problem and discuss the choice of the measure of fit. We present the standard majorisation-minimisation strategy to address optimisation for NMF with the common beta-divergence, a family of measures of fit that takes the quadratic cost, the generalised Kullback-Leibler divergence and the Itakura-Saito divergence as special cases. We discuss the reconstruction of time-domain components from the spectral factorisation and present common variants of NMF-based spectral decomposition: supervised and informed settings, regularised versions and temporal models.
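As a rough illustration of the optimisation problem the abstract describes, the following is a minimal sketch of NMF with the beta-divergence using multiplicative updates. It is not taken from the chapter: the function names `beta_divergence` and `nmf_mu`, the random initialisation and the small constant `eps` are assumptions made for this example. For beta between 1 and 2 these multiplicative updates are known to coincide with the majorisation-minimisation updates; outside that range the sketch should be read as a heuristic.

```python
import numpy as np

def beta_divergence(V, V_hat, beta):
    """Sum of element-wise beta-divergences d_beta(V | V_hat).

    beta = 2: half the squared Euclidean distance (quadratic cost),
    beta = 1: generalised Kullback-Leibler divergence,
    beta = 0: Itakura-Saito divergence.
    Assumes strictly positive V and V_hat.
    """
    if beta == 0:
        r = V / V_hat
        return np.sum(r - np.log(r) - 1)
    if beta == 1:
        return np.sum(V * np.log(V / V_hat) - V + V_hat)
    num = V**beta + (beta - 1) * V_hat**beta - beta * V * V_hat**(beta - 1)
    return np.sum(num / (beta * (beta - 1)))

def nmf_mu(V, K, beta=1.0, n_iter=100, seed=0, eps=1e-12):
    """Approximate V (F x N, nonnegative) by W @ H with K components.

    Hypothetical helper: alternates multiplicative updates of H and W
    and records the beta-divergence after each iteration.
    """
    rng = np.random.default_rng(seed)
    F, N = V.shape
    W = rng.random((F, K)) + eps  # spectral patterns (dictionary)
    H = rng.random((K, N)) + eps  # temporal activations
    costs = []
    for _ in range(n_iter):
        V_hat = W @ H + eps
        H *= (W.T @ (V * V_hat**(beta - 2))) / (W.T @ V_hat**(beta - 1) + eps)
        V_hat = W @ H + eps
        W *= ((V * V_hat**(beta - 2)) @ H.T) / (V_hat**(beta - 1) @ H.T + eps)
        costs.append(beta_divergence(V, W @ H + eps, beta))
    return W, H, costs
```

In an audio setting, `V` would typically be a magnitude or power spectrogram, with the columns of `W` acting as spectral patterns and the rows of `H` as their activations over time. Ratio masks built from the per-component products could then be applied to the complex spectrogram to reconstruct time-domain components.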
Document type :
Book sections

https://hal.inria.fr/hal-01631185
Contributor : Emmanuel Vincent
Submitted on : Wednesday, November 8, 2017 - 6:20:11 PM
Last modification on : Friday, June 14, 2019 - 6:31:17 PM
Long-term archiving on : Friday, February 9, 2018 - 1:21:50 PM

File

fevotte_book18.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01631185, version 1

Citation

Cédric Févotte, Emmanuel Vincent, Alexey Ozerov. Single-channel audio source separation with NMF: divergences, constraints and algorithms. Audio Source Separation, Springer, 2018. ⟨hal-01631185⟩
