
An introduction to multichannel NMF for audio source separation

Alexey Ozerov (1), Cédric Févotte (2, 3), Emmanuel Vincent (4)
2: IRIT-SC - Signal et Communications, IRIT - Institut de recherche en informatique de Toulouse
4: MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication, Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract: This chapter introduces multichannel nonnegative matrix factorization (NMF) methods for audio source separation. All the methods and some of their extensions are introduced within a more general local Gaussian modeling (LGM) framework. These methods are attractive because they allow spatial and spectral cues to be combined in a joint and principled way, and because they are natural extensions and generalizations of many single-channel NMF-based methods to the multichannel case. The chapter introduces the spectral (NMF-based) and spatial models, as well as the way to combine them within the LGM framework. Model estimation criteria and algorithms are also described, with some of them covered in greater detail.
Document type: Book sections

Cited literature: 45 references
Contributor: Emmanuel Vincent
Submitted on: Friday, January 12, 2018 - 11:44:10 PM
Last modification on: Monday, July 25, 2022 - 3:44:13 AM
Long-term archiving on: Monday, May 7, 2018 - 7:34:17 PM




HAL Id: hal-01631187, version 2


Alexey Ozerov, Cédric Févotte, Emmanuel Vincent. An introduction to multichannel NMF for audio source separation. Audio Source Separation, Springer, 2018, Signals and Communication Technology. ⟨hal-01631187v2⟩


