A General Flexible Framework for the Handling of Prior Information in Audio Source Separation

Alexey Ozerov; Emmanuel Vincent; Frédéric Bimbot

doi:10.1109/TASL.2011.2172425

Journal Articles IEEE Transactions on Audio, Speech and Language Processing Year : 2012

A General Flexible Framework for the Handling of Prior Information in Audio Source Separation

(1) , (1) , (1)

Alexey Ozerov

Function : Author
PersonId : 882775

Speech and sound data modeling and processing

Emmanuel Vincent

Function : Author
PersonId : 1256
IdHAL : emmanuelv
ORCID : 0000-0002-0183-7289
IdRef : 089360176

Speech and sound data modeling and processing

Frédéric Bimbot

Function : Author
PersonId : 830967

Speech and sound data modeling and processing

Abstract

Most of audio source separation methods are developed for a particular scenario characterized by the number of sources and channels and the characteristics of the sources and the mixing process. In this paper we introduce a general audio source separation framework based on a library of structured source models that enable the incorporation of prior knowledge about each source via user-specifiable constraints. While this framework generalizes several existing audio source separation methods, it also allows to imagine and implement new efficient methods that were not yet reported in the literature. We first introduce the framework by describing the model structure and constraints, explaining its generality, and summarizing its algorithmic implementation using a generalized expectation-maximization algorithm. Finally, we illustrate the above-mentioned capabilities of the framework by applying it in several new and existing configurations to different source separation problems. We have released a software tool named Flexible Audio Source Separation Toolbox (FASST) implementing a baseline version of the framework in Matlab.

Keywords

Audio source separation local Gaussian model nonnegative matrix factorization expectation-maximization

Domains

Signal and Image Processing Signal and Image processing

Fichier principal

general_ssep_journal_paper_v21.pdf (517.37 Ko)

Origin : Files produced by the author(s)

Alexey Ozerov : Connect in order to contact the contributor

https://hal.science/hal-00626962

Submitted on : Tuesday, January 5, 2016-9:56:41 AM

Last modification on : Friday, March 24, 2023-2:53:01 PM

Long-term archiving on: Thursday, April 7, 2016-3:00:03 PM

Dates and versions

hal-00626962 , version 1 (27-09-2011)

hal-00626962 , version 2 (22-06-2012)

hal-00626962 , version 3 (30-12-2015)

hal-00626962 , version 4 (05-01-2016)

Identifiers

HAL Id : hal-00626962 , version 4
DOI : 10.1109/TASL.2011.2172425

Cite

Alexey Ozerov, Emmanuel Vincent, Frédéric Bimbot. A General Flexible Framework for the Handling of Prior Information in Audio Source Separation. IEEE Transactions on Audio, Speech and Language Processing, 2012, 20 (4), pp.1118 - 1133. ⟨10.1109/TASL.2011.2172425⟩. ⟨hal-00626962v4⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE UR1-MATH-NUM

4791 View

3660 Download

A General Flexible Framework for the Handling of Prior Information in Audio Source Separation

Abstract

Keywords

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Altmetric

Share