Skip to Main content Skip to Navigation
Other publications

Audio source separation using hierarchical phase-invariant models

Emmanuel Vincent 1
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : Audio source separation consists of analyzing a given audio recording so as to estimate the signal produced by each sound source for listening or information retrieval purposes. In the last five years, algorithms based on hierarchical phase-invariant models such as single or multichannel hidden Markov models (HMMs) or nonnegative matrix factorization (NMF) have become popular. In this paper, we provide an overview of these models and discuss their advantages compared to established algorithms such as nongaussianity-based frequency-domain independent component analysis (FDICA) and sparse component analysis (SCA) for the separation of complex mixtures involving many sources or reverberation.We argue how hierarchical phase-invariant modeling could form the basis of future modular source separation systems.
Complete list of metadata

Cited literature [19 references]  Display  Hide  Download
Contributor : Emmanuel Vincent Connect in order to contact the contributor
Submitted on : Tuesday, December 7, 2010 - 2:17:52 PM
Last modification on : Thursday, January 20, 2022 - 4:18:37 PM
Long-term archiving on: : Tuesday, March 8, 2011 - 4:22:21 AM


Publisher files allowed on an open archive


  • HAL Id : inria-00544170, version 1


Emmanuel Vincent. Audio source separation using hierarchical phase-invariant models. 2009, pp.12--16. ⟨inria-00544170⟩



Les métriques sont temporairement indisponibles