Audio source separation using hierarchical phase-invariant models

Emmanuel Vincent 1
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : Audio source separation consists of analyzing a given audio recording so as to estimate the signal produced by each sound source for listening or information retrieval purposes. In the last five years, algorithms based on hierarchical phase-invariant models such as single or multichannel hidden Markov models (HMMs) or nonnegative matrix factorization (NMF) have become popular. In this paper, we provide an overview of these models and discuss their advantages compared to established algorithms such as nongaussianity-based frequency-domain independent component analysis (FDICA) and sparse component analysis (SCA) for the separation of complex mixtures involving many sources or reverberation.We argue how hierarchical phase-invariant modeling could form the basis of future modular source separation systems.
Liste complète des métadonnées

Cited literature [19 references]  Display  Hide  Download

https://hal.inria.fr/inria-00544170
Contributor : Emmanuel Vincent <>
Submitted on : Tuesday, December 7, 2010 - 2:17:52 PM
Last modification on : Thursday, March 21, 2019 - 2:20:42 PM
Document(s) archivé(s) le : Tuesday, March 8, 2011 - 4:22:21 AM

File

vincent_NOLISP09.pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id : inria-00544170, version 1

Citation

Emmanuel Vincent. Audio source separation using hierarchical phase-invariant models. 2009, pp.12--16. ⟨inria-00544170⟩

Share

Metrics

Record views

428

Files downloads

354