A general framework for online audio source separation

Laurent S. R. Simon 1 Emmanuel Vincent 1
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : We consider the problem of online audio source separation. Existing algorithms adopt either a sliding block approach or a stochastic gradient approach, which is faster but less accurate. Also, they rely either on spatial cues or on spectral cues and cannot separate certain mixtures. In this paper, we design a general online audio source separation framework that combines both approaches and both types of cues. The model parameters are estimated in the Maximum Likelihood (ML) sense using a Generalised Expectation Maximisation (GEM) algorithm with multiplicative updates. The separation performance is evaluated as a function of the block size and the step size and compared to that of an offline algorithm.
Document type :
Conference papers
Complete list of metadatas

Cited literature [12 references]  Display  Hide  Download

https://hal.inria.fr/hal-00655398
Contributor : Laurent S. R. Simon <>
Submitted on : Wednesday, December 28, 2011 - 11:59:09 AM
Last modification on : Thursday, March 21, 2019 - 2:20:42 PM
Long-term archiving on : Thursday, March 29, 2012 - 2:20:33 AM

Files

LVA2012.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00655398, version 1
  • ARXIV : 1112.6178

Citation

Laurent S. R. Simon, Emmanuel Vincent. A general framework for online audio source separation. International conference on Latent Variable Analysis and Signal Separation, Mar 2012, Tel-Aviv, Israel. ⟨hal-00655398⟩

Share

Metrics

Record views

565

Files downloads

532