inria-00572249, version 1
A Tractable Framework for Estimating and Combining Spectral Source Models for Audio Source Separation
Simon Arberet
1Alexey Ozerov
a, 2Frédéric Bimbot
b, 2Rémi Gribonval
a, 2
N° RR-7556 (2011)
Résumé : The underdetermined blind audio source separation (BSS) problem is often addressed in the time-frequency (TF) domain assuming that each TF point is modeled as an independent random variable with sparse distribution. On the other hand, methods based on structured spectral model, such as the Spectral Gaussian Scale Mixture Models (Spectral-GSMMs) or Spectral Nonnegative Matrix Factorization models, perform better because they exploit the statistical diversity of audio source spectrograms, thus allowing to go beyond the simple sparsity assumption. However, in the case of discrete state-based models, such as Spectral-GSMMs, learning the models from the mixture can be computationally very expensive. One of the main problem is that using a classical Expectation-Maximization procedure often leads to an exponential complexity with respect to the number of sources. In this paper, we propose a framework with a linear complexity to learn spectral source models (including discrete state-based models) from noisy source estimates. Moreover, this framework allows combining probabilistic models of di erent nature that can be seen as a sort of probabilistic fusion. We illustrate that methods based on this framework can significantly improve the BSS performance compared to the state-of-the-art approaches.
- a – INRIA
- b – CNRS
- 1 : LTS2 - EPFL
- École Polytechnique Fédérale de Lausanne
- 2 : METISS (INRIA - IRISA)
- CNRS : UMR6074 – INRIA – INSA Rennes – Université de Rennes 1
- Collaboration : École Polytechnique Fédérale de Lausanne (EPFL)
- Domaine : Informatique/Traitement du signal et de l'image
Sciences de l'ingénieur/Traitement du signal et de l'image - Mots-clés : Blind source separation – multichannel audio – Gaussian mixture model – expectation-maximization algorithm – convolutive mixture
- Référence interne : RR-7556
- inria-00572249, version 1
- http://hal.inria.fr/inria-00572249
- oai:hal.inria.fr:inria-00572249
- Contributeur : Alexey Ozerov
- Soumis le : Mardi 1 Mars 2011, 10:13:44
- Dernière modification le : Jeudi 10 Mars 2011, 14:35:21






Documents associés
Exporter