Student's t Source and Mixing Models for Multichannel Audio Source Separation

This paper presents a Bayesian framework for under-determined audio source separation in multichannel reverberant mixtures. We model the source signals as Student's t latent random variables in a time-frequency domain. The specific structure of musical signals in this domain is exploited by means of a non-negative matrix factorization model. In contrast, we design the mixing model in the time domain. In addition to leading to an exact representation of the convolutive mixing process, this approach allows us to develop simple probabilistic priors for the mixing filters. Indeed, since those filters correspond to room responses, they exhibit a simple characteristic structure in the time domain that can be used to guide their estimation. We also rely on the Student's t distribution for modeling the impulse response of the mixing filters. From this model, we develop a variational inference algorithm to perform source separation. The experimental evaluation demonstrates the potential of this approach for separating multichannel reverberant mixtures.
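As a rough illustration of the time-domain convolutive mixing mentioned in the abstract, the sketch below mixes three sources into two channels (an under-determined case) using x_i(t) = sum_j (a_ij * s_j)(t). It is not the authors' implementation: the function names, the exponentially decaying filter envelope, the degrees-of-freedom value, and the Gaussian stand-in sources are all assumptions made for this example only.

```python
import numpy as np


def make_filters(n_channels, n_sources, length, decay, df, rng):
    """Draw hypothetical mixing filters (assumption, not the paper's prior):
    Student's t noise shaped by an exponentially decaying envelope."""
    envelope = np.exp(-decay * np.arange(length))
    noise = rng.standard_t(df, size=(n_channels, n_sources, length))
    return noise * envelope


def convolutive_mix(sources, filters):
    """Time-domain mixing: x_i(t) = sum_j (a_ij * s_j)(t), full linear convolution."""
    n_channels, n_sources, length = filters.shape
    n_samples = sources.shape[1]
    mixture = np.zeros((n_channels, n_samples + length - 1))
    for i in range(n_channels):
        for j in range(n_sources):
            mixture[i] += np.convolve(filters[i, j], sources[j])
    return mixture


rng = np.random.default_rng(0)
# Stand-in source signals (white Gaussian noise), 3 sources of 1000 samples.
sources = rng.standard_normal((3, 1000))
# 2 channels, 3 sources: fewer observations than sources (under-determined).
filters = make_filters(2, 3, 64, decay=0.1, df=3.0, rng=rng)
mixture = convolutive_mix(sources, filters)
```

Because the mixing is written as an explicit linear convolution, no short-filter approximation in the time-frequency domain is involved; separating the sources from `mixture` is the inference problem the paper addresses and is not attempted here.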

Keywords

variational inference; statistical room acoustics; Student's t distribution; Audio source separation; multichannel reverberant mixtures; non-negative matrix factorization

Domains

Signal and Image Processing [eess.SP]

Main file

FinalManuscript.pdf (806.27 KB)

Origin: Files produced by the author(s)

Contributor: Simon Leglaive

https://inria.hal.science/hal-01584755

Submitted on: Saturday, March 3, 2018, 19:00:07

Last modified on: Monday, October 9, 2023, 12:49:42

Dates and versions

hal-01584755, version 1 (09-09-2017)

hal-01584755, version 2 (03-03-2018)

Identifiers

HAL Id: hal-01584755, version 2

Cite

Simon Leglaive, Roland Badeau, Gael Richard. Student's t Source and Mixing Models for Multichannel Audio Source Separation. IEEE/ACM Transactions on Audio, Speech and Language Processing, 2018, 26 (6), pp.1150-1164. ⟨hal-01584755v2⟩

Collections

INSTITUT-TELECOM PARISTECH LTCI IDS S2A ANR
