Skip to Main content Skip to Navigation
New interface
Journal articles

Student's t Source and Mixing Models for Multichannel Audio Source Separation

Abstract : This paper presents a Bayesian framework for under-determined audio source separation in multichannel reverberant mixtures. We model the source signals as Student's t latent random variables in a time-frequency domain. The specific structure of musical signals in this domain is exploited by means of a non-negative matrix factorization model. Conversely, we design the mixing model in the time domain. In addition to leading to an exact representation of the convolutive mixing process, this approach allows us to develop simple probabilistic priors for the mixing filters. Indeed, as those filters correspond to room responses they exhibit a simple characteristic structure in the time domain that can be used to guide their estimation. We also rely on the Student's t distribution for modeling the impulse response of the mixing filters. From this model, we develop a variational inference algorithm in order to perform source separation. The experimental evaluation demonstrates the potential of this approach for separating multichannel reverberant mixtures.
Document type :
Journal articles
Complete list of metadata

Cited literature [61 references]  Display  Hide  Download
Contributor : Simon Leglaive Connect in order to contact the contributor
Submitted on : Saturday, March 3, 2018 - 7:00:07 PM
Last modification on : Tuesday, March 8, 2022 - 5:46:02 PM


Files produced by the author(s)


  • HAL Id : hal-01584755, version 2



Simon Leglaive, Roland Badeau, Gael Richard. Student's t Source and Mixing Models for Multichannel Audio Source Separation. IEEE/ACM Transactions on Audio, Speech and Language Processing, 2018, 26 (6), pp.1150-1164. ⟨hal-01584755v2⟩



Record views


Files downloads