Student's t Source and Mixing Models for Multichannel Audio Source Separation

Abstract : This paper presents a Bayesian framework for under-determined audio source separation in multichannel reverberant mixtures. We model the source signals as Student's t latent random variables in a time-frequency domain, and we exploit the specific structure of musical signals in this domain by means of a non-negative matrix factorization model. In contrast, we design the mixing model in the time domain. In addition to giving an exact representation of the convolutive mixing process, this choice allows us to develop simple probabilistic priors for the mixing filters. Because those filters correspond to room responses, they exhibit a simple characteristic structure in the time domain that can be used to guide their estimation. We also rely on the Student's t distribution to model the impulse responses of the mixing filters. From this model, we develop a variational inference algorithm to perform source separation. The experimental evaluation demonstrates the potential of this approach for separating multichannel reverberant mixtures.
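As a rough illustration of the source model described in the abstract, the sketch below draws time-frequency coefficients from a Student's t distribution whose squared scale is given by a non-negative matrix factorization. It relies on the standard representation of a Student's t variable as a Gaussian with an inverse-gamma distributed variance. The function name, the symbols W, H and nu, and the exact parametrization are assumptions made for this example; the abstract does not specify them, and this is not the authors' implementation.

```python
import numpy as np

def sample_student_t_nmf(W, H, nu, rng):
    """Draw an (F, N) array of Student's t coefficients with squared scale W @ H.

    W (F x K) and H (K x N) are non-negative factors and nu is the number of
    degrees of freedom. All names here are hypothetical, chosen for this sketch.
    """
    scale_sq = W @ H  # NMF-structured squared scale, shape (F, N)
    # Mixing variable v ~ InverseGamma(nu/2, nu/2), drawn as the reciprocal
    # of a Gamma(shape=nu/2, scale=2/nu) variable.
    v = 1.0 / rng.gamma(shape=nu / 2.0, scale=2.0 / nu, size=scale_sq.shape)
    # Conditionally on v, each coefficient is Gaussian with variance v * scale_sq.
    return rng.normal(size=scale_sq.shape) * np.sqrt(v * scale_sq)

rng = np.random.default_rng(0)
W = rng.random((6, 2))   # spectral templates (assumed sizes)
H = rng.random((2, 8))   # temporal activations (assumed sizes)
coeffs = sample_student_t_nmf(W, H, nu=3.0, rng=rng)
```

Small values of nu give heavy tails, while large values bring the samples close to a Gaussian with variance W @ H.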
Document type :
Journal articles
Cited literature [61 references]

https://hal.inria.fr/hal-01584755
Contributor : Simon Leglaive
Submitted on : Saturday, March 3, 2018 - 7:00:07 PM
Last modification on : Wednesday, July 3, 2019 - 3:02:02 PM

File

FinalManuscript.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01584755, version 2

Citation

Simon Leglaive, Roland Badeau, Gaël Richard. Student's t Source and Mixing Models for Multichannel Audio Source Separation. IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2018, 26 (6), pp.1150-1164. ⟨hal-01584755v2⟩
