Skip to Main content Skip to Navigation
Journal articles

Tracking Multiple Audio Sources with the Von Mises Distribution and Variational EM

Yutong Ban 1 Xavier Alameda-Pineda 1 Christine Evers 2 Radu Horaud 1
1 PERCEPTION - Interpretation and Modelling of Images and Videos
Inria Grenoble - Rhône-Alpes, Grenoble INP - Institut polytechnique de Grenoble - Grenoble Institute of Technology, LJK - Laboratoire Jean Kuntzmann
Abstract : In this paper we address the problem of simultaneously tracking several moving audio sources, namely the problem of estimating source trajectories from a sequence of observed features. We propose to use the von Mises distribution to model audio-source directions of arrival with circular random variables. This leads to a Bayesian filtering formulation which is intractable because of the combinatorial explosion of associating observed variables with latent variables, over time. We propose a variational approximation of the filtering distribution. We infer a variational expectation-maximization algorithm that is both computationally tractable and time efficient. We propose an audio-source birth method that favors smooth source trajectories and which is used both to initialize the number of active sources and to detect new sources. We perform experiments with the recently released LOCATA dataset comprising two moving sources and a moving microphone array mounted onto a robot.
Complete list of metadata

Cited literature [22 references]  Display  Hide  Download
Contributor : Team Perception Connect in order to contact the contributor
Submitted on : Thursday, March 21, 2019 - 10:00:40 AM
Last modification on : Wednesday, November 3, 2021 - 7:49:30 AM
Long-term archiving on: : Saturday, June 22, 2019 - 1:08:37 PM


Files produced by the author(s)




Yutong Ban, Xavier Alameda-Pineda, Christine Evers, Radu Horaud. Tracking Multiple Audio Sources with the Von Mises Distribution and Variational EM. IEEE Signal Processing Letters, Institute of Electrical and Electronics Engineers, 2019, 26 (6), pp.798 - 802. ⟨10.1109/LSP.2019.2908376⟩. ⟨hal-01969050⟩



Record views


Files downloads