A robust method to count and locate audio sources in a multichannel underdetermined mixture

Simon Arberet 1, Rémi Gribonval 1, Frédéric Bimbot 1
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : We propose a method to count and estimate the mixing directions in an underdetermined multichannel mixture. The approach is based on the hypothesis that in the neighbourhood of some time-frequency points, only one source essentially contributes to the mixture: such time-frequency points can provide robust local estimates of the corresponding source direction. At the core of our contribution is a statistical model to exploit a local confidence measure which detects the time-frequency regions where such robust information is available. A clustering algorithm called DEMIX is proposed to merge the information from all time-frequency regions according to their confidence level. So as to estimate the delays of anechoic mixtures and overcome the intrinsic ambiguities of phase unwrapping as met with DUET, we propose a technique similar to GCC-PHAT which is able to estimate delays that can largely exceed one sample. We propose an extensive experimental study which shows that the resulting method is more robust in conditions where all DUET-like comparable methods fail, that is in particular: a) when time-delays largely exceed one sample; b) when the source directions are very close.
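For readers unfamiliar with GCC-PHAT, the following is a minimal sketch of the textbook technique that the abstract names as a point of comparison. It is not the authors' delay estimator and not part of DEMIX. The function names `dft` and `gcc_phat_delay` and the parameter `max_delay` are illustrative assumptions, and a naive O(n²) DFT is used only to keep the example dependency-free. Because the estimate comes from the peak of a phase-normalised cross-correlation rather than from an unwrapped phase, delays of several samples can be recovered without ambiguity.

```python
import cmath
import math
import random


def dft(x, inverse=False):
    """Naive discrete Fourier transform (O(n^2)); adequate for short signals."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t in range(n):
            acc += x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out


def gcc_phat_delay(x1, x2, max_delay):
    """Estimate the integer delay d (in samples) such that x2[t] ~ x1[t - d].

    Textbook GCC-PHAT: the cross-spectrum is normalised to unit magnitude,
    so only phase information contributes to the correlation peak.
    """
    n = len(x1) + len(x2)  # zero-pad so circular correlation acts as linear
    X1 = dft(list(x1) + [0.0] * (n - len(x1)))
    X2 = dft(list(x2) + [0.0] * (n - len(x2)))
    cross = []
    for a, b in zip(X1, X2):
        c = b * a.conjugate()
        mag = abs(c)
        # PHAT weighting; bins with negligible energy are discarded
        cross.append(c / mag if mag > 1e-12 else 0j)
    cc = dft(cross, inverse=True)
    # Lag d is stored at index d mod n; search only the admissible lags
    return max(range(-max_delay, max_delay + 1), key=lambda d: cc[d % n].real)


# Usage: a noise burst and a copy delayed by 5 samples (hypothetical test data)
rng = random.Random(0)
burst = [rng.uniform(-1.0, 1.0) for _ in range(27)]
channel_1 = burst + [0.0] * 5
channel_2 = [0.0] * 5 + burst
print(gcc_phat_delay(channel_1, channel_2, max_delay=10))
```

In this toy setting the second channel is an exact shifted copy of the first, so the normalised cross-correlation is a single impulse at the true lag. Real anechoic mixtures contain several sources and noise, which is the situation the abstract's confidence-weighted approach is designed for.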

https://hal.inria.fr/inria-00489529
Contributor : Rémi Gribonval
Submitted on : Thursday, January 27, 2011 - 10:03:34 PM
Last modification on : Friday, November 16, 2018 - 1:23:32 AM
Long-term archiving on : Tuesday, November 6, 2012 - 12:30:09 PM

Files produced by the author(s)

Citation

Simon Arberet, Rémi Gribonval, Frédéric Bimbot. A robust method to count and locate audio sources in a multichannel underdetermined mixture. IEEE Transactions on Signal Processing, Institute of Electrical and Electronics Engineers, 2010, 58 (1), pp. 121–133. ⟨10.1109/TSP.2009.2030854⟩. ⟨inria-00489529⟩
