Audio Source Separation With a Single Sensor

Laurent Benaroya 1 Frédéric Bimbot 1 Rémi Gribonval 1
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : In this work we present a method to perform a complete audiovisual source separation without need of previous information. This method is based on the assumption that sounds are caused by moving structures. Thus, an efficient representation of audio and video sequences allows to build relationships between synchronous structures on both modalities. A robust clustering algorithm groups video structures exhibiting strong correlations with the audio so that sources are counted and located in the image. Using such information and exploiting audio-video correlation, the audio sources activity is determined. Next, spectral Gaussian Mixture Models (GMMs) are learnt in time slots with only one source active so that it is possible to separate them in case of an audio mixture. Audio source separation performances are rigorously evaluated, clearly showing that the proposed algorithm performs efficiently and robustly.
Mots-clés : source separation
Complete list of metadatas

Cited literature [18 references]  Display  Hide  Download

https://hal.inria.fr/inria-00544949
Contributor : Rémi Gribonval <>
Submitted on : Saturday, December 11, 2010 - 5:27:48 PM
Last modification on : Friday, November 16, 2018 - 1:23:42 AM

File

2006_IEEE_TSALP_BenaroyaBimbot...
Publisher files allowed on an open archive

Identifiers

Citation

Laurent Benaroya, Frédéric Bimbot, Rémi Gribonval. Audio Source Separation With a Single Sensor. IEEE Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2006, 14 (1), pp.191--199. ⟨10.1109/TSA.2005.854110⟩. ⟨inria-00544949⟩

Share

Metrics

Record views

636

Files downloads

1099