Audio Source Separation With a Single Sensor

Laurent Benaroya; Frédéric Bimbot; Rémi Gribonval

doi:10.1109/TSA.2005.854110

Article Dans Une Revue IEEE Transactions on Audio, Speech and Language Processing Année : 2006

Audio Source Separation With a Single Sensor

(1) , (1) , (1)

Laurent Benaroya

Fonction : Auteur
PersonId : 7037
IdHAL : elie-laurent-benaroya
IdRef : 07600953X

Speech and sound data modeling and processing

Frédéric Bimbot

Fonction : Auteur
PersonId : 830967

Speech and sound data modeling and processing

Rémi Gribonval

Fonction : Auteur
PersonId : 1255
IdHAL : remi-gribonval
ORCID : 0000-0002-9450-8125
IdRef : 113181590

Speech and sound data modeling and processing

Résumé

In this work we present a method to perform a complete audiovisual source separation without need of previous information. This method is based on the assumption that sounds are caused by moving structures. Thus, an efficient representation of audio and video sequences allows to build relationships between synchronous structures on both modalities. A robust clustering algorithm groups video structures exhibiting strong correlations with the audio so that sources are counted and located in the image. Using such information and exploiting audio-video correlation, the audio sources activity is determined. Next, spectral Gaussian Mixture Models (GMMs) are learnt in time slots with only one source active so that it is possible to separate them in case of an audio mixture. Audio source separation performances are rigorously evaluated, clearly showing that the proposed algorithm performs efficiently and robustly.

Mots clés

source separation

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP]

Fichier principal

2006_IEEE_TSALP_BenaroyaBimbotGribonval_AudioSepOneSensor.pdf (412.37 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Rémi Gribonval : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00544949

Soumis le : samedi 11 décembre 2010-17:27:48

Dernière modification le : vendredi 24 mars 2023-14:52:53

Dates et versions

inria-00544949 , version 1 (11-12-2010)

Identifiants

HAL Id : inria-00544949 , version 1
DOI : 10.1109/TSA.2005.854110

Citer

Laurent Benaroya, Frédéric Bimbot, Rémi Gribonval. Audio Source Separation With a Single Sensor. IEEE Transactions on Audio, Speech and Language Processing, 2006, 14 (1), pp.191--199. ⟨10.1109/TSA.2005.854110⟩. ⟨inria-00544949⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA IRISA-D5 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE UR1-MATH-NUM

304 Consultations

1123 Téléchargements

Audio Source Separation With a Single Sensor

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager