A Robust Method to Count and Locate Audio Sources in a Stereophonic Linear Anechoic Mixture

Simon Arberet; Rémi Gribonval; Frédéric Bimbot

doi:10.1109/ICASSP.2007.366787

Conference paper, Year: 2007

Simon Arberet

Role: Author

Speech and sound data modeling and processing

Rémi Gribonval

Role: Author
PersonId : 1255
IdHAL : remi-gribonval
ORCID : 0000-0002-9450-8125
IdRef : 113181590

Speech and sound data modeling and processing

Frédéric Bimbot

Role: Author
PersonId : 830967

Speech and sound data modeling and processing

Abstract

We propose a new method, called DEMIX Anechoic, to estimate the mixing conditions, i.e. the number of audio sources and the attenuation and time delay of each source, in an underdetermined anechoic mixture. The method relies on the assumption that, in the neighborhood of some time-frequency points, only one source contributes to the mixture. Such time-frequency points, located with a local confidence measure, provide estimates of the attenuation of the corresponding source, as well as of its phase difference at some frequency. The time delay parameters are estimated, by a method similar to GCC-PHAT, on points having close attenuations. As opposed to DUET-like methods, our method can estimate time delays larger than one sample. Experiments show that DEMIX Anechoic correctly estimates the number of directions in more than 65% of the cases for mixtures of up to 6 sources, and outperforms DUET in estimation accuracy by a factor of 10.
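
The delay-estimation idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' DEMIX implementation: it assumes the usual stereo anechoic model, in which a single source reaches the second channel as X2(f) = a · exp(-2iπ f δ) · X1(f), with attenuation a and delay δ in samples, and it scores candidate delays with a PHAT-style (phase-only) sum over frequencies. The function names, the candidate grid, and the synthetic data are all hypothetical choices made for illustration.

```python
import cmath
import math


def estimate_delay(ratios, freqs, candidates):
    """Pick the candidate delay (in samples) with the highest PHAT-style score.

    ratios[k] is the complex channel ratio X2/X1 at normalized frequency
    freqs[k] (cycles per sample), assumed to come from time-frequency
    points where a single source dominates.
    """
    def score(delay):
        total = 0j
        for r, f in zip(ratios, freqs):
            if r == 0:
                continue  # no phase information at this point
            # PHAT weighting: discard the magnitude, keep only the phase,
            # then compensate the phase a delay of `delay` samples would cause.
            total += (r / abs(r)) * cmath.exp(2j * math.pi * f * delay)
        return abs(total)

    return max(candidates, key=score)


def estimate_attenuation(ratios):
    """Average magnitude of the channel ratio over the selected points."""
    return sum(abs(r) for r in ratios) / len(ratios)


# Synthetic single-source data (assumed values, not from the paper).
true_a, true_delay = 0.8, 3.0
freqs = [k / 64 for k in range(1, 32)]
ratios = [true_a * cmath.exp(-2j * math.pi * f * true_delay) for f in freqs]

# Hypothetical search grid: -10 to 10 samples in steps of 0.25.
candidates = [d / 4 for d in range(-40, 41)]

est_delay = estimate_delay(ratios, freqs, candidates)
est_a = estimate_attenuation(ratios)
print(est_delay, round(est_a, 6))
```

Because the score combines phase differences over many frequencies instead of reading the delay from a single phase value, a delay of several samples does not run into phase wrapping in this toy setting, which is the limitation of DUET-like methods that the abstract points to.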

Domains

Signal and Image Processing [eess.SP]

Main file

2007_ICASSP_arberet.pdf (221.65 KB)

Origin: Files produced by the author(s)

Contributor: Rémi Gribonval

https://inria.hal.science/inria-00544778

Submitted on: Monday, February 7, 2011, 10:38:08

Last modified on: Friday, March 24, 2023, 14:52:53

Long-term archiving on: Sunday, May 8, 2011, 02:35:50

Dates and versions

inria-00544778 , version 1 (07-02-2011)

Identifiers

HAL Id : inria-00544778 , version 1
DOI : 10.1109/ICASSP.2007.366787

Cite

Simon Arberet, Rémi Gribonval, Frédéric Bimbot. A Robust Method to Count and Locate Audio Sources in a Stereophonic Linear Anechoic Mixture. Proc. IEEE Intl. Conf. Acoust. Speech Signal Process (ICASSP'07), Apr 2007, Honolulu, Hawaii, United States. pp.III-745 - III-748, ⟨10.1109/ICASSP.2007.366787⟩. ⟨inria-00544778⟩

Collections

EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA IRISA-D5 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE UR1-MATH-NUM

116 Views

172 Downloads
