A Latently Constrained Mixture Model for Audio Source Separation and Localization

Antoine Deleforge; Radu Horaud

doi:10.1007/978-3-642-28551-6_46

Communication Dans Un Congrès Année : 2012

A Latently Constrained Mixture Model for Audio Source Separation and Localization

(1) , (1)

Antoine Deleforge

Fonction : Auteur
PersonId : 10056
IdHAL : antoine-deleforge
ORCID : 0000-0003-0339-7472
IdRef : 184451205

Interpretation and Modelling of Images and Videos

Radu Horaud

Fonction : Auteur correspondant
PersonId : 16183
IdHAL : radu-horaud
ORCID : 0000-0001-5232-024X
IdRef : 032302495

Connectez-vous pour contacter l'auteur

Interpretation and Modelling of Images and Videos

Résumé

We present a method for audio source separation and localization from binaural recordings. The method combines a new generative probabilistic model with time-frequency masking. We suggest that device-dependent relationships between point-source positions and interaural spectral cues may be learnt in order to constrain a mixture model. This allows to capture subtle separation and localization features embedded in the auditory data. We illustrate our method with data composed of two and three mixed speech signals in the presence of reverberations. Using standard evaluation metrics, we compare our method with a recent binaural-based source separation-localization algorithm.

Mots clés

sound-source separation mixture models expectation maximization binaural hearing head-related transfer function

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV] Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP]

Fichier principal

LVA12_submission_revised.pdf (181.93 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Perception team : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00768660

Soumis le : samedi 22 décembre 2012-18:08:06

Dernière modification le : jeudi 4 avril 2024-21:20:04

Archivage à long terme le : samedi 23 mars 2013-03:47:36

Dates et versions

hal-00768660 , version 1 (22-12-2012)

Identifiants

HAL Id : hal-00768660 , version 1
DOI : 10.1007/978-3-642-28551-6_46

Citer

Antoine Deleforge, Radu Horaud. A Latently Constrained Mixture Model for Audio Source Separation and Localization. 10th International Conference on Latent Variable Analysis and Signal Separation, Mar 2012, Tel Aviv, Israel. pp.372--379, ⟨10.1007/978-3-642-28551-6_46⟩. ⟨hal-00768660⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 UGA CNRS INRIA IRISA LJK LJK_GI LJK_GI_PERCEPTION INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

417 Consultations

190 Téléchargements

A Latently Constrained Mixture Model for Audio Source Separation and Localization

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager