Rectified binaural ratio: A complex T-distributed feature for robust sound localization
Conference paper. Year: 2016

Abstract

Most existing methods in binaural sound source localization rely on some kind of aggregation of phase- and level-difference cues in the time-frequency plane. While different aggregation schemes exist, they are often heuristic and suffer in adverse noise conditions. In this paper, we introduce the rectified binaural ratio as a new feature for sound source localization. We show that for Gaussian-process point source signals corrupted by stationary Gaussian noise, this ratio follows a complex t-distribution with explicit parameters. This new formulation provides a principled and statistically sound way to aggregate binaural features in the presence of noise. We subsequently derive two simple and efficient methods for robust relative transfer function and time-delay estimation. Experiments on heavily corrupted simulated and speech signals demonstrate the robustness of the proposed scheme.
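
As a rough illustration of why ratio-type binaural features call for heavy-tailed modelling, the sketch below simulates a single time-frequency bin: a complex Gaussian source observed at two microphones with additive Gaussian noise. It uses the plain ratio of the two observations, not the rectified binaural ratio, whose definition is not reproduced on this page. The transfer value `h`, the noise variance, and the helper `complex_normal` are hypothetical choices made only for this example.

```python
import numpy as np

rng = np.random.default_rng(0)

def complex_normal(size, variance, rng):
    """Draw circular complex Gaussian samples with the given total variance."""
    scale = np.sqrt(variance / 2.0)
    return scale * (rng.standard_normal(size) + 1j * rng.standard_normal(size))

n = 200_000
h = 0.8 * np.exp(1j * 0.6)      # hypothetical relative transfer value at one frequency
noise_var = 0.5                 # hypothetical stationary noise variance per microphone

source = complex_normal(n, 1.0, rng)
left = source + complex_normal(n, noise_var, rng)
right = h * source + complex_normal(n, noise_var, rng)

# Plain (non-rectified) binaural ratio: this is an illustrative stand-in only.
ratio = right / left

def tail_to_median(values):
    """Ratio of the 99.9th percentile to the median of the magnitudes."""
    return np.quantile(values, 0.999) / np.median(values)

# Spread of the ratio around h, compared with a Gaussian reference.
tail_ratio = tail_to_median(np.abs(ratio - h))
gauss_tail_ratio = tail_to_median(np.abs(complex_normal(n, 1.0, rng)))

print(f"ratio feature, tail/median: {tail_ratio:.1f}")
print(f"Gaussian reference, tail/median: {gauss_tail_ratio:.1f}")
```

In this toy setting the extreme deviations of the ratio are far larger, relative to its typical deviation, than those of a Gaussian variable, because the denominator occasionally falls close to zero. Averaging such features without accounting for the heavy tails lets a few bins dominate, which is the kind of sensitivity a t-distributed model is meant to address.
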
Main file
main_studentloc.pdf (256.71 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01372337, version 1 (27-09-2016)

Cite

Antoine Deleforge, Florence Forbes. Rectified binaural ratio: A complex T-distributed feature for robust sound localization. European Signal Processing Conference, Aug 2016, Budapest, Hungary. pp.1257-1261, ⟨10.1109/EUSIPCO.2016.7760450⟩. ⟨hal-01372337⟩