Rectified binaural ratio: A complex T-distributed feature for robust sound localization
Conference paper. Year: 2016


Abstract

Most existing methods in binaural sound source localization rely on some kind of aggregation of phase- and level-difference cues in the time-frequency plane. While different aggregation schemes exist, they are often heuristic and suffer in adverse noise conditions. In this paper, we introduce the rectified binaural ratio as a new feature for sound source localization. We show that for Gaussian-process point source signals corrupted by stationary Gaussian noise, this ratio follows a complex t-distribution with explicit parameters. This new formulation provides a principled and statistically sound way to aggregate binaural features in the presence of noise. We subsequently derive two simple and efficient methods for robust relative transfer function and time-delay estimation. Experiments on heavily corrupted simulated and speech signals demonstrate the robustness of the proposed scheme.
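
For readers unfamiliar with the phase- and level-difference cues mentioned in the abstract, the following is a minimal sketch of how such cues are conventionally computed per time-frequency bin from a two-channel recording. It shows the plain (unrectified) ratio of the two channels' short-time Fourier transforms, not the rectified binaural ratio that the paper introduces, whose definition is not given on this page. The function names `stft` and `interaural_cues`, the parameters `frame_len`, `hop`, and `eps`, and the choice of a Hann window are all assumptions made for illustration.

```python
import numpy as np

def stft(x, frame_len=256, hop=128):
    """Hann-windowed short-time Fourier transform, shape (frames, bins).

    Hypothetical helper for illustration; frame_len and hop are assumed values.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack([x[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def interaural_cues(left, right, eps=1e-12):
    """Return per-bin level difference (dB) and phase difference (radians).

    This is the conventional, unrectified ratio right/left; it is NOT the
    paper's rectified binaural ratio.
    """
    L, R = stft(left), stft(right)
    ratio = R / (L + eps)                        # eps guards against division by zero
    ild_db = 20 * np.log10(np.abs(ratio) + eps)  # interaural level difference
    ipd_rad = np.angle(ratio)                    # interaural phase difference
    return ild_db, ipd_rad
```

In bins dominated by noise, both cues computed this way become unreliable, which is the difficulty the abstract attributes to heuristic aggregation schemes.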
Origin: Files produced by the author(s)

Dates and versions

hal-01372337, version 1 (27-09-2016)

Cite

Antoine Deleforge, Florence Forbes. Rectified binaural ratio: A complex T-distributed feature for robust sound localization. European Signal Processing Conference, Aug 2016, Budapest, Hungary. pp.1257-1261, ⟨10.1109/EUSIPCO.2016.7760450⟩. ⟨hal-01372337⟩