Reverberant Sound Localization with a Robot Head Based on Direct-Path Relative Transfer Function

Xiaofei Li 1 Laurent Girin 1, 2 Fabien Badeig 1 Radu Horaud 1
1 PERCEPTION - Interpretation and Modelling of Images and Videos
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
2 GIPSA-CRISSP - CRISSP
GIPSA-DPC - Département Parole et Cognition
Abstract : This paper addresses the problem of sound-source localization (SSL) with a robot head, which remains a challenge in real-world environments. In particular we are interested in locating speech sources, as they are of high interest for human-robot interaction. The microphone-pair response corresponding to the direct-path sound propagation is a function of the source direction. In practice, this response is contaminated by noise and reverberations. The direct-path relative transfer function (DP-RTF) is defined as the ratio between the direct-path acoustic transfer function (ATF) of the two microphones, and it is an important feature for SSL. We propose a method to estimate the DP-RTF from noisy and reverberant signals in the short-time Fourier transform (STFT) domain. First, the convolutive transfer function (CTF) approximation is adopted to accurately represent the impulse response of the microphone array, and the first coefficient of the CTF is mainly composed of the direct-path ATF. At each frequency, the frame-wise speech auto-and cross-power spectral density (PSD) are obtained by spectral subtraction. Then a set of linear equations is constructed by the speech auto-and cross-PSD of multiple frames, in which the DP-RTF is an unknown variable, and is estimated by solving the equations. Finally, the estimated DP-RTFs are concatenated across frequencies and used as a feature vector for SSL. Experiments with a robot, placed in various reverberant environments, show that the proposed method outperforms two state-of-the-art methods.
Type de document :
Communication dans un congrès
IEEE/RSJ International Conference on Intelligent Robots and Systems, Oct 2016, Daejeon, South Korea. IEEE, pp.2819-2826, 2016, 〈http://www.iros2016.org/〉. 〈10.1109/IROS.2016.7759437〉
Liste complète des métadonnées

Littérature citée [33 références]  Voir  Masquer  Télécharger


https://hal.inria.fr/hal-01349771
Contributeur : Team Perception <>
Soumis le : jeudi 28 juillet 2016 - 16:23:59
Dernière modification le : mercredi 11 avril 2018 - 01:59:32
Document(s) archivé(s) le : samedi 29 octobre 2016 - 11:38:52

Fichiers

xiaofei_IROS.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Citation

Xiaofei Li, Laurent Girin, Fabien Badeig, Radu Horaud. Reverberant Sound Localization with a Robot Head Based on Direct-Path Relative Transfer Function. IEEE/RSJ International Conference on Intelligent Robots and Systems, Oct 2016, Daejeon, South Korea. IEEE, pp.2819-2826, 2016, 〈http://www.iros2016.org/〉. 〈10.1109/IROS.2016.7759437〉. 〈hal-01349771〉

Partager

Métriques

Consultations de la notice

872

Téléchargements de fichiers

265