Alignment of Binocular-Binaural Data Using a Moving Audio-Visual Target - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2013

Alignment of Binocular-Binaural Data Using a Moving Audio-Visual Target

Résumé

In this paper we address the problem of aligning visual (V) and auditory (A) data using a sensor that is composed of a camera-pair and a microphone-pair. The original contribution of the paper is a method for AV data aligning through estimation of the 3D positions of the microphones in the visual-centred coordinate frame defined by the stereo camera-pair. We exploit the fact that these two distinct data sets are conditioned by a common set of parameters, namely the (unknown) 3D trajectory of an AV object, and derive an EM-like algorithm that alternates between the estimation of the microphone-pair position and the estimation of the AV object trajectory. The proposed algorithm has a number of built-in features: it can deal with A and V observations that are misaligned in time, it estimates the reliability of the data, it is robust to outliers in both modalities, and it has proven theoretical convergence. We report experiments with both simulated and real data.
Fichier principal
Vignette du fichier
Khalidov-MMSP13.pdf (2.38 Mo) Télécharger le fichier
Vignette du fichier
Khalidov-MMSP13.jpg (148.86 Ko) Télécharger le fichier
Vignette du fichier
bestpaperaward-MMSP2013.pdf (262.26 Ko) Télécharger le fichier
poster_2013_MMSP.pdf (2.49 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Format : Figure, Image
Format : Figure, Image
Format : Autre

Dates et versions

hal-00861482 , version 1 (12-09-2013)
hal-00861482 , version 2 (04-10-2013)
hal-00861482 , version 3 (04-10-2013)

Identifiants

  • HAL Id : hal-00861482 , version 2

Citer

Vasil Khalidov, Florence Forbes, Radu Horaud. Alignment of Binocular-Binaural Data Using a Moving Audio-Visual Target. IEEE Workshop on Multimedia Signal Processing, IEEE Signal Processing Society, Sep 2013, Pula (Sardinia), Italy. ⟨hal-00861482v2⟩
487 Consultations
384 Téléchargements

Partager

Gmail Facebook X LinkedIn More