The CAVA corpus: synchronised stereoscopic and binaural datasets with head movements

Elise Arnaud 1 Heidi Christensen 2 Yan-Chen Lu 2 Jon Barker 2 Vasil Khalidov 3 Miles Hansard 1 Bertrand Holveck 1 Herve Mathieu 1 Ramya Narasimha 1 Elise Taillant 1 Florence Forbes 3 Radu Horaud 1
1 PERCEPTION - Interpretation and Modelling of Images and Videos
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
3 MISTIS - Modelling and Inference of Complex and Structured Stochastic Systems
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : This paper describes the acquisition and content of a new multi-modal database. Some tools for making use of the data streams are also presented. The Computational Audio- Visual Analysis (CAVA) database is a unique collection of three synchronised data streams obtained from a binaural microphone pair, a stereoscopic camera pair and a head tracking device. All recordings are made from the perspective of a person; i.e. what would a human with natural head movements see and hear in a given environment. The database is intended to facilitate research into humans' ability to optimise their multi-modal sensory input and fills a gap by providing data that enables human centred audiovisual scene analysis. It also enables 3D localisation using either audio, visual, or audio-visual cues. A total of 50 sessions, with varying degrees of visual and auditory complexity, were recorded. These range from seeing and hearing a single speaker moving in and out of field of view, to moving around a 'cocktail party' style situation, mingling and joining different small groups of people chatting.
Type de document :
Communication dans un congrès
ICMI 2008 - ACM/IEEE International Conference on Multimodal Interfaces, Oct 2008, Chania, Greece. ACM, pp.109-116, 2008, 〈10.1145/1452392.1452414〉
Liste complète des métadonnées


https://hal.inria.fr/inria-00373173
Contributeur : Elise Arnaud <>
Soumis le : vendredi 3 avril 2009 - 15:06:51
Dernière modification le : mercredi 11 avril 2018 - 01:58:39
Document(s) archivé(s) le : vendredi 12 octobre 2012 - 16:11:01

Fichiers

icmi08-cava.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Citation

Elise Arnaud, Heidi Christensen, Yan-Chen Lu, Jon Barker, Vasil Khalidov, et al.. The CAVA corpus: synchronised stereoscopic and binaural datasets with head movements. ICMI 2008 - ACM/IEEE International Conference on Multimodal Interfaces, Oct 2008, Chania, Greece. ACM, pp.109-116, 2008, 〈10.1145/1452392.1452414〉. 〈inria-00373173〉

Partager

Métriques

Consultations de la notice

590

Téléchargements de fichiers

256