inria-00514853, version 1
A Spatio-Temporal Descriptor Based on 3D-Gradients
Alexander Klaser b, 1Marcin Marszałek a, 2Cordelia Schmid
b, 1
British Machine Vision Conference (2008)
Abstract: In this work, we present a novel local descriptor for video sequences. The proposed descriptor is based on histograms of oriented 3D spatio-temporal gradients. Our contribution is four-fold. (i) To compute 3D gradients for arbitrary scales, we develop a memory-efficient algorithm based on integral videos. (ii) We propose a generic 3D orientation quantization which is based on regular polyhedrons. (iii) We perform an in-depth evaluation of all descriptor parameters and optimize them for action recognition. (iv) We apply our descriptor to various action datasets (KTH, Weizmann, Hollywood) and show that we outperform the state-of-the-art.
- a – University of Oxford
- b – INRIA
- 1: LEAR (INRIA Grenoble Rhône-Alpes / LJK Laboratoire Jean Kuntzmann)
- CNRS : FR71 – CNRS : UMR5527 – INRIA – Laboratoire Jean Kuntzmann – Université Joseph Fourier - Grenoble I – Institut National Polytechnique de Grenoble (INPG)
- 2: Visual Geometry Group (VGG)
- University of Oxford
- Collaboration : Grid'5000
- Domain : Computer Science/Computer Vision and Pattern Recognition
- inria-00514853, version 1
- http://hal.inria.fr/inria-00514853
- oai:hal.inria.fr:inria-00514853
- From: Alexander Klaser
- Submitted on: Friday, 3 September 2010 14:15:04
- Updated on: Monday, 23 April 2012 16:18:15







Associated documents

See also
Export