sign in
english version rss feed

inria-00514853, version 1

A Spatio-Temporal Descriptor Based on 3D-Gradients

Alexander Klaser b1, Marcin Marszałek a2, Cordelia Schmid () b1

British Machine Vision Conference (2008)

Abstract: In this work, we present a novel local descriptor for video sequences. The proposed descriptor is based on histograms of oriented 3D spatio-temporal gradients. Our contribution is four-fold. (i) To compute 3D gradients for arbitrary scales, we develop a memory-efficient algorithm based on integral videos. (ii) We propose a generic 3D orientation quantization which is based on regular polyhedrons. (iii) We perform an in-depth evaluation of all descriptor parameters and optimize them for action recognition. (iv) We apply our descriptor to various action datasets (KTH, Weizmann, Hollywood) and show that we outperform the state-of-the-art.

  • Icone de 3ddesc.png
  • Collaboration : Grid'5000
  • Domain : Computer Science/Computer Vision and Pattern Recognition
 
  • inria-00514853, version 1
  • oai:hal.inria.fr:inria-00514853
  • From: 
  • Submitted on: Friday, 3 September 2010 14:15:04
  • Updated on: Monday, 23 April 2012 16:18:15
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...