28967 articles – 22394 references  [version française]

inria-00439769, version 1

Evaluation of local spatio-temporal features for action recognition

Heng Wang 12, Muhammad Muneeb Ullah () a3, Alexander Klaser () b1, Ivan Laptev () a3, Cordelia Schmid () a1

BMVC 2009 - British Machine Vision Conference (2009)

Abstract: Local space-time features have recently become a popular video representation for action recognition. Several methods for feature localization and description have been proposed in the literature and promising recognition results were demonstrated for a number of action classes. The comparison of existing methods, however, is often limited given the different experimental settings used. The purpose of this paper is to evaluate and compare previously proposed space-time features in a common experimental setup. In particular, we consider four different feature detectors and six local feature descriptors and use a standard bag-of-features SVM approach for action recognition. We investigate the performance of these methods on a total of 25 action classes distributed over three datasets with varying difficulty. Among interesting conclusions, we demonstrate that regular sampling of space-time features consistently outperforms all tested space-time interest point detectors for human actions in realistic settings. We also demonstrate a consistent ranking for the majority of methods over different datasets and discuss their advantages and limitations.

  • Icone de 01_interest_points_kth.png
  • a –  INRIA
  • b –  CNRS
  • 1:  LEAR (INRIA Grenoble Rhône-Alpes / LJK Laboratoire Jean Kuntzmann)
  • CNRS : UMR5527 – INRIA – Laboratoire Jean Kuntzmann – Université Joseph Fourier - Grenoble I – Institut National Polytechnique de Grenoble (INPG)
  • 2:  Pattern (LIAMA)
  • Centre de coopération internationale en recherche agronomique pour le développement [CIRAD] – CNRS – Institut national de la recherche agronomique (INRA) – INRIA – Chinese Academy of Science (CAS) – Institute of Automation, Chinese Academy of Sciences
  • 3:  VISTAS (INRIA - IRISA)
  • INRIA – Institut National des Sciences Appliquées (INSA) - Rennes – CNRS : UMR6074 – Université de Rennes 1 – École normale supérieure de Cachan - ENS Cachan
  • Domain : Computer Science/Computer Vision and Pattern Recognition
  • Keywords : descriptor – evaluation – action – recognition – classification – SVM – BOW – BOF – 3D – videos – KTH – Hollywood2 – Hollywood – UCF sports
 
  • inria-00439769, version 1
  • oai:hal.inria.fr:inria-00439769
  • From: 
  • Submitted on: Friday, 8 April 2011 14:18:13
  • Updated on: Tuesday, 10 April 2012 11:29:25