Will person detection help bag-of-features action recognition?

Alexander Klaser 1 Marcin Marszałek 2 Ivan Laptev 3 Cordelia Schmid 1
1 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
3 WILLOW - Models of visual object recognition and scene understanding
DI-ENS - Département d'informatique de l'École normale supérieure, Inria Paris-Rocquencourt, CNRS - Centre National de la Recherche Scientifique : UMR8548
Abstract : Bag-of-features (BoF) models currently achieve state-of-the-art performance for action recognition. Although such models do not explicitly account for people in video, combining person localization with BoF is expected to further improve action recognition. The purpose of this paper is to validate this assumption and to quantify the improvements in action recognition that can be expected from current and future person detectors. Given the locations of people in video, we find, somewhat surprisingly, that background suppression leads to only a limited gain in performance. This holds for actions in both simple and complex scenes. On the other hand, we show how the spatial locations of people make it possible to incorporate strong geometrical constraints into BoF models and, in this way, to improve the accuracy of action recognition in some cases. Our conclusions are validated with extensive experiments on three datasets of varying complexity: the basic KTH, the realistic UCF Sports, and the challenging Hollywood.
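The two ideas named in the abstract, background suppression and geometrical constraints tied to the person location, can be illustrated with a minimal sketch. The abstract does not specify the features, vocabulary, or spatial layouts used in the report, so everything below is an assumption made for illustration: the `Feature` and `Box` tuples, the `bof_histogram` function, and the choice of horizontal bands inside the person box are hypothetical and are not the authors' implementation.

```python
from collections import Counter
from typing import Dict, List, Optional, Tuple

# Hypothetical types for illustration only; not taken from the report.
Feature = Tuple[int, float, float, int]   # (frame, x, y, visual_word_id)
Box = Tuple[float, float, float, float]   # (x_min, y_min, x_max, y_max)


def bof_histogram(features: List[Feature], vocab_size: int,
                  person_boxes: Optional[Dict[int, Box]] = None,
                  rows: int = 1) -> List[float]:
    """Return an L1-normalised bag-of-features histogram.

    If person_boxes is given, features outside the box of their frame are
    dropped (background suppression). If rows > 1, the box is split into
    horizontal bands and one histogram per band is concatenated, which is
    one simple way to encode geometry relative to the person.
    """
    counts: Counter = Counter()
    for frame, x, y, word in features:
        band = 0
        if person_boxes is not None:
            box = person_boxes.get(frame)
            if box is None:
                continue  # no detection in this frame: treat as background
            x0, y0, x1, y1 = box
            if not (x0 <= x <= x1 and y0 <= y <= y1):
                continue  # feature lies outside the person box
            height = max(y1 - y0, 1e-9)  # guard against degenerate boxes
            band = min(int((y - y0) / height * rows), rows - 1)
        counts[band * vocab_size + word] += 1

    size = rows * vocab_size
    total = sum(counts.values())
    if total == 0:
        return [0.0] * size
    return [counts[i] / total for i in range(size)]


# Toy data: two features on the person, one in the background.
features = [(0, 5.0, 5.0, 1), (0, 50.0, 50.0, 2), (0, 6.0, 9.0, 1)]
boxes = {0: (0.0, 0.0, 10.0, 10.0)}

full = bof_histogram(features, vocab_size=3)
masked = bof_histogram(features, vocab_size=3, person_boxes=boxes)
banded = bof_histogram(features, vocab_size=3, person_boxes=boxes, rows=2)
```

Here `full` is the plain BoF histogram over the whole frame, `masked` keeps only features inside the person box, and `banded` additionally records in which half of the box each feature falls. Comparing a classifier trained on each representation mirrors, in spirit, the kind of comparison the abstract describes.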

https://hal.inria.fr/inria-00514828
Contributor : Alexander Klaser
Submitted on : Friday, September 3, 2010 - 12:59:47 PM
Last modification on : Thursday, February 7, 2019 - 3:49:57 PM
Long-term archiving on : Tuesday, October 23, 2012 - 3:30:40 PM

File

RR-7373.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00514828, version 1

Citation

Alexander Klaser, Marcin Marszałek, Ivan Laptev, Cordelia Schmid. Will person detection help bag-of-features action recognition? [Research Report] RR-7373, INRIA. 2010. ⟨inria-00514828⟩
