Will person detection help bag-of-features action recognition?

Alexander Klaser 1, Marcin Marszałek 2, Ivan Laptev 3, Cordelia Schmid 1
1 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
3 WILLOW - Models of visual object recognition and scene understanding
CNRS - Centre National de la Recherche Scientifique : UMR8548, Inria Paris-Rocquencourt, DI-ENS - Département d'informatique de l'École normale supérieure
Abstract: Bag-of-features (BoF) models currently achieve state-of-the-art performance for action recognition. While such models do not explicitly account for people in video, person localization combined with BoF is expected to further improve action recognition. The purpose of this paper is to validate this assumption and to quantify the improvements in action recognition that can be expected from current and future person detectors. Given the locations of people in video, we find, somewhat surprisingly, that background suppression leads to only a limited gain in performance. This holds for actions in both simple and complex scenes. On the other hand, we show how the spatial locations of people make it possible to incorporate strong geometrical constraints into BoF models and thereby improve the accuracy of action recognition in some cases. Our conclusions are validated with extensive experiments on three datasets of varying complexity: the basic KTH, the realistic UCF Sports, and the challenging Hollywood.
Document type:
[Research Report] RR-7373, INRIA. 2010

Contributor: Alexander Klaser
Submitted on: Friday, 3 September 2010 - 12:59:47
Last modified on: Thursday, 7 February 2019 - 15:49:57
Document(s) archived on: Tuesday, 23 October 2012 - 15:30:40




  • HAL Id: inria-00514828, version 1


Alexander Klaser, Marcin Marszałek, Ivan Laptev, Cordelia Schmid. Will person detection help bag-of-features action recognition? [Research Report] RR-7373, INRIA. 2010. 〈inria-00514828〉


