Efficient Action Localization with Approximately Normalized Fisher Vectors

Dan Oneata 1, * Jakob Verbeek 1, * Cordelia Schmid 1, *
* Auteur correspondant
1 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : The Fisher vector (FV) representation is a high-dimensional extension of the popular bag-of-word representation. Transformation of the FV by power and L2 normalizations has been shown to significantly improve its performance. With these normalizations included, this representation has yielded state-of-the-art results for a wide number of image and video classification and retrieval tasks. The normalizations, however, render the representation non-additive over local descriptors. Combined with its high dimensionality, this makes the FV computationally very expensive for the purpose of localization tasks. In this paper we, first, present approximations to both these normalizations, which yield significant improvements in the memory requirements and computational costs of the FV when used for localization. Second, we show how these approximations can be used to define upper-bounds on the score function that can be efficiently evaluated, which paves the way for the use of branch-and-bound search as an alternative to exhaustive scanning window search. We present experimental evaluation results on classification and temporal localization of actions in videos. These show that the proposed approximations lead to speed-ups of at least one order of magnitude, while maintaining state-of-the-art action localization performance.
Type de document :
Communication dans un congrès
CVPR 2014 - IEEE Conference on Computer Vision & Pattern Recognition, Jun 2014, Columbus, OH, United States. IEEE, pp.2545-2552, 2014, <10.1109/CVPR.2014.326>
Liste complète des métadonnées



https://hal.inria.fr/hal-00979594
Contributeur : Thoth Team <>
Soumis le : mercredi 24 septembre 2014 - 18:40:20
Dernière modification le : samedi 18 février 2017 - 01:07:06

Fichiers

efficient_action_localization....
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Dan Oneata, Jakob Verbeek, Cordelia Schmid. Efficient Action Localization with Approximately Normalized Fisher Vectors. CVPR 2014 - IEEE Conference on Computer Vision & Pattern Recognition, Jun 2014, Columbus, OH, United States. IEEE, pp.2545-2552, 2014, <10.1109/CVPR.2014.326>. <hal-00979594v2>

Partager

Métriques

Consultations de
la notice

703

Téléchargements du document

1114