Temporal Localization of Actions with Actoms

Adrien Gaidon 1, 2, * Zaid Harchaoui 1 Cordelia Schmid 1
* Auteur correspondant
1 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : We address the problem of localizing actions, such as opening a door, in hours of challenging video data. We propose a model based on a sequence of atomic action units, termed "actoms", that are semantically meaningful and characteristic for the action. Our Actom Sequence Model (ASM) represents an action as a sequence of histograms of actom-anchored visual features, which can be seen as a temporally structured extension of the bag-of-features. Training requires the annotation of actoms for action examples. At test time, actoms are localized automatically based on a non-parametric model of the distribution of actoms, which also acts as a prior on an action's temporal structure. We present experimental results on two recent benchmarks for action localization "Coffee and Cigarettes" and the "DLSBP" dataset. We also adapt our approach to a classification-by-localization set-up, and demonstrate its applicability on the challenging "Hollywood 2" dataset. We show that our ASM method outperforms the current state of the art in temporal action localization, as well as baselines that localize actions with a sliding window method.
Type de document :
Article dans une revue
IEEE Transactions on Pattern Analysis and Machine Intelligence, Institute of Electrical and Electronics Engineers, 2013, 35 (11), pp.2782-2795. 〈10.1109/TPAMI.2013.65〉
Liste complète des métadonnées


https://hal.inria.fr/hal-00804627
Contributeur : Thoth Team <>
Soumis le : mardi 26 mars 2013 - 08:21:16
Dernière modification le : mercredi 11 avril 2018 - 01:58:30
Document(s) archivé(s) le : jeudi 27 juin 2013 - 04:00:18

Fichiers

ASM_TPAMI_Gaidon.pdf
Fichiers éditeurs autorisés sur une archive ouverte

Identifiants

Collections

Citation

Adrien Gaidon, Zaid Harchaoui, Cordelia Schmid. Temporal Localization of Actions with Actoms. IEEE Transactions on Pattern Analysis and Machine Intelligence, Institute of Electrical and Electronics Engineers, 2013, 35 (11), pp.2782-2795. 〈10.1109/TPAMI.2013.65〉. 〈hal-00804627〉

Partager

Métriques

Consultations de la notice

1047

Téléchargements de fichiers

2320