Statistics of Pairwise Co-occurring Local Spatio-Temporal Features for Human Action Recognition

Piotr Bilinski 1 Francois Bremond 1
1 STARS - Spatio-Temporal Activity Recognition Systems
CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : The bag-of-words approach with local spatio-temporal features have become a popular video representation for action recognition in videos. Together these techniques have demonstrated high recognition results for a number of action classes. Recent approaches have typically focused on capturing global statistics of features. However, existing methods ignore relations between features and thus may not be discriminative enough. Therefore, we propose a novel feature representation which captures statistics of pairwise co-occurring local spatio-temporal features. Our representation captures not only global distribution of features but also focuses on geometric and appearance (both visual and motion) relations among the features. Calculating a set of bag-of-words representations with different geometrical arrangement among the features, we keep an important association between appearance and geometric information. Using two benchmark datasets for human action recognition, we demonstrate that our representation enhances the discriminative power of features and improves action recognition performance.
Document type :
Conference papers
Andrea Fusiello and Vittorio Murino and Rita Cucchiara. 4th International Workshop on Video Event Categorization, Tagging and Retrieval (VECTaR), in conjunction with 12th European Conference on Computer Vision (ECCV), Oct 2012, Florence, Italy. Springer, 7583, pp.311-320, 2012, Lecture Notes in Computer Science; Computer Vision - ECCV 2012. Workshops and Demonstrations - part I. <10.1007/978-3-642-33863-2_31>
Liste complète des métadonnées


https://hal.inria.fr/hal-00760963
Contributor : Piotr Bilinski <>
Submitted on : Tuesday, December 4, 2012 - 4:19:45 PM
Last modification on : Wednesday, December 14, 2016 - 1:07:15 AM
Document(s) archivé(s) le : Wednesday, March 6, 2013 - 4:50:34 PM

File

Statistics_of_Pairwise_Co-occu...
Files produced by the author(s)

Identifiers

Collections

Citation

Piotr Bilinski, Francois Bremond. Statistics of Pairwise Co-occurring Local Spatio-Temporal Features for Human Action Recognition. Andrea Fusiello and Vittorio Murino and Rita Cucchiara. 4th International Workshop on Video Event Categorization, Tagging and Retrieval (VECTaR), in conjunction with 12th European Conference on Computer Vision (ECCV), Oct 2012, Florence, Italy. Springer, 7583, pp.311-320, 2012, Lecture Notes in Computer Science; Computer Vision - ECCV 2012. Workshops and Demonstrations - part I. <10.1007/978-3-642-33863-2_31>. <hal-00760963>

Share

Metrics

Record views

259

Document downloads

267