Video Covariance Matrix Logarithm for Human Action Recognition in Videos

Piotr Bilinski; Francois Bremond

Communication Dans Un Congrès Année : 2015

Video Covariance Matrix Logarithm for Human Action Recognition in Videos

(1) , (1)

Piotr Bilinski

Fonction : Auteur
PersonId : 8327
IdHAL : piotr-bilinski
IdRef : 184465303

Spatio-Temporal Activity Recognition Systems

Francois Bremond

Fonction : Auteur
PersonId : 20805
IdHAL : francois-bremond
ORCID : 0000-0003-2988-2142
IdRef : 138919046

Spatio-Temporal Activity Recognition Systems

Résumé

In this paper, we propose a new local spatio-temporal descriptor for videos and we propose a new approach for action recognition in videos based on the introduced descriptor. The new descriptor is called the Video Covariance Matrix Logarithm (VCML). The VCML descriptor is based on a covariance matrix representation, and it models relationships between different low-level features, such as intensity and gradient. We apply the VCML descriptor to encode appearance information of local spatio-temporal video volumes, which are extracted by the Dense Trajectories. Then, we present an extensive evaluation of the proposed VCML descriptor with the Fisher vector encoding and the Support Vector Machines on four challenging action recognition datasets. We show that the VCML descriptor achieves better results than the state-of-the-art appearance descriptors. Moreover, we present that the VCML descriptor carries complementary information to the HOG descriptor and their fusion gives a significant improvement in action recognition accuracy. Finally, we show that the VCML descriptor improves action recognition accuracy in comparison to the state-of-the-art Dense Trajectories, and that the proposed approach achieves superior performance to the state-of-the-art methods.

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV]

Fichier principal

P. Bilinski and F. Bremond - Video Covariance Matrix Logarithm for Human Action Recognition in Videos - IJCAI 2015.pdf (1.25 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Piotr Bilinski : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01216849

Soumis le : samedi 17 octobre 2015-18:01:46

Dernière modification le : mercredi 15 mars 2023-08:58:09

Archivage à long terme le : jeudi 27 avril 2017-07:18:32

Dates et versions

hal-01216849 , version 1 (17-10-2015)

Identifiants

HAL Id : hal-01216849 , version 1

Citer

Piotr Bilinski, Francois Bremond. Video Covariance Matrix Logarithm for Human Action Recognition in Videos. IJCAI 2015 - 24th International Joint Conference on Artificial Intelligence (IJCAI), Jul 2015, Buenos Aires, Argentina. ⟨hal-01216849⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INRIA INRIA2 UNIV-COTEDAZUR

248 Consultations

313 Téléchargements

Video Covariance Matrix Logarithm for Human Action Recognition in Videos

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager