Conference papers

Video Covariance Matrix Logarithm for Human Action Recognition in Videos

Piotr Bilinski 1 Francois Bremond 1
1 STARS - Spatio-Temporal Activity Recognition Systems
CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : In this paper, we propose a new local spatio-temporal descriptor for videos and a new approach to action recognition in videos based on this descriptor. The new descriptor is called the Video Covariance Matrix Logarithm (VCML). It is based on a covariance matrix representation and models relationships between different low-level features, such as intensity and gradient. We apply the VCML descriptor to encode appearance information of local spatio-temporal video volumes, which are extracted by Dense Trajectories. We then present an extensive evaluation of the proposed VCML descriptor with Fisher vector encoding and Support Vector Machines on four challenging action recognition datasets. We show that the VCML descriptor achieves better results than state-of-the-art appearance descriptors. Moreover, we show that the VCML descriptor carries information complementary to the HOG descriptor and that their fusion gives a significant improvement in action recognition accuracy. Finally, we show that the VCML descriptor improves action recognition accuracy in comparison to the state-of-the-art Dense Trajectories, and that the proposed approach achieves superior performance to state-of-the-art methods.
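As a rough illustration of the idea named in the abstract (a covariance matrix over low-level features, followed by a matrix logarithm), the following Python sketch computes such a descriptor for a single video volume. It is not the authors' implementation: the feature set (intensity plus gradients along x, y and t), the small ridge term `eps`, the upper-triangle vectorization, and the function name `covariance_log_descriptor` are all assumptions made here for illustration, and NumPy is assumed to be available.

```python
import numpy as np


def covariance_log_descriptor(volume, eps=1e-6):
    """Sketch of a covariance-matrix-logarithm descriptor for one video volume.

    volume: array of shape (T, H, W) holding grayscale intensities.
    Returns a 1-D vector built from the upper triangle of log(C).
    """
    volume = np.asarray(volume, dtype=np.float64)

    # Per-pixel low-level features: intensity and gradients along t, y, x.
    # (This exact feature set is an assumption for illustration.)
    gt, gy, gx = np.gradient(volume)
    features = np.stack([volume, gx, gy, gt], axis=-1).reshape(-1, 4)

    # Covariance matrix of the features over the whole volume (4 x 4).
    cov = np.cov(features, rowvar=False)
    # Small ridge so the matrix is strictly positive definite (assumption).
    cov += eps * np.eye(cov.shape[0])

    # Matrix logarithm of a symmetric positive definite matrix
    # via its eigendecomposition: log(C) = U diag(log(lambda)) U^T.
    eigvals, eigvecs = np.linalg.eigh(cov)
    log_cov = (eigvecs * np.log(eigvals)) @ eigvecs.T

    # log(C) is symmetric, so its upper triangle carries all the information.
    rows, cols = np.triu_indices(log_cov.shape[0])
    return log_cov[rows, cols]


# Example on a random 5-frame, 8x8 volume (synthetic data, not from the paper).
rng = np.random.default_rng(0)
descriptor = covariance_log_descriptor(rng.random((5, 8, 8)))
print(descriptor.shape)  # (10,)
```

With four features the covariance matrix is 4 x 4, so the vectorized upper triangle has 10 entries. Taking the logarithm maps the covariance matrix into a vector space where standard encodings and linear classifiers can be applied directly, which is the usual motivation for this kind of mapping.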
Document type : Conference papers


Contributor : Piotr Bilinski
Submitted on : Saturday, October 17, 2015 - 6:01:46 PM
Last modification on : Thursday, January 20, 2022 - 5:28:19 PM
Long-term archiving on : Thursday, April 27, 2017 - 7:18:32 AM


Files produced by the author(s)


  • HAL Id : hal-01216849, version 1



Piotr Bilinski, Francois Bremond. Video Covariance Matrix Logarithm for Human Action Recognition in Videos. IJCAI 2015 - 24th International Joint Conference on Artificial Intelligence (IJCAI), Jul 2015, Buenos Aires, Argentina. ⟨hal-01216849⟩


