A time series kernel for action recognition

Adrien Gaidon 1, 2 Zaid Harchaoui 1 Cordelia Schmid 1
1 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : We address the problem of action recognition by describing actions as time series of frames and introduce a new kernel to compare their dynamical aspects. Action recognition in realistic videos has been successfully addressed using kernel methods like SVMs. Most existing approaches average local features over video volumes and compare the resulting vectors using kernels on bags of features. In contrast, we model actions as time series of per-frame representations and propose a kernel specifically tailored for the purpose of action recognition. Our main contributions are the following: (i) we provide a new principled way to compare the dynamics and temporal structure of actions by computing the distance between their auto-correlations, (ii) we derive a practical formulation to compute this distance in any feature space deriving from a base kernel between frames and (iii) we report experimental results on recent action recognition datasets showing that it provides useful complementary information to the average distribution of frames, as used in state-of-the-art models based on bag-of-features.
Document type :
Conference papers
Jesse Hoey and Stephen McKenna and Emanuele Trucco. BMVC 2011 - British Machine Vision Conference, Aug 2011, Dundee, United Kingdom. BMVA Press, pp.63.1-63.11, 2011, 〈10.5244/C.25.63〉
Liste complète des métadonnées

Cited literature [7 references]  Display  Hide  Download


https://hal.inria.fr/inria-00613089
Contributor : Thoth Team <>
Submitted on : Friday, August 5, 2011 - 11:09:43 AM
Last modification on : Monday, July 14, 2014 - 10:47:29 PM
Document(s) archivé(s) le : Monday, December 5, 2016 - 12:25:52 AM

Identifiers

Collections

Citation

Adrien Gaidon, Zaid Harchaoui, Cordelia Schmid. A time series kernel for action recognition. Jesse Hoey and Stephen McKenna and Emanuele Trucco. BMVC 2011 - British Machine Vision Conference, Aug 2011, Dundee, United Kingdom. BMVA Press, pp.63.1-63.11, 2011, 〈10.5244/C.25.63〉. 〈inria-00613089v2〉

Share

Metrics

Record views

1082

Document downloads

1565