Improved Motion Description for Action Classification - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue Frontiers in information and communication technologies Année : 2016

Improved Motion Description for Action Classification

Résumé

Even though the importance of explicitly integrating motion characteristics in video descriptions has been demonstrated by several recent papers on action classification, our current work concludes that adequately decomposing visual motion into dominant and residual motions, i.e., camera and scene motion, significantly improves action recognition algorithms. This holds true both for the extraction of the space-time trajectories and for computation of descriptors. We designed a new motion descriptor – the DCS descriptor – that captures additional information on local motion patterns enhancing results based on differential motion scalar quantities, divergence, curl, and shear features. Finally, applying the recent VLAD coding technique proposed in image retrieval provides a substantial improvement for action recognition. These findings are complementary to each other and they outperformed all previously reported results by a significant margin on three challenging datasets: Hollywood 2, HMDB51, and Olympic Sports as reported in Jain et al. (2013). These results were further improved by Oneata et al. (2013), Wang and Schmid (2013), and Zhu et al. (2013) through the use of the Fisher vector encoding. We therefore also employ Fisher vector in this paper, and we further enhance our approach by combining trajectories from both optical flow and compensated flow. We as well provide additional details of DCS descriptors, including visualization. For extending the evaluation, a novel dataset with 101 action classes, UCF101, was added.

Dates et versions

hal-01401833 , version 1 (23-11-2016)

Identifiants

Citer

Mihir Jain, Hervé Jégou, Patrick Bouthemy. Improved Motion Description for Action Classification . Frontiers in information and communication technologies, 2016, Computer Image Analysis, ⟨10.3389/fict.2015.00028⟩. ⟨hal-01401833⟩
179 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More