Where to Focus on for Human Action Recognition?

Srijan Das; Arpit Chaudhary; Francois Bremond; Monique Thonnat

Communication Dans Un Congrès Année : 2019

Where to Focus on for Human Action Recognition?

(1) , (1) , (1) , (1)

Srijan Das

Fonction : Auteur
PersonId : 21855
IdHAL : srijan-das

Spatio-Temporal Activity Recognition Systems

Arpit Chaudhary

Fonction : Auteur

Spatio-Temporal Activity Recognition Systems

Francois Bremond

Fonction : Auteur
PersonId : 20805
IdHAL : francois-bremond
ORCID : 0000-0003-2988-2142
IdRef : 138919046

Spatio-Temporal Activity Recognition Systems

Monique Thonnat

Fonction : Auteur
PersonId : 873895

Spatio-Temporal Activity Recognition Systems

Résumé

In this paper, we present a new attention model for the recognition of human action from RGB-D videos. We propose an attention mechanism based on 3D articulated pose. The objective is to focus on the most relevant body parts involved in the action. For action classification, we propose a classification network compounded of spatio-temporal sub-networks modeling the appearance of human body parts and RNN attention subnetwork implementing our attention mechanism. Furthermore, we train our proposed network end-to-end using a regularized cross-entropy loss, leading to a joint training of the RNN delivering attention globally to the whole set of spatio-temporal features, extracted from 3D ConvNets. Our method outperforms the State-of-the-art methods on the largest human activity recognition dataset available to-date (NTU RGB+D Dataset) which is also multi-views and on a human action recognition dataset with object interaction (Northwestern-UCLA Multiview Action 3D Dataset).

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV]

Fichier principal

421.pdf (767.4 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

SRIJAN DAS : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01927432

Soumis le : lundi 19 novembre 2018-19:08:55

Dernière modification le : mercredi 15 mars 2023-08:58:09

Archivage à long terme le : mercredi 20 février 2019-16:18:21

Dates et versions

hal-01927432 , version 1 (19-11-2018)

Identifiants

HAL Id : hal-01927432 , version 1

Citer

Srijan Das, Arpit Chaudhary, Francois Bremond, Monique Thonnat. Where to Focus on for Human Action Recognition?. WACV 2019 - IEEE Winter Conference on Applications of Computer Vision, Jan 2019, Waikoloa Village, Hawaii, United States. pp.1-10. ⟨hal-01927432⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INRIA INRIA2 UNIV-COTEDAZUR OPAL

295 Consultations

1930 Téléchargements

Where to Focus on for Human Action Recognition?

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager