Action recognition via bio-inspired features: The richness of center-surround interaction

Maria-José Escobar 1 Pierre Kornprobst 2
2 NEUROMATHCOMP - Mathematical and Computational Neuroscience
CRISAM - Inria Sophia Antipolis - Méditerranée , JAD - Laboratoire Jean Alexandre Dieudonné : UMR6621
Abstract : Motion is a key feature for a wide class of computer vision approaches to recognize actions. In this article, we show how to define bio-inspired features for action recognition. To do so, we start from a well-established bio-inspired motion model of cortical areas \V1\ and MT. The primary visual cortex, designated as V1, is the first cortical area encountered in the visual stream processing and early responses of \V1\ cells consist in tiled sets of selective spatiotemporal filters. The second cortical area of interest in this article is area \MT\ where \MT\ cells pool incoming information from \V1\ according to the shape and characteristic of their receptive field. To go beyond the classical models and following the observations from Xiao et al. [61], we propose here to model different surround geometries for \MT\ cells receptive fields. Then, we define the so-called bio-inspired features associated to an input video, based on the average activity of \MT\ cells. Finally, we show how these features can be used in a standard classification method to perform action recognition. Results are given for the Weizmann and \KTH\ databases. Interestingly, we show that the diversity of motion representation at the \MT\ level (different surround geometries), is a major advantage for action recognition. On the Weizmann database, the inclusion of different \MT\ surround geometries improved the recognition rate from 63.01 Â± 2.07% up to 99.26 Â± 1.66% in the best case. Similarly, on the \KTH\ database, the recognition rate was significantly improved with the inclusion of \MT\ different surround geometries (from 47.82 Â± 2.71% up to 92.44 Â± 0.01% in the best case). We also discussed the limitations of the current approach which are closely related to the input video duration. These promising results encourage us to further develop bio-inspired models incorporating other brain mechanisms and cortical areas in order to deal with more complex videos.
Type de document :
Article dans une revue
Computer Vision and Image Understanding, Elsevier, 2012, 116 (5), pp.593-605. 〈10.1016/j.cviu.2012.01.002〉
Liste complète des métadonnées

https://hal.inria.fr/hal-00845585
Contributeur : Pierre Kornprobst <>
Soumis le : mercredi 17 juillet 2013 - 14:13:17
Dernière modification le : vendredi 12 janvier 2018 - 01:50:37

Identifiants

Collections

Citation

Maria-José Escobar, Pierre Kornprobst. Action recognition via bio-inspired features: The richness of center-surround interaction. Computer Vision and Image Understanding, Elsevier, 2012, 116 (5), pp.593-605. 〈10.1016/j.cviu.2012.01.002〉. 〈hal-00845585〉

Partager

Métriques

Consultations de la notice

306