Perspective-Aware CNN For Crowd Counting - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Rapport (Rapport De Recherche) Année : 2018

Perspective-Aware CNN For Crowd Counting

Résumé

Crowd counting is the task of estimating pedestrian numbers in crowd images. Modern crowd counting methods employ deep neural networks to estimate crowd counts via crowd density regressions. A major challenge of this task lies in the drastic changes of scales and perspectives in images. Representative approaches usually utilize different (large) sized filters and conduct patch-based estimations to tackle it, which is however computationally expensive. In this paper, we propose a perspective-aware convolutional neural network (PACNN) with a single backbone of small filters (e.g. 3 x 3). It directly predicts a perspective map in the network and encodes it as a perspective-aware weighting layer to adaptively combine the density outputs from multi-scale feature maps. The weights are learned at every pixel of the map such that the final combination is robust to perspective changes and pedestrian size variations. We conduct extensive experiments on the ShanghaiTech, WorldExpo'10 and UCF_CC_50 datasets, and demonstrate that PACNN achieves state-of-the-art results and runs as fast as the fastest.
Fichier principal
Vignette du fichier
PACNN-arxiv.pdf (2.32 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01831109 , version 1 (05-07-2018)
hal-01831109 , version 2 (09-12-2018)
hal-01831109 , version 3 (27-11-2019)

Identifiants

  • HAL Id : hal-01831109 , version 1

Citer

Miaojing Shi, Zhaohui Yang, Chao Xu, Qijun Chen. Perspective-Aware CNN For Crowd Counting. [Research Report] Inria Rennes - Bretagne Atlantique. 2018, pp.1-10. ⟨hal-01831109v1⟩
627 Consultations
512 Téléchargements

Partager

Gmail Facebook X LinkedIn More