Y. Bai, Y. Zhang, M. Ding, and B. Ghanem, Finding tiny faces in the wild with generative adversarial network, CVPR, 2018.

A. Bearman, O. Russakovsky, V. Ferrari, and L. Fei-fei, What's the point: Semantic segmentation with point supervision, ECCV, 2016.

Y. Bengio, J. Louradour, R. Collobert, and J. Weston, Curriculum learning, ICML, vol.2, p.5, 2009.

S. Branson, P. Perona, and S. Belongie, Strong supervision from weak annotation: Interactive training of deformable part models, ICCV, 2011.

J. Gabriel, R. Brostow, and . Cipolla, Unsupervised bayesian detection of independent motion in crowds, CVPR, 2006.

B. Antoni, Z. Chan, N. Liang, and . Vasconcelos, Privacy preserving crowd monitoring: Counting people without people models or tracking, CVPR, vol.2, p.4, 2008.

B. Antoni, N. Chan, and . Vasconcelos, Bayesian poisson regression for crowd counting, ICCV, 2009.

R. Girshick, Fast r-cnn, ICCV, vol.3, p.5, 2015.

R. Girshick, J. Donahue, T. Darrell, and J. Malik, Rich feature hierarchies for accurate object detection and semantic segmentation, CVPR, vol.3, p.4, 2014.

R. Guerrero-gómez-olmedo, B. Torre-jiménez, R. López-sastre, S. Maldonado-bascón, and D. Onoro-rubio, Extremely overlapping vehicle counting, Iberian Conference on Pattern Recognition and Image Analysis, vol.6, 2005.

K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning for image recognition, CVPR, vol.3, p.4, 2016.

P. Hu and D. Ramanan, Finding tiny faces, CVPR, vol.2, p.3, 2017.

H. Idrees, I. Saleemi, C. Seibert, and M. Shah, Multi-source multi-scale counting in extremely dense crowd images, CVPR, vol.2, p.5, 2013.

H. Idrees, M. Tayyab, K. Athrey, D. Zhang, S. Al-maadeed et al., Composition loss for counting, density map estimation and localization in dense crowds, ECCV, 2008.

H. Jiang and E. Learned-miller, Face detection with the faster r-cnn, International Conference on Automatic Face & Gesture Recognition (FG), 2017.

S. Johnson and M. Everingham, Clustered pose and nonlinear appearance models for human pose estimation, p.3, 2010.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, Imagenet classification with deep convolutional neural networks, NIPS, 2012.

H. Issam, N. Laradji, P. O. Rostamzadeh, D. Pinheiro, M. Vazquez et al., Where are the blobs: Counting by localization with point supervision, ECCV, 2008.

V. Lempitsky and A. Zisserman, Learning to count objects in images, NIPS, 2010.

Y. Li, X. Zhang, and D. Chen, Csrnet: Dilated convolutional neural networks for understanding the highly congested scenes, CVPR, vol.7, 2006.

S. Liao, Y. Hu, X. Zhu, and S. Z. Li, Person re-identification by local maximal occurrence representation and metric learning, CVPR, 2015.

J. Liu, C. Gao, D. Meng, and A. G. Hauptmann, Decidenet: Counting varying density crowds through attention guided detection and density estimation, CVPR, 2008.

W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed et al., Ssd: Single shot multibox detector, ECCV, vol.2, p.3, 2016.

X. Liu, J. Weijer, and A. D. Bagdanov, Leveraging unlabeled data for crowd counting by learning to rank, CVPR, p.7, 2006.

Z. Lu, M. Shi, and Q. Chen, Crowd counting via scale-adaptive convolutional neural network, In WACV, vol.1, issue.2, 2018.

M. Najibi, P. Samangouei, R. Chellappa, and L. Davis, Ssh: Single stage headless face detector, ICCV, 2017.

D. Onoro, -. Rubio, and R. , Towards perspective-free object counting with deep learning, EC-CV, vol.1, 2016.

P. Dim, . Papadopoulos, R. R. Jasper, F. Uijlings, V. Keller et al., Extreme clicking for efficient object annotation, ICCV, pp.4940-4949, 2017.

V. Rabaud and S. Belongie, Counting crowded moving objects, CVPR, 2006.

D. Ramanan, Learning to parse images of articulated bodies, NIPS, 2007.

H. Viresh-ranjan, M. Le, and . Hoai, Iterative crowd counting, ECCV, vol.2, p.6, 2018.

K. Shaoqing-ren, R. He, J. Girshick, and . Sun, Faster r-cnn: Towards real-time object detection with region proposal networks, NIPS, vol.7, 2006.

M. Rodriguez, I. Laptev, J. Sivic, and J. Audibert, Density-aware person detection and tracking in crowds, ICCV, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00654266

O. Ronneberger, P. Fischer, and T. Brox, U-net: Convolutional networks for biomedical image segmentation, MICCAI, 2015.

S. Deepak-babu-sam, R. Surya, and . Babu, Switching convolutional neural network for crowd counting, CVPR, p.7, 2006.

B. Sapp and B. Taskar, Modec: Multimodal decomposable models for human pose estimation, CVPR, 2013.

F. Schroff, D. Kalenichenko, and J. Philbin, Facenet: A unified embedding for face recognition and clustering, CVPR, 2015.

M. Shi and V. Ferrari, Weakly supervised object localization using size estimates, ECCV, 2016.

M. Shi, Z. Yang, C. Xu, and Q. Chen, Revisiting perspective information for efficient crowd counting, CVPR, 2004.
URL : https://hal.archives-ouvertes.fr/hal-01831109

A. Shrivastava, A. Gupta, and R. Girshick, Training region-based object detectors with online hard example mining, CVPR, 2016.

A. Vishwanath, . Sindagi, M. Vishal, and . Patel, Generating highquality crowd density maps using contextual pyramid cnns, ICCV, p.7, 2006.

R. Stewart, M. Andriluka, and A. Ng, End-to-end people detection in crowded scenes, CVPR, 2016.

P. Viola, J. Michael, D. Jones, and . Snow, Detecting pedestrians using patterns of motion and appearance, IJCV, vol.63, issue.2, pp.153-161, 2003.

C. Wah and S. Branson, Pietro Perona, and Serge Belongie. Multiclass recognition and part localization with humans in the loop, ICCV, 2011.

T. Wang, B. Han, and J. Collomosse, Touchcut: Fast image and video segmentation using single-touch interaction, Computer Vision and Image Understanding, vol.120, issue.3, pp.14-30, 2014.

S. Yang, P. Luo, C. Loy, and X. Tang, Wider face: A face detection benchmark, CVPR, vol.2, p.5, 2016.

C. Zhang, H. Li, X. Wang, and X. Yang, Cross-scene crowd counting via deep convolutional neural networks, CVPR, vol.2, p.4, 2015.

X. Zhang, J. Feng, H. Xiong, and Q. Tian, Zigzag learning for weakly supervised object detection, CVPR, 2018.

Y. Zhang, D. Zhou, S. Chen, S. Gao, and Y. Ma, Single-image crowd counting via multi-column convolutional neural network, CVPR, 2005.