Abstract
Crowd counting is getting more and more attention. More and more collective activities, such as the Olympics Games and the World Expo, are also important to control the crowd number. In this paper, we address the problem of crowd counting in the crowded scene. Our model accurately estimated the count of people in the crowded scene. Firstly, we proposed a novel and simple convolutional neural network, called Global Counting CNN (GCCNN). The GCCNN can learn a mapping, transforms the appearance of image patches to estimated density maps. Secondly, the Local to Global counting CNN (LGCCNN), calculating the density map from local to global. Stiching the local patches constrains the final density map of the larger area, which makes up for the difference values in the perspective map. In general, it makes the final density map more accurate. The dataset we used is a set of public dataset, which are WorldExpo’10 dataset, Shanghaitech dataset, the UCF_CC_50 dataset and the UCSD dataset. The experiments have proved our method achieves the state-of-the-art result over other algorithms.







Similar content being viewed by others
References
Chan AB, Vasconcelos N (2012) Counting people with low-level features and bayesian regression. IEEE Trans Image Process 21(4):2160–2177
Cong Z, Li H, Wang X, Yang X (2015) Cross-scene crowd counting via deep convolutional neural networks. In: IEEE conference on computer vision & pattern recognition
Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: Schmid C, Soatto S, Tomasi C (eds) International conference on computer vision & pattern recognition (CVPR ’05). https://doi.org/10.1109/CVPR.2005.177. https://hal.inria.fr/inria-00548512, vol 1. IEEE Computer Society, San Diego, pp 886–893
Ge W, Collins RT (2009) Marked point processes for crowd counting. In: IEEE conference on computer vision & pattern recognition
He K, Zhang X, Ren S, Jian S (2015) Delving deep into rectifiers: surpassing human-level performance on imagenet classification
Hu Y, Chang H, Nian F, Yan W, Teng L (2016) Dense crowd counting from still images with convolutional neural networks. J Vis Commun Image Represent 38(C):530–539
Ke C, Chen CL, Gong S, Tao X (2012) Feature mining for localised crowd counting. In: British machine vision conference
Lempitsky VS, Zisserman A (2010) Learning to count objects in images. In: International conference on neural information processing systems
Lin SF, Chen JY, Chao HX (2001) Estimation of number of people in crowded scenes using perspective transformation. Systems Man & Cybernetics Part A Systems & Humans IEEE Transactions on 31(6):645–654
Liu R, Chen Y, Zhu X, Hou K (2016) Image classification using label constrained sparse coding. Multimed Tools Appl 75(23):15619–15633
Liu T, Tao D (2014) On the robustness and generalization of cauchy regression. In: IEEE international conference on information science & technology
Long J, Shelhamer E, Darrell T (2015) Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3431–3440
Lowe DG, Lowe D (1999) Object recognition from local scale-invariant features. In: Proc. iccv
Loy CC, Gong S, Xiang T (2014) From semi-supervised to transfer counting of crowds. In: IEEE international conference on computer vision
Min L, Zhang Z, Huang K, Tan T (2009) Estimating the number of people in crowd- ed scenes by mid based foreground segmentation and head-shoulder detection. In: International conference on pattern recognition
Ojala T, Pietikinen M, Menp T (2002) Gray scale and rotation invariant texture classification with local binary patterns. IEEE Trans Pattern Anal Mach Intell 24(7):971–987
Rodriguez M, Laptev I, Sivic J, Audibert JY (2011) Density-aware person detection and tracking in crowds. In: International conference on computer vision
Sam DB, Surya S, Babu RV (2017) Switching convolutional neural network for crowd counting
Shen Z, Xu Y, Ni B, Wang M, Hu J, Yang X (2018) Crowd counting via adver- sarial cross-scale consistency pursuit. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5245–5254
Sindagi VA, Patel VM (2017) Generating high-quality crowd density maps using contextual pyramid cnns. In: Proceedings of the IEEE international conference on computer vision, pp 1861–1870
Wu B, Nevatia R (2005) Detection of multiple, partially occluded humans in a single image by bayesian combination of edgelet part detectors. In: Tenth IEEE international conference on computer vision
Zhang Y, Zhou D, Chen S, Gao S, Yi M (2016) Single-image crowd counting via multi-column convolutional neural network. In: Computer vision & pattern recognition
Acknowledgements
This work is supported by the National Key R&D Program of China (No. 2018YFB1305804), and the Anhui Provincial Natural Science Foundation of China (No. 1908085J25).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Chuanrui Hu and Kai Cheng are contributed equally to this work
Rights and permissions
About this article
Cite this article
Hu, C., Cheng, K., Xie, Y. et al. Arbitrary perspective crowd counting via local to global algorithm. Multimed Tools Appl 79, 15059–15071 (2020). https://doi.org/10.1007/s11042-020-08888-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-08888-5