Abstract
In today’s scenario, there are frequent events (viz. political rallies, live concerts, strikes, sports meet) occur in which many people gather to participate in the event. In crowded areas possibility of occurrence of suspicious activities and violence automatically increases. To attenuate these issues, it is very important to develop automated systems that can detect anomalies in the diverse and complex outdoor environments, especially at public places to ensure safety and to avoid crowd disasters such as human stampede, mob lynching, and riots. Crowd management may help to circumvent the crowd disaster and ensure public safety at places like temples, railway stations, airports, bus terminals, religious functions, political rallies etc. We need a visual surveillance that can automatically detect abnormality in crowd behavior so that the relevant action can be taken to prevent any public casualty. Basic steps required for crowd analysis is density estimation and crowd counting, object recognition, tracking, and anomaly detection in crowded scene. Here, we have systematically reviewed and compared different methods that are being used for crowd analysis. The comparative analysis of existing methods has been presented on different available datasets and taxonomies are also compared. We also propose our own taxonomy for crowd analysis and datasets.
Similar content being viewed by others
References
https://medium.com/@bharathvarmaavs/list-of-mob-lynching-incidents-in-india-2019-5b97773f677f. Accessed 20 Mar 2021
https://economictimes.indiatimes.com/news/politics-and-nation/mumbai-people-protest-at-marine-drive-against-caa-violence-in-northeast-delhi/videoshow/74297667.cms?from=mdr. Accessed 20 Mar 2021
https://www.cnbc.com/2019/08/05/article-370-what-is-happening-in-kashmir-india-revokes-special-status.html. Accessed 20 Mar 2021
https://www.indiatoday.in/news-analysis/story/beyond-jnu-when-university-campuses-different-jamia-amu-1634384-2020-01-06. Accessed 20 Mar 2021
https://www.theguardian.com/world/2020/jan/06/students-injured-in-india-after-masked-attackers-raid-top-university. Accessed 20 Mar 2021
https://www.thenational.ae/world/asia/a-history-of-violence-at-indian-universities-1.557030. Accessed 20 Mar 2021
Ozturk O, Yamasaki T, Aizawa K (2010) Detecting dominant motion flows in unstructured/structured crowd scenes. In: 20th international conference on pattern recognition. IEEE, pp 3533–3536
Zitouni MS, Sluzek A, Bhaskar H (2019) Visual analysis of socio-cognitive crowd behaviors for surveillance: a survey and categorization of trends and methods. Eng Appl Artif Intell 82:294–312
Zhan B, Monekosso DN, Remagnino P, Velastin SA, Xu LQ (2008) Crowd analysis: a survey. Mach Vis Appl 19(5):345–357
Junior JCSJ, Musse SR, Jung CR (2010) Crowd analysis using computer vision techniques. IEEE Signal Process Mag 27(5):66–77
Bansod SD, Nandedkar AV (2020) Crowd anomaly detection and localization using histogram of magnitude and momentum. Vis Comput 36(3):609–620
Kok VJ, Lim MK, Chan CS (2016) Crowd behavior analysis: a review where physics meets biology. Neurocomputing 177:342–362
Yogameena B, Nagananthini C (2017) Computer vision based crowd disaster avoidance system: a survey. Int J Disaster Risk Reduct 22:95–129
Zhang X, Yu Q, Yu H (2018) Physics inspired methods for crowd video surveillance and analysis: a survey. IEEE Access 6:66816–66830
Nayan N, Sahu SS, Kumar S (2019) Detecting anomalous crowd behavior using correlation analysis of optical flow. SIViP 13(6):1233–1241
Afiq AA, Zakariya MA, Saad MN, Nurfarzana AA, Khir MHM, Fadzil AF, Faizari M (2019) A review on classifying abnormal behavior in crowd scene. J Vis Commun Image Represent 58:285–303
Mehran R, Oyama A, Shah M (2009) Abnormal crowd behavior detection using social force model. In: 2009 IEEE conference on computer vision and pattern recognition. IEEE, pp 935–942
Bour P, Cribelier E, Argyriou V (2019) Crowd behavior analysis from fixed and moving cameras. In: Multimodal behavior analysis in the wild, Academic Press, Cambridge, pp 289–322
Irfan M, Marcenaro L, Tokarchuk L (2016) Crowd analysis using visual and non-visual sensors, a survey. In: 2016 IEEE global conference on signal and information processing (GlobalSIP), IEEE, pp 1249–1254
Draghici A, Steen MV (2018) A survey of techniques for automatically sensing the behavior of a crowd. ACM Comput Surv (CSUR) 51(1):1–40
Srivastava S, Khari M, Crespo RG, Chaudhary G, Arora P (2021) Concepts and real-time applications of deep learning. Springer, Cham
Ko T (2008) A survey on behavior analysis in video surveillance for homeland security applications. In: 2008 37th IEEE applied imagery pattern recognition workshop, IEEE, pp 1–8
Candamo J, Shreve M, Goldgof DB, Sapper DB, Kasturi R (2009) Understanding transit scenes: a survey on human behavior-recognition algorithms. IEEE Trans Intell Transp Syst 11(1):206–224
Kim IS, Choi HS, Yi KM, Choi JY, Kong SG (2010) Intelligent visual surveillance—a survey. Int J Control Autom Syst 8(5):926–939
Ko T (2011) A survey on behaviour analysis in video surveillance applications. In: Video surveillance, InTech, London, pp 279–294
Muhammadi J, Rabiee HR, Hosseini A (2013) Crowd labeling: a survey. arXiv preprint arXiv:1301.2774
Gowsikhaa D, Abirami S, Baskaran R (2014) Automated human behavior analysis from surveillance videos: a survey. Artif Intell Rev 42(4):747–765
Li T, Chang H, Wang M, Ni B, Hong R, Yan S (2014) Crowded scene analysis: a survey. IEEE Trans Circuits Syst Video Technol 25(3):367–386
Khan MT, Ali A, Durrani MY, Siddiqui I (2015) Survey of holistic crowd analysis models. J Comput Sci Commun 1(1):1–9
Ijaz K, Sohail S, Hashish S (2015) A survey of latest approaches for crowd simulation and modeling using hybrid techniques. In: 17th UKSIMAMSS international conference on modelling and simulation, pp 111–116
Zitouni MS, Bhaskar H, Dias J, Al-Mualla ME (2016) Advances and trends in visual crowd analysis: a systematic survey and evaluation of crowd modelling techniques. Neurocomputing 186:139–159
Zhang Y, Ma Y (2016) Vision-based human behavior recognition and behavior understanding: a survey. In: 川 4th international conference on machinery, materials and computing technology (ICMMCT). Hangzhou, China: ICMMCT, pp 1899–1902
Swathi HY, Shivakumar G, Mohana HS (2017) Crowd behavior analysis: a survey. In: 2017 international conference on recent advances in electronics and communication technology (ICRAECT), IEEE, pp 169–178
Lamba S, Nain N (2017) Crowd monitoring and classification: a survey. In: Advances in computer and computational sciences. Springer, Singapore, pp 21–31
Ma J, Dai Y, Hirota K (2017) A survey of video-based crowd anomaly detection in dense scenes. J Adv Comput Intell Intell Inform 21(2):235–246
Sindagi VA, Patel VM (2018) A survey of recent advances in cnn-based single image crowd counting and density estimation. Pattern Recogn Lett 107:3–16
Savitha C, Ramesh D (2018) Motion detection in video surviellance: a systematic survey. In: 2018 2nd international conference on inventive systems and control (ICISC), IEEE, pp 51–54
Tripathi G, Singh K, Vishwakarma DK (2019) Convolutional neural networks for crowd behaviour analysis: a survey. Vis Comput 35(5):753–776
Sreenu G, Durai MS (2019) Intelligent video surveillance: a review through deep learning techniques for crowd analysis. J Big Data 6(1):1–27
Elbishlawi S, Abdelpakey MH, Eltantawy A, Shehata MS, Mohamed MM (2020) Deep learning-based crowd scene analysis survey. J Imaging 6(9):95
Abdou M, Erradi A (2020) Crowd counting: a survey of machine learning approaches. In: 2020 IEEE international conference on informatics, IoT, and enabling technologies (ICIoT), IEEE, pp 48–54
Luo Y, Lu J, Zhang B (2020) Crowd counting for static images: a survey of methodology. In: 2020 39th Chinese control conference (CCC), IEEE, pp 6602–6607
Jingying W (2021) A survey on crowd counting methods and datasets. In: Advances in computer, communication and computational sciences. Springer, Singapore, pp 851–863
Bendali-Braham M, Weber J, Forestier G, Idoumghar L, Muller PA (2021) Recent trends in crowd analysis: a review. Mach Learn Appl 4:100023
Baccouche M, Mamalet F, Wolf C, Garcia C, Baskurt A (2011) Sequential deep learning for human action recognition. In: International workshop on human behavior understanding. Springer, Berlin, pp 29–39
Marsden M, McGuinness K, Little S, O'Connor NE (2017) Resnetcrowd: a residual deep learning architecture for crowd counting, violent behaviour detection and crowd density level classification. In: 2017 14th IEEE international conference on advanced video and signal based surveillance (AVSS), IEEE, pp 1–7
Sindagi VA, Patel VM (2017) Cnn-based cascaded multi-task learning of high-level prior and density estimation for crowd counting. In: 14th IEEE international conference on advanced video and signal based surveillance (AVSS), pp 1–6
Tran D, Wang H, Torresani L, Ray J, LeCun Y, Paluri M (2018) A closer look at spatiotemporal convolutions for action recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6450–6459
Carreira J, Zisserman A (2017) Quo vadis, action recognition? A new model and the kinetics dataset. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp 6299–6308
You Q, Jiang H (2018) Action4d: real-time action recognition in the crowd and clutter. arXiv preprint arXiv:1806.02424
Wojke N, Bewley A, Paulus D (2017) Simple online and realtime tracking with a deep association metric. In: 2017 IEEE international conference on image processing (ICIP), IEEE, pp 3645–3649
Singh A, Patil D, Omkar SN (2018) Eye in the sky: real-time drone surveillance system (DSS) for violent individuals identification using scatternet hybrid deep learning network. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 1629–1637
Ravanbakhsh M, Nabi M, Mousavi H, Sangineto E, Sebe N (2018) Plug-and-play cnn for crowd motion analysis: an application in abnormal event detection. In: 2018 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 1689–1698
Vahora SA, Chauhan NC (2019) Deep neural network model for group activity recognition using contextual relationship. Eng Sci Technol 22(1):47–54
Shu T, Todorovic S, Zhu SC (2017) CERN: confidence-energy recurrent network for group activity recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5523–5531
Wei X, Du J, Xue Z, Liang M, Geng Y, Xu X, Lee J (2020) A very deep two-stream network for crowd type recognition. Neurocomputing 396:522–533
Yan L, Zhu M, Yu C (2019) Crowd video captioning. arXiv preprint arXiv:1911.05449
Ullah H, Khan SD, Ullah M, Cheikh FA, Uzair M (2019) Two stream model for crowd video classification. In: 2019 8th European workshop on visual information processing (EUVIP), IEEE, pp 93–98
Wan J, Chan A (2019) Adaptive density map generation for crowd counting. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 1130–1139
Wang Q, Gao J, Lin W, Yuan Y (2019) Learning from synthetic data for crowd counting in the wild. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 8198–8207
Alahi A, Goel K, Ramanathan V, Robicquet A, Fei-Fei L, Savarese S (2016) Social lstm: human trajectory prediction in crowded spaces. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 961–971
Bartoli F, Lisanti G, Seidenari L, Del Bimbo A (2017) PACE: prediction-based annotation for crowded environments. In: Proceedings of the 2017 ACM on international conference on multimedia retrieval, pp 121–124
Bisagno N, Zhang B, Conci N (2018) Group lstm: group trajectory prediction in crowded scenarios. In: Proceedings of the European conference on computer vision (ECCV) workshops
Singh K, Rajora S, Vishwakarma DK, Tripathi G, Kumar S, Walia GS (2020) Crowd anomaly detection using aggregation of ensembles of fine-tuned convnets. Neurocomputing 371:188–198
Lin S, Yang H, Tang X, Shi T, Chen L (2019) Social mil: interaction-aware for crowd anomaly detection. In: 2019 16th IEEE international conference on advanced video and signal based surveillance (AVSS), IEEE, pp 1–8
Dalal R, Khari M, Garg A, Gupta D, Gautam A (2021) False media detection by using deep learning. In: Multimodal biometric systems. CRC Press, Boca Raton, pp 79–88
Pillai MS, Chaudhary G, Khari M, Crespo RG (2021) Real-time image enhancement for an automatic automobile accident detection through CCTV using deep learning. Soft Comput 25(18):11929–11940
Rajagopal A, Joshi GP, Ramachandran A, Subbalakshmi R, Khari M, Jha S, Shankar K, You J (2020) A deep learning model based on multi-objective particle swarm optimization for scene classification in unmanned aerial vehicles. IEEE Access 8:135383–135393
LeCun Y (2015) LeNet-5, convolutional neural networks. 20(5):14. http://yann.lecun.com/exdb/lenet〹
Simonyan K, Zisserman A (2015) Vggnet. In: 3rd Int. Conf. Learn. Represent. ICLR
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D et al (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1–9
Targ S, Almeida D, Lyman K (2016) Resnet in resnet: generalizing residual architectures. arXiv preprint arXiv:1603.08029
Bhangale U, Patil S, Vishwanath V, Thakker P, Bansode A, Navandhar D (2020) Near real-time crowd counting using deep learning approach. Procedia Comput Sci 171:770–779
Sagar A (2020) Bayesian multi scale neural network for crowd counting. arXiv preprint arXiv:2007.14245
Chen Y (2020) Crowd behaviour recognition using enhanced butterfly optimization algorithm based recurrent neural network. Multimed Res. https://doi.org/10.46253/j.mr.v3i3.a3
Murugesan M, Thilagamani S (2020) Efficient anomaly detection in surveillance videos based on multi layer perception recurrent neural network. Microprocess Microsyst 79:103303
Hochreiter S, Schmidhuber J (1996) LSTM can solve hard long time lag problems. Adv Neural Inf Process Syst 9:473–479
Li B, Su W (2021) Long short-term memory network-based user behavior analysis in virtual reality training system—a case study of the ship communication and navigation equipment training. Arab J Geosci 14(1):1–8
Amrutha CV, Jyotsna C, Amudha J (2020) Deep learning approach for suspicious activity detection from surveillance video. In: 2020 2nd international conference on innovative mechanisms for industry applications (ICIMIA), IEEE, pp 335–339
Mahadevan V, Li W, Bhalodia V, Vasconcelos N (2010) Anomaly detection in crowded scenes. In: 2010 IEEE computer society conference on computer vision and pattern recognition, IEEE, pp 1975–1981
Ferryman J, Shahrokni A (2009) Pets2009: dataset and challenge. In: 2009 twelfth IEEE international workshop on performance evaluation of tracking and surveillance, IEEE, pp 1–6
Hassner T, Itcher Y, Kliper-Gross O (2012) Violent flows: real-time detection of violent crowd behavior. In: 2012 IEEE computer society conference on computer vision and pattern recognition workshops, IEEE, pp 1–6
Zhang Y, Zhou D, Chen S, Gao S, Ma Y (2016) Single-image crowd counting via multi-column convolutional neural network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 589–597
Soomro K, Zamir AR, Shah M (2012) UCF101: a dataset of 101 human actions classes from videos in the wild. arXiv preprint arXiv:1212.0402
Wu Y, Lim J, Yang MH (2013) Online object tracking: a benchmark. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2411–2418
Milan A, Leal-Taixé L, Reid I, Roth S, Schindler K (2016) MOT16: a benchmark for multi-object tracking. arXiv preprint arXiv:1603.00831
Sultani W, Chen C, Shah M (2018) Real-world anomaly detection in surveillance videos. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6479–6488
Fang Y, Zhan B, Cai W, Gao S, Hu B (2019) Locality-constrained spatial transformer network for video crowd counting. In: 2019 IEEE international conference on multimedia and expo (ICME), IEEE, pp 814–819
Chen K, Loy CC, Gong S, Xiang T (2012) Feature mining for localised crowd counting. Bmvc 1(2):3
Shao J, Kang K, Change Loy C, Wang X (2015) Deeply learned attributes for crowded scene understanding. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4657–4666
Ucf-cc-50. https://www.crcv.ucf.edu/data/UCFCrowdsDataset.zip
Amir Sjarif NN, Shamsuddin SM, Mohd Hashim SZ, Yuhaniz SS (2011) Crowd analysis and its applications. International conference on software engineering and computer systems. Springer, Berlin, pp 687–697
Di Stefano L, Neri G, Viarani E (2001) Analysis of pixel-level algorithms for video surveillance applications. In: Proceedings 11th international conference on image analysis and processing, IEEE, pp 541–546
Davies AC, Yin JH, Velastin SA (1995) Crowd monitoring using image processing. Electron Commun Eng J 7(1):37–47
Regazzoni CS, Tesei A, Murino V (1993) A real-time vision system for crowding monitoring. In: Proceedings of IECON'93–19th annual conference of IEEE industrial electronics, IEEE, pp 1860–1864
Cho SY, Chow TW, Leung CT (1999) A neural-based crowd estimation by hybrid global learning algorithm. IEEE Trans Syst Man Cybern B 29(4):535–541
Yang DB, González-Banos HH, Guibas LJ (2003) Counting people in crowds with a real-time network of simple image sensors. ICCV 3:122
Wu X, Liang G, Lee KK, Xu Y (2006) Crowd density estimation using texture analysis and learning. In: 2006 IEEE international conference on robotics and biomimetics, IEEE, pp 214–219
Marana AN, Costa LDF, Lotufo RA, Velastin SA (1998) On the efficacy of texture analysis for crowd monitoring. In: Proceedings SIBGRAPI'98. international symposium on computer graphics, image processing, and vision (cat. no. 98EX237), IEEE, pp 354–361
Rahmalan H, Nixon MS, Carter JN (2006) On crowd density estimation for surveillance. In: IET conference on crime and security, IET, pp 540–545
Chan AB, Liang ZSJ, Vasconcelos N (2008) Privacy preserving crowd monitoring: counting people without people models or tracking. In: 2008 IEEE conference on computer vision and pattern recognition, IEEE, pp 1–7
Lin SF, Chen JY, Chao HX (2001) Estimation of number of people in crowded scenes using perspective transformation. IEEE Trans Syst Man Cybern A 31(6):645–654
Zhao T, Nevatia R (2003) Bayesian human segmentation in crowded situations. In: 2003 IEEE computer society conference on computer vision and pattern recognition, 2003. Proceedings, vol 2, IEEE, pp II-459
Leibe B, Seemann E, Schiele B (2005) Pedestrian detection in crowded scenes. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR'05), vol 1, IEEE, pp 878–885
Rabaud V, Belongie S (2006) Counting crowded moving objects. In: 2006 IEEE computer society conference on computer vision and pattern recognition (CVPR'06), vol 1, IEEE, pp 705–711
Ilyas N, Shahzad A, Kim K (2019) Convolutional-neural network-based image crowd counting: review, categorization, analysis, and performance evaluation. Sensors 20(1):43
Fu M, Xu P, Li X, Liu Q, Ye M, Zhu C (2015) Fast crowd density estimation with convolutional neural networks. Eng Appl Artif Intell 43:81–88
Wang C, Zhang H, Yang L, Liu S, Cao X (2015) Deep people counting in extremely dense crowds. In: Proceedings of the 23rd ACM international conference on Multimedia, pp 1299–1302
Walach E, Wolf L (2016) Learning to count with CNN boosting. In: European conference on computer vision. Springer, Cham, pp 660–676
Liu M, Jiang J, Guo Z, Wang Z, Liu Y (2018) Crowd counting with fully convolutional neural network. In: 2018 25th IEEE international conference on image processing (ICIP), IEEE, pp 953–957
Babu Sam D, Surya S, Venkatesh Babu R (2017) Switching convolutional neural network for crowd counting. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5744–5752
Zhang L, Shi M, Chen Q (2018) Crowd counting via scale-adaptive convolutional neural network. In: 2018 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 1113–1121
Chen X, Bin Y, Sang N, Gao C (2019) Scale pyramid network for crowd counting. In: 2019 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 1941–1950
Boominathan L, Kruthiventi SS, Babu RV (2016) Crowdnet: a deep convolutional network for dense crowd counting. In: Proceedings of the 24th ACM international conference on Multimedia, pp 640–644
Hossain M, Hosseinzadeh M, Chanda O, Wang Y (2019) Crowd counting using scale-aware attention networks. In: 2019 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 1280–1288
Cao X, Wang Z, Zhao Y, Su F (2018) Scale aggregation network for accurate and efficient crowd counting. In: Proceedings of the European conference on computer vision (ECCV), pp 734–750
Kang D, Chan A (2018) Crowd counting by adaptively fusing predictions from an image pyramid. arXiv preprint arXiv:1805.06115
Shang C, Ai H, Bai B (2016) End-to-end crowd counting via joint learning local and global count. In: 2016 IEEE international conference on image processing (ICIP), IEEE, pp 1215–1219
Sheng B, Shen C, Lin G, Li J, Yang W, Sun C (2016) Crowd counting via weighted VLAD on a dense attribute feature map. IEEE Trans Circuits Syst Video Technol 28(8):1788–1797
Li Y, Zhang X, Chen D (2018) Csrnet: dilated convolutional neural networks for understanding the highly congested scenes. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1091–1100
Sindagi VA, Patel VM (2017) Generating high-quality crowd density maps using contextual pyramid cnns. In: Proceedings of the IEEE international conference on computer vision, pp 1861–1870
Zhao Z, Li H, Zhao R, Wang X (2016) Crossing-line crowd counting with two-phase deep neural networks. In: European conference on computer vision. Springer, Cham, pp 712–726
[link]. http://cvlab.uestc.edu.cn/CDECNN/DatasetChunxiRoad.zip
Ma W, Huang L, Liu C (2008) Crowd estimation using multi-scale local texture analysis and confidence-based soft classification. In: 2008 second international symposium on intelligent information technology application, vol 1, IEEE, pp 142–146
Ma W, Huang L, Liu C (2010) Crowd density analysis using co-occurrence texture features. In: 5th international conference on computer sciences and convergence information technology, IEEE, pp 170–175
Zhang C, Li H, Wang X, Yang X (2015) Cross-scene crowd counting via deep convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 833–841
Zeng X, Wu Y, Hu S, Wang R, Ye Y (2020) DSPNet: deep scale purifier network for dense crowd counting. Expert Syst Appl 141:112977
Ranjan V, Le H, Hoai M (2018) Iterative crowd counting. In: Proceedings of the European conference on computer vision (ECCV), pp 270–285
Gao J, Wang Q, Li X (2019) Pcc net: perspective crowd counting via spatial convolutional network. IEEE Trans Circuits Syst Video Technol 30(10):3486–3498
Guo D, Li K, Zha ZJ, Wang M (2019) Dadnet: dilated-attention-deformable convnet for crowd counting. In: Proceedings of the 27th ACM international conference on multimedia, pp 1823–1832
Dai F, Liu H, Ma Y, Zhang X, Zhao Q (2021) Dense scale network for crowd counting. In: Proceedings of the 2021 international conference on multimedia retrieval, pp 64–72
Cenggoro TW, Aslamiah AH, Yunanto A (2019) Feature pyramid networks for crowd counting. Procedia Comput Sci 157:175–182
Zhang Y, Zhou C, Chang F, Kot AC (2019) Multi-resolution attention convolutional neural network for crowd counting. Neurocomputing 329:144–152
Sindagi VA, Patel VM (2019) Ha-ccn: hierarchical attention-based crowd counting network. IEEE Trans Image Process 29:323–335
Liu L, Lu H, Zou H, Xiong H, Cao Z, Shen C (2020) Weighing counts: sequential crowd counting by reinforcement learning. In: European conference on computer vision. Springer, Cham, pp 164–181
Shi X, Li X, Wu C, Kong S, Yang J, He L (2020) A real-time deep network for crowd counting. In: ICASSP 2020–2020 IEEE international conference on acoustics, speech and signal processing (ICASSP), IEEE, pp 2328–2332
Luo A, Yang F, Li X, Nie D, Jiao Z, Zhou S, Cheng H (2020) Hybrid graph neural networks for crowd counting. Proc AAAI Conf Artif Intell 34(7):11693–11700
Zhou JT, Zhang L, Jiawei D, Peng X, Fang Z, Xiao Z, Zhu H (2021) Locality-aware crowd counting. IEEE Trans Pattern Anal Mach Intell. https://doi.org/10.1109/TPAMI.2021.3056518
Xiong F, Shi X, Yeung DY (2017) Spatiotemporal modeling for crowd counting in videos. In: Proceedings of the IEEE international conference on computer vision, pp 5151–5159
Gao J, Yuan Y, Wang Q (2020) Feature-aware adaptation and density alignment for crowd counting in video surveillance. IEEE Trans Cybern 51(10):4822–4833
Idrees H, Tayyab M, Athrey K, Zhang D, Al-Maadeed S, Rajpoot N, Shah M (2018) Composition loss for counting, density map estimation and localization in dense crowds. In: Proceedings of the European conference on computer vision (ECCV), pp 532–546
Wang P, Gao C, Wang Y, Li H, Gao Y (2020) MobileCount: an efficient encoder-decoder framework for real-time crowd counting. Neurocomputing 407:292–299
Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4700–4708
Yilmaz A, Javed O, Shah M (2006) Object tracking: a survey. Acm Comput Surv (CSUR) 38(4):13-es
Huang Y, Huang TS, Niemann H (2002) A region-based method for model-free object tracking. In: Object recognition supported by user interaction for service robots, vol 1, IEEE, pp 592–595
Schmaltz C, Rosenhahn B, Brox T, Cremers D, Weickert J, Wietzke L, Sommer G (2007). Region-based pose tracking. In: Iberian conference on pattern recognition and image analysis, Springer, Berlin, pp 56–63
Lee H, Kim D (2018) Salient region-based online object tracking. In: 2018 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 1170–1177
Hang Y, Derong C, Jiulu G (2018) Object tracking using both a kernel and a non-parametric active contour model. Neurocomputing 295:108–117
Sun W, Dong E (2019) Kullback-Leibler distance and graph cuts based active contour model for local segmentation. Biomed Signal Process Control 52:120–127
Tang F, Tao H (2005) Object tracking with dynamic feature graph. In: 2005 IEEE international workshop on visual surveillance and performance evaluation of tracking and surveillance, IEEE, pp 25–32
Yuan D, Zhang X, Liu J, Li D (2019) A multiple feature fused model for visual object tracking via correlation filters. Multimed Tools Appl 78(19):27271–27290
Tsai YT, Shih HC, Huang CL (2006) Multiple human objects tracking in crowded scenes. In: 18th international conference on pattern recognition (ICPR'06), vol 3, IEEE, pp 51–54
Chun J, Park S (2018) RGB-D model based human detection and tracking using 3D CCTV. In: 2018 9th IEEE annual ubiquitous computing, electronics & mobile communication conference (UEMCON), IEEE, pp 758–762
Soleimanitaleb Z, Keyvanrad MA, Jafari A (2019) Object tracking methods: a review. In: 2019 9th international conference on computer and knowledge engineering (ICCKE), IEEE, pp 282–288
Bu F, Cai Y, Yang Y (2016) Multiple object tracking based on faster-RCNN detector and KCF tracker. [Online]: http://www-personal.umich.edu/~yeeyoung/publication/MOT.pdf
Chahyati D, Fanany MI, Arymurthy AM (2017) Tracking people by detection using CNN features. Procedia Comput Sci 124:167–172
Agarwal A, Suryavanshi S (2017) Real-time* multiple object tracking (MOT) for autonomous navigation. Tech. rep
Nam H, Han B (2016) Learning multi-domain convolutional neural networks for visual tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4293–4302
Han B, Sim J, Adam H (2017) Branchout: regularization for online ensemble tracking with convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3356–3365
Chen K, Tao W (2017) Once for all: a two-flow convolutional neural network for visual tracking. IEEE Trans Circuits Syst Video Technol 28(12):3377–3386
Wang X, Li C, Luo B, Tang J (2018). Sint++: robust visual tracking via adversarial positive instance generation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4864–4873
Tao R, Gavves E, Smeulders AW (2016) Siamese instance search for tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1420–1429
Held D, Thrun S, Savarese S (2016) Learning to track at 100 fps with deep regression networks. In: European conference on computer vision. Springer, Cham, pp 749–765
Wang T, Ling H (2017) Gracker: a graph-based planar object tracker. IEEE Trans Pattern Anal Mach Intell 40(6):1494–1501
Nam H, Baek M, Han B (2016) Modeling and propagating cnns in a tree structure for visual tracking. arXiv preprint arXiv:1608.07242
Gomila C, Meyer F (2003) Graph-based object tracking. In: Proceedings 2003 international conference on image processing (cat. no. 03CH37429), vol 2, IEEE, pp II-41
Chen L, Peng X, Ren M (2018) Recurrent metric networks and batch multiple hypothesis for multi-object tracking. IEEE Access 7:3093–3105
Maksai A, Fua P (2018) Eliminating exposure bias and loss-evaluation mismatch in multiple object tracking. arXiv preprint arXiv:1811.10984
Son J, Baek M, Cho M, Han B (2017). Multi-object tracking with quadruplet convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5620–5629
Wang B, Wang L, Shuai B, Zuo Z, Liu T, Luk Chan K, Wang G (2016) Joint learning of convolutional neural networks and temporally constrained metrics for tracklet association. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 1–8
Leal-Taixé L, Canton-Ferrer C, Schindler K (2016) Learning by tracking: siamese CNN for robust target association. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 33–40
Chu P, Fan H, Tan CC, Ling H (2019) Online multi-object tracking with instance-aware tracker and dynamic model refreshment. In: 2019 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 161–170
Xiang J, Zhang G, Hou J (2019) Online multi-object tracking based on feature representation and Bayesian filtering within a deep learning architecture. IEEE Access 7:27923–27935
Yoon K, Kim DY, Yoon YC, Jeon M (2019) Data association for multi-object tracking via deep neural networks. Sensors 19(3):559
Jiang Y, Shin H, Ko H (2018) Precise regression for bounding box correction for improved tracking based on deep reinforcement learning. In: 2018 IEEE international conference on acoustics, speech and signal processing (ICASSP), IEEE, pp 1643–1647
Ren L, Lu J, Wang Z, Tian Q, Zhou J (2018) Collaborative deep reinforcement learning for multi-object tracking. In: Proceedings of the European conference on computer vision (ECCV), pp 586–602
Fang K, Xiang Y, Li X, Savarese S (2018) Recurrent autoregressive networks for online multi-object tracking. In: 2018 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 466–475
Zhou H, Ouyang W, Cheng J, Wang X, Li H (2018) Deep continuous conditional random fields with asymmetric inter-object constraints for online multi-object tracking. IEEE Trans Circuits Syst Video Technol 29(4):1011–1022
Chen L, Ai H, Shang C, Zhuang Z, Bai B (2017) Online multi-object tracking with convolutional neural networks. In: 2017 IEEE international conference on image processing (ICIP), IEEE, pp 645–649
Sadeghian A, Alahi A, Savarese S (2017) Tracking the untrackable: learning to track multiple cues with long-term dependencies. In: Proceedings of the IEEE international conference on computer vision, pp 300–311
Yang M, Wu Y, Jia Y (2017) A hybrid data association framework for robust online multi-object tracking. IEEE Trans Image Process 26(12):5667–5679
Chu Q, Ouyang W, Li H, Wang X, Liu B, Yu N (2017) Online multi-object tracking using CNN-based single object tracker with spatial-temporal attention mechanism. In: Proceedings of the IEEE international conference on computer vision, pp 4836–4845
Bae SH, Yoon KJ (2017) Confidence-based data association and discriminative deep appearance learning for robust online multi-object tracking. IEEE Trans Pattern Anal Mach Intell 40(3):595–610
Wang L, Xu L, Kim MY, Rigazico L, Yang MH (2017) Online multiple object tracking via flow and convolutional features. In: 2017 IEEE international conference on image processing (ICIP), IEEE, pp 3630–3634
Milan A, Rezatofighi SH, Dick A, Reid I, Schindler K (2017) Online multi-target tracking using recurrent neural networks. In: Thirty-first AAAI conference on artificial intelligence
Kim C, Li F, Ciptadi A, Rehg JM (2015) Multiple hypothesis tracking revisited. In: Proceedings of the IEEE international conference on computer vision, pp 4696–4704
Kim C, Li F, Rehg JM (2018) Multi-object tracking with neural gating using bilinear lstm. In: Proceedings of the European conference on computer vision (ECCV), pp 200–215
Babaee M, Li Z, Rigoll G (2018) Occlusion handling in tracking multiple people using RNN. In: 2018 25th IEEE international conference on image processing (ICIP), IEEE, pp 2715–2719
Sheng H, Zhang Y, Chen J, Xiong Z, Zhang J (2018) Heterogeneous association graph fusion for target association in multiple object tracking. IEEE Trans Circuits Syst Video Technol 29(11):3269–3280
Wen L, Du D, Li S, Bian X, Lyu S (2019) Learning non-uniform hypergraph for multi-object tracking. Proc AAAI Conf Artif Intell 33(1):8981–8988
Henschel R, Leal-Taixé L, Cremers D, Rosenhahn B (2018) Fusion of head and full-body detectors for multi-object tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 1428–1437
Ma C, Yang C, Yang F, Zhuang Y, Zhang Z, Jia H, Xie X (2018) Trajectory factory: tracklet cleaving and re-connection by deep siamese bi-gru for multiple object tracking. In: 2018 IEEE international conference on multimedia and expo (ICME), IEEE, pp 1–6
Ma L, Tang S, Black MJ, Van Gool L (2018) Customized multi-person tracker. In: Asian conference on computer vision. Springer, Cham, pp 612–628
Chen J, Sheng H, Zhang Y, Xiong Z (2017) Enhancing detection model for multiple hypothesis tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 18–27
Tang S, Andriluka M, Andres B, Schiele B (2017) Multiple people tracking by lifted multicut and person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3539–3548
Gan W, Wang S, Lei X, Lee MS, Kuo CCJ (2018) Online CNN-based multiple object tracking with enhanced model updates and identity association. Signal Process: Image Commun 66:95–102
Zhu J, Yang H, Liu N, Kim M, Zhang W, Yang MH (2018) Online multi-object tracking with dual matching attention networks. In: Proceedings of the European conference on computer vision (ECCV), pp 366–382
Chen L, Ai H, Zhuang Z, Shang C (2018) Real-time multiple people tracking with deeply learned candidate selection and person re-identification. In: 2018 IEEE international conference on multimedia and expo (ICME), IEEE, pp 1–6
Sanchez-Matilla R, Poiesi F, Cavallaro A (2016) Online multi-target tracking with strong and weak detections. In: European conference on computer vision. Springer, Cham, pp 84–99
Kim M, Alletto S, Rigazio L (2016) Similarity mapping with enhanced siamese network for multi-object tracking. arXiv preprint arXiv:1609.09156
Liang Y, Zhou Y (2018) LSTM multiple object tracker combining multiple cues. In: 2018 25th IEEE international conference on image processing (ICIP), IEEE, pp 2351–2355
Lee S, Kim E (2018) Multiple object tracking via feature pyramid siamese networks. IEEE Access 7:8181–8194
Fu Z, Angelini F, Naqvi SM, Chambers JA (2018) Gm-phd filter based online multiple human tracking using deep discriminative correlation matching. In: 2018 IEEE international conference on acoustics, speech and signal processing (ICASSP), IEEE, pp 4299–4303
Ahsan U, Sun C, Hays J, Essa I (2017) Complex event recognition from images with few training examples. In: 2017 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 669–678
Sabokrou M, Fayyaz M, Fathy M, Klette R (2017) Deep-cascade: cascading 3d deep neural networks for fast anomaly detection and localization in crowded scenes. IEEE Trans Image Process 26(4):1992–2004
Feng Y, Yuan Y, Lu X (2017) Learning deep event models for crowd anomaly detection. Neurocomputing 219:548–556
Ding T, Sznaier M, Camps OI (2008) Fast track matching and event detection. In: 2008 IEEE conference on computer vision and pattern recognition, IEEE, pp 1–8
Cheriyadat AM, Radke RJ (2008) Detecting dominant motions in dense crowds. IEEE J Sel Top Signal Process 2(4):568–581
Chang MB, Ullman T, Torralba A, Tenenbaum JB (2016) A compositional object-based approach to learning physical dynamics. arXiv preprint arXiv:1612.00341
Wang G, Fu H, Liu Y (2016) Real time abnormal crowd behavior detection based on adjacent flow location estimation. In: 2016 4th international conference on cloud computing and intelligence systems (CCIS), IEEE, pp 476–479
Wang L, Dong M (2012) Real-time detection of abnormal crowd behavior using a matrix approximation-based approach. In: 2012 19th IEEE international conference on image processing, IEEE, pp 2701–2704
Chibloun A, El Fkihi S, Mliki H, Hammami M, Thami ROH (2018) Abnormal crowd behavior detection using speed and direction models. In: 2018 9th international symposium on signal, image, video and communications (ISIVC), IEEE, pp 197–202
Zhang Y, Dong L, Xia Z, Li J (2015) Evolution of crowd behavior in surveillance video. In: 2015 IEEE international symposium on broadband multimedia systems and broadcasting, IEEE, pp 1–4
Andersson M, Rydell J, St-Laurent L, Prévost D, Gustafsson F (2012) Crowd analysis with target tracking, K-means clustering and hidden Markov models. In: 2012 15th international conference on information fusion, IEEE, pp 1903–1910
Dee HM, Caplier A (2010) Crowd behaviour analysis using histograms of motion direction. In: 2010 IEEE international conference on image processing, IEEE, pp 1545–1548
Shi J et al (1994) Good features to track. In: 1994 proceedings of IEEE conference on computer vision and pattern recognition, IEEE, pp 593–600
de-la-Calle-Silos E, González-Díaz I, Díaz-de-María E (2013) Mid-level feature set for specific event and anomaly detection in crowded scenes. In: 2013 IEEE international conference on image processing, IEEE, pp 4001–4005
Yogameena B, Perumal SS, Packiyaraj N, Saravanan P (2012) Ma-Th algorithm for people count in a dense crowd and their behaviour classification. In: 2012 international conference on machine vision and image processing (MVIP), IEEE, pp 17–20
Gibson JJ (1950) The perception of the visual world. Hooughton Mifflin Boston
Kajo I, Malik AS, Kamel N (2016) An evaluation of optical flow algorithms for crowd analytics in surveillance system. In: 2016 6th international conference on intelligent and advanced systems (ICIAS), IEEE, pp 1–6
Kajo I, Malik AS, Kamel N (2015) Motion estimation of crowd flow using optical flow techniques: a review. In: 2015 9th international conference on signal processing and communication systems (ICSPCS), IEEE, pp 1–9
Lahiri S, Jyoti N, Pyati S, Dewan J (2018) Abnormal crowd behavior detection using image processing. In: 2018 fourth international conference on computing communication control and automation (ICCUBEA), IEEE, pp 1–5
Yuan Y, Fang J, Wang Q (2014) Online anomaly detection in crowd scenes via structure analysis. IEEE Trans Cybern 45(3):548–561
Yi S (2016) Pedestrian behavior modeling and understanding in crowds. The Chinese University of Hong Kong, Hong Kong
Cao L, Zhang X, Ren W, Huang K (2015) Large scale crowd analysis based on convolutional neural network. Pattern Recogn 48(10):3016–3024
Kang K, Wang X (2014). Fully convolutional neural networks for crowd segmentation. arXiv preprint arXiv:1411.4464
Shao J, Loy CC, Kang K, Wang X (2016) Slicing convolutional neural network for crowd video understanding. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5620–5628
Burney A, Syed TQ (2016) Crowd video classification using convolutional neural networks. In: 2016 international conference on Frontiers of information technology (FIT), IEEE, pp 247–251
Sabokrou M, Fayyaz M, Fathy M, Klette R (2016) Fully convolutional neural network for fast anomaly detection in crowded scenes. arXiv preprint arXiv:1609.00866
Zhou S, Shen W, Zeng D, Fang M, Wei Y, Zhang Z (2016) Spatial–temporal convolutional neural networks for anomaly detection and localization in crowded scenes. Signal Process: Image Commun 47:358–368
Smeureanu S, Ionescu RT, Popescu M, Alexe B (2017) Deep appearance features for abnormal behavior detection in video. In: International conference on image analysis and processing. Springer, Cham, pp 779–789
Halder R, Chatterjee R (2020) CNN-BiLSTM model for violence detection in smart surveillance. SN Comput Sci 1(4):1–9
Ding C, Fan S, Zhu M, Feng W, Jia B (2014) Violence detection in video by using 3D convolutional neural networks. In: International symposium on visual computing. Springer, Cham, pp 551–558
Li J, Jiang X, Sun T, Xu K (2019) Efficient violence detection using 3d convolutional neural networks. In: 2019 16th IEEE international conference on advanced video and signal based surveillance (AVSS), IEEE, pp 1–8
Demarty CH, Penet C, Soleymani M, Gravier G (2015) VSD, a public dataset for the detection of violent scenes in movies: design, annotation, analysis and evaluation. Multimed Tools Appl 74(17):7379–7404
Li C, Zhu L, Zhu D, Chen J, Pan Z, Li X, Wang B (2018) End-to-end multiplayer violence detection based on deep 3D CNN. In: Proceedings of the 2018 VII international conference on network, communication and computing, pp 227–230
Arandjelovic R, Gronat P, Torii A, Pajdla T, Sivic J (2016) NetVLAD: CNN architecture for weakly supervised place recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5297–5307
Mu G, Cao H, Jin Q (2016) Violent scene detection using convolutional neural networks and deep audio features. In: Chinese conference on pattern recognition. Springer, Singapore, pp 451–463
Abdali AMR, Al-Tuma RF (2019) Robust real-time violence detection in video using cnn and lstm. In: 2019 2nd scientific conference of computer sciences (SCCS), IEEE, pp 104–108
Mumtaz A, Sargano AB, Habib Z (2018) Violence detection in surveillance videos with deep network using transfer learning. In: 2018 2nd European conference on electrical engineering and computer science (EECS), IEEE, pp 558–563
Sudhakaran S, Lanz O (2017) Learning to detect violent videos using convolutional long short-term memory. In: 2017 14th IEEE international conference on advanced video and signal based surveillance (AVSS), IEEE, pp 1–6
Hanson A, Pnvr K, Krishnagopal S, Davis L (2018) Bidirectional convolutional lstm for the detection of violence in videos. In: Proceedings of the European conference on computer vision (ECCV) workshops
Meng Z, Yuan J, Li Z (2017) Trajectory-pooled deep convolutional networks for violence detection in videos. In: International conference on computer vision systems. Springer, Cham, pp 437–447
Song W, Zhang D, Zhao X, Yu J, Zheng R, Wang A (2019) A novel violent video detection scheme based on modified 3d convolutional neural networks. IEEE Access 7:39172–39179
Ullah FUM, Ullah A, Muhammad K, Haq IU, Baik SW (2019) Violence detection using spatiotemporal features with 3D convolutional neural network. Sensors 19(11):2472
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
There is no conflict of interests regarding the publication of this manuscript.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Tyagi, B., Nigam, S. & Singh, R. A Review of Deep Learning Techniques for Crowd Behavior Analysis. Arch Computat Methods Eng 29, 5427–5455 (2022). https://doi.org/10.1007/s11831-022-09772-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11831-022-09772-1