[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ Skip to main content
Log in

A Review of Deep Learning Techniques for Crowd Behavior Analysis

  • Review article
  • Published:
Archives of Computational Methods in Engineering Aims and scope Submit manuscript

Abstract

In today’s scenario, there are frequent events (viz. political rallies, live concerts, strikes, sports meet) occur in which many people gather to participate in the event. In crowded areas possibility of occurrence of suspicious activities and violence automatically increases. To attenuate these issues, it is very important to develop automated systems that can detect anomalies in the diverse and complex outdoor environments, especially at public places to ensure safety and to avoid crowd disasters such as human stampede, mob lynching, and riots. Crowd management may help to circumvent the crowd disaster and ensure public safety at places like temples, railway stations, airports, bus terminals, religious functions, political rallies etc. We need a visual surveillance that can automatically detect abnormality in crowd behavior so that the relevant action can be taken to prevent any public casualty. Basic steps required for crowd analysis is density estimation and crowd counting, object recognition, tracking, and anomaly detection in crowded scene. Here, we have systematically reviewed and compared different methods that are being used for crowd analysis. The comparative analysis of existing methods has been presented on different available datasets and taxonomies are also compared. We also propose our own taxonomy for crowd analysis and datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22

Similar content being viewed by others

References

  1. https://medium.com/@bharathvarmaavs/list-of-mob-lynching-incidents-in-india-2019-5b97773f677f. Accessed 20 Mar 2021

  2. https://economictimes.indiatimes.com/news/politics-and-nation/mumbai-people-protest-at-marine-drive-against-caa-violence-in-northeast-delhi/videoshow/74297667.cms?from=mdr. Accessed 20 Mar 2021

  3. https://www.cnbc.com/2019/08/05/article-370-what-is-happening-in-kashmir-india-revokes-special-status.html. Accessed 20 Mar 2021

  4. https://www.indiatoday.in/news-analysis/story/beyond-jnu-when-university-campuses-different-jamia-amu-1634384-2020-01-06. Accessed 20 Mar 2021

  5. https://www.theguardian.com/world/2020/jan/06/students-injured-in-india-after-masked-attackers-raid-top-university. Accessed 20 Mar 2021

  6. https://www.thenational.ae/world/asia/a-history-of-violence-at-indian-universities-1.557030. Accessed 20 Mar 2021

  7. Ozturk O, Yamasaki T, Aizawa K (2010) Detecting dominant motion flows in unstructured/structured crowd scenes. In: 20th international conference on pattern recognition. IEEE, pp 3533–3536

  8. Zitouni MS, Sluzek A, Bhaskar H (2019) Visual analysis of socio-cognitive crowd behaviors for surveillance: a survey and categorization of trends and methods. Eng Appl Artif Intell 82:294–312

    Article  Google Scholar 

  9. Zhan B, Monekosso DN, Remagnino P, Velastin SA, Xu LQ (2008) Crowd analysis: a survey. Mach Vis Appl 19(5):345–357

    Article  Google Scholar 

  10. Junior JCSJ, Musse SR, Jung CR (2010) Crowd analysis using computer vision techniques. IEEE Signal Process Mag 27(5):66–77

    Google Scholar 

  11. Bansod SD, Nandedkar AV (2020) Crowd anomaly detection and localization using histogram of magnitude and momentum. Vis Comput 36(3):609–620

    Article  Google Scholar 

  12. Kok VJ, Lim MK, Chan CS (2016) Crowd behavior analysis: a review where physics meets biology. Neurocomputing 177:342–362

    Article  Google Scholar 

  13. Yogameena B, Nagananthini C (2017) Computer vision based crowd disaster avoidance system: a survey. Int J Disaster Risk Reduct 22:95–129

    Article  Google Scholar 

  14. Zhang X, Yu Q, Yu H (2018) Physics inspired methods for crowd video surveillance and analysis: a survey. IEEE Access 6:66816–66830

    Article  Google Scholar 

  15. Nayan N, Sahu SS, Kumar S (2019) Detecting anomalous crowd behavior using correlation analysis of optical flow. SIViP 13(6):1233–1241

    Article  Google Scholar 

  16. Afiq AA, Zakariya MA, Saad MN, Nurfarzana AA, Khir MHM, Fadzil AF, Faizari M (2019) A review on classifying abnormal behavior in crowd scene. J Vis Commun Image Represent 58:285–303

    Article  Google Scholar 

  17. Mehran R, Oyama A, Shah M (2009) Abnormal crowd behavior detection using social force model. In: 2009 IEEE conference on computer vision and pattern recognition. IEEE, pp 935–942

  18. Bour P, Cribelier E, Argyriou V (2019) Crowd behavior analysis from fixed and moving cameras. In: Multimodal behavior analysis in the wild, Academic Press, Cambridge, pp 289–322

  19. Irfan M, Marcenaro L, Tokarchuk L (2016) Crowd analysis using visual and non-visual sensors, a survey. In: 2016 IEEE global conference on signal and information processing (GlobalSIP), IEEE, pp 1249–1254

  20. Draghici A, Steen MV (2018) A survey of techniques for automatically sensing the behavior of a crowd. ACM Comput Surv (CSUR) 51(1):1–40

    Article  Google Scholar 

  21. Srivastava S, Khari M, Crespo RG, Chaudhary G, Arora P (2021) Concepts and real-time applications of deep learning. Springer, Cham

    Book  Google Scholar 

  22. Ko T (2008) A survey on behavior analysis in video surveillance for homeland security applications. In: 2008 37th IEEE applied imagery pattern recognition workshop, IEEE, pp 1–8

  23. Candamo J, Shreve M, Goldgof DB, Sapper DB, Kasturi R (2009) Understanding transit scenes: a survey on human behavior-recognition algorithms. IEEE Trans Intell Transp Syst 11(1):206–224

    Article  Google Scholar 

  24. Kim IS, Choi HS, Yi KM, Choi JY, Kong SG (2010) Intelligent visual surveillance—a survey. Int J Control Autom Syst 8(5):926–939

    Article  Google Scholar 

  25. Ko T (2011) A survey on behaviour analysis in video surveillance applications. In: Video surveillance, InTech, London, pp 279–294

  26. Muhammadi J, Rabiee HR, Hosseini A (2013) Crowd labeling: a survey. arXiv preprint arXiv:1301.2774

  27. Gowsikhaa D, Abirami S, Baskaran R (2014) Automated human behavior analysis from surveillance videos: a survey. Artif Intell Rev 42(4):747–765

    Article  Google Scholar 

  28. Li T, Chang H, Wang M, Ni B, Hong R, Yan S (2014) Crowded scene analysis: a survey. IEEE Trans Circuits Syst Video Technol 25(3):367–386

    Article  Google Scholar 

  29. Khan MT, Ali A, Durrani MY, Siddiqui I (2015) Survey of holistic crowd analysis models. J Comput Sci Commun 1(1):1–9

    Google Scholar 

  30. Ijaz K, Sohail S, Hashish S (2015) A survey of latest approaches for crowd simulation and modeling using hybrid techniques. In: 17th UKSIMAMSS international conference on modelling and simulation, pp 111–116

  31. Zitouni MS, Bhaskar H, Dias J, Al-Mualla ME (2016) Advances and trends in visual crowd analysis: a systematic survey and evaluation of crowd modelling techniques. Neurocomputing 186:139–159

    Article  Google Scholar 

  32. Zhang Y, Ma Y (2016) Vision-based human behavior recognition and behavior understanding: a survey. In: 川 4th international conference on machinery, materials and computing technology (ICMMCT). Hangzhou, China: ICMMCT, pp 1899–1902

  33. Swathi HY, Shivakumar G, Mohana HS (2017) Crowd behavior analysis: a survey. In: 2017 international conference on recent advances in electronics and communication technology (ICRAECT), IEEE, pp 169–178

  34. Lamba S, Nain N (2017) Crowd monitoring and classification: a survey. In: Advances in computer and computational sciences. Springer, Singapore, pp 21–31

    Chapter  Google Scholar 

  35. Ma J, Dai Y, Hirota K (2017) A survey of video-based crowd anomaly detection in dense scenes. J Adv Comput Intell Intell Inform 21(2):235–246

    Article  Google Scholar 

  36. Sindagi VA, Patel VM (2018) A survey of recent advances in cnn-based single image crowd counting and density estimation. Pattern Recogn Lett 107:3–16

    Article  Google Scholar 

  37. Savitha C, Ramesh D (2018) Motion detection in video surviellance: a systematic survey. In: 2018 2nd international conference on inventive systems and control (ICISC), IEEE, pp 51–54

  38. Tripathi G, Singh K, Vishwakarma DK (2019) Convolutional neural networks for crowd behaviour analysis: a survey. Vis Comput 35(5):753–776

    Article  Google Scholar 

  39. Sreenu G, Durai MS (2019) Intelligent video surveillance: a review through deep learning techniques for crowd analysis. J Big Data 6(1):1–27

    Article  Google Scholar 

  40. Elbishlawi S, Abdelpakey MH, Eltantawy A, Shehata MS, Mohamed MM (2020) Deep learning-based crowd scene analysis survey. J Imaging 6(9):95

    Article  Google Scholar 

  41. Abdou M, Erradi A (2020) Crowd counting: a survey of machine learning approaches. In: 2020 IEEE international conference on informatics, IoT, and enabling technologies (ICIoT), IEEE, pp 48–54

  42. Luo Y, Lu J, Zhang B (2020) Crowd counting for static images: a survey of methodology. In: 2020 39th Chinese control conference (CCC), IEEE, pp 6602–6607

  43. Jingying W (2021) A survey on crowd counting methods and datasets. In: Advances in computer, communication and computational sciences. Springer, Singapore, pp 851–863

    Chapter  Google Scholar 

  44. Bendali-Braham M, Weber J, Forestier G, Idoumghar L, Muller PA (2021) Recent trends in crowd analysis: a review. Mach Learn Appl 4:100023

    Google Scholar 

  45. Baccouche M, Mamalet F, Wolf C, Garcia C, Baskurt A (2011) Sequential deep learning for human action recognition. In: International workshop on human behavior understanding. Springer, Berlin, pp 29–39

    Chapter  Google Scholar 

  46. Marsden M, McGuinness K, Little S, O'Connor NE (2017) Resnetcrowd: a residual deep learning architecture for crowd counting, violent behaviour detection and crowd density level classification. In: 2017 14th IEEE international conference on advanced video and signal based surveillance (AVSS), IEEE, pp 1–7

  47. Sindagi VA, Patel VM (2017) Cnn-based cascaded multi-task learning of high-level prior and density estimation for crowd counting. In: 14th IEEE international conference on advanced video and signal based surveillance (AVSS), pp 1–6

  48. Tran D, Wang H, Torresani L, Ray J, LeCun Y, Paluri M (2018) A closer look at spatiotemporal convolutions for action recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6450–6459

  49. Carreira J, Zisserman A (2017) Quo vadis, action recognition? A new model and the kinetics dataset. In: proceedings of the IEEE conference on computer vision and pattern recognition, pp 6299–6308

  50. You Q, Jiang H (2018) Action4d: real-time action recognition in the crowd and clutter. arXiv preprint arXiv:1806.02424

  51. Wojke N, Bewley A, Paulus D (2017) Simple online and realtime tracking with a deep association metric. In: 2017 IEEE international conference on image processing (ICIP), IEEE, pp 3645–3649

  52. Singh A, Patil D, Omkar SN (2018) Eye in the sky: real-time drone surveillance system (DSS) for violent individuals identification using scatternet hybrid deep learning network. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 1629–1637

  53. Ravanbakhsh M, Nabi M, Mousavi H, Sangineto E, Sebe N (2018) Plug-and-play cnn for crowd motion analysis: an application in abnormal event detection. In: 2018 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 1689–1698

  54. Vahora SA, Chauhan NC (2019) Deep neural network model for group activity recognition using contextual relationship. Eng Sci Technol 22(1):47–54

    Google Scholar 

  55. Shu T, Todorovic S, Zhu SC (2017) CERN: confidence-energy recurrent network for group activity recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5523–5531

  56. Wei X, Du J, Xue Z, Liang M, Geng Y, Xu X, Lee J (2020) A very deep two-stream network for crowd type recognition. Neurocomputing 396:522–533

    Article  Google Scholar 

  57. Yan L, Zhu M, Yu C (2019) Crowd video captioning. arXiv preprint arXiv:1911.05449

  58. Ullah H, Khan SD, Ullah M, Cheikh FA, Uzair M (2019) Two stream model for crowd video classification. In: 2019 8th European workshop on visual information processing (EUVIP), IEEE, pp 93–98

  59. Wan J, Chan A (2019) Adaptive density map generation for crowd counting. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 1130–1139

  60. Wang Q, Gao J, Lin W, Yuan Y (2019) Learning from synthetic data for crowd counting in the wild. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 8198–8207

  61. Alahi A, Goel K, Ramanathan V, Robicquet A, Fei-Fei L, Savarese S (2016) Social lstm: human trajectory prediction in crowded spaces. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 961–971

  62. Bartoli F, Lisanti G, Seidenari L, Del Bimbo A (2017) PACE: prediction-based annotation for crowded environments. In: Proceedings of the 2017 ACM on international conference on multimedia retrieval, pp 121–124

  63. Bisagno N, Zhang B, Conci N (2018) Group lstm: group trajectory prediction in crowded scenarios. In: Proceedings of the European conference on computer vision (ECCV) workshops

  64. Singh K, Rajora S, Vishwakarma DK, Tripathi G, Kumar S, Walia GS (2020) Crowd anomaly detection using aggregation of ensembles of fine-tuned convnets. Neurocomputing 371:188–198

    Article  Google Scholar 

  65. Lin S, Yang H, Tang X, Shi T, Chen L (2019) Social mil: interaction-aware for crowd anomaly detection. In: 2019 16th IEEE international conference on advanced video and signal based surveillance (AVSS), IEEE, pp 1–8

  66. Dalal R, Khari M, Garg A, Gupta D, Gautam A (2021) False media detection by using deep learning. In: Multimodal biometric systems. CRC Press, Boca Raton, pp 79–88

    Chapter  Google Scholar 

  67. Pillai MS, Chaudhary G, Khari M, Crespo RG (2021) Real-time image enhancement for an automatic automobile accident detection through CCTV using deep learning. Soft Comput 25(18):11929–11940

    Article  Google Scholar 

  68. Rajagopal A, Joshi GP, Ramachandran A, Subbalakshmi R, Khari M, Jha S, Shankar K, You J (2020) A deep learning model based on multi-objective particle swarm optimization for scene classification in unmanned aerial vehicles. IEEE Access 8:135383–135393

    Article  Google Scholar 

  69. LeCun Y (2015) LeNet-5, convolutional neural networks. 20(5):14. http://yann.lecun.com/exdb/lenet

  70. Simonyan K, Zisserman A (2015) Vggnet. In: 3rd Int. Conf. Learn. Represent. ICLR

  71. Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D et al (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition. pp 1–9

  72. Targ S, Almeida D, Lyman K (2016) Resnet in resnet: generalizing residual architectures. arXiv preprint arXiv:1603.08029

  73. Bhangale U, Patil S, Vishwanath V, Thakker P, Bansode A, Navandhar D (2020) Near real-time crowd counting using deep learning approach. Procedia Comput Sci 171:770–779

    Article  Google Scholar 

  74. Sagar A (2020) Bayesian multi scale neural network for crowd counting. arXiv preprint arXiv:2007.14245

  75. Chen Y (2020) Crowd behaviour recognition using enhanced butterfly optimization algorithm based recurrent neural network. Multimed Res. https://doi.org/10.46253/j.mr.v3i3.a3

    Article  Google Scholar 

  76. Murugesan M, Thilagamani S (2020) Efficient anomaly detection in surveillance videos based on multi layer perception recurrent neural network. Microprocess Microsyst 79:103303

    Article  Google Scholar 

  77. Hochreiter S, Schmidhuber J (1996) LSTM can solve hard long time lag problems. Adv Neural Inf Process Syst 9:473–479

    Google Scholar 

  78. Li B, Su W (2021) Long short-term memory network-based user behavior analysis in virtual reality training system—a case study of the ship communication and navigation equipment training. Arab J Geosci 14(1):1–8

    Article  Google Scholar 

  79. Amrutha CV, Jyotsna C, Amudha J (2020) Deep learning approach for suspicious activity detection from surveillance video. In: 2020 2nd international conference on innovative mechanisms for industry applications (ICIMIA), IEEE, pp 335–339

  80. Mahadevan V, Li W, Bhalodia V, Vasconcelos N (2010) Anomaly detection in crowded scenes. In: 2010 IEEE computer society conference on computer vision and pattern recognition, IEEE, pp 1975–1981

  81. Ferryman J, Shahrokni A (2009) Pets2009: dataset and challenge. In: 2009 twelfth IEEE international workshop on performance evaluation of tracking and surveillance, IEEE, pp 1–6

  82. Hassner T, Itcher Y, Kliper-Gross O (2012) Violent flows: real-time detection of violent crowd behavior. In: 2012 IEEE computer society conference on computer vision and pattern recognition workshops, IEEE, pp 1–6

  83. Zhang Y, Zhou D, Chen S, Gao S, Ma Y (2016) Single-image crowd counting via multi-column convolutional neural network. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 589–597

  84. Soomro K, Zamir AR, Shah M (2012) UCF101: a dataset of 101 human actions classes from videos in the wild. arXiv preprint arXiv:1212.0402

  85. Wu Y, Lim J, Yang MH (2013) Online object tracking: a benchmark. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2411–2418

  86. Milan A, Leal-Taixé L, Reid I, Roth S, Schindler K (2016) MOT16: a benchmark for multi-object tracking. arXiv preprint arXiv:1603.00831

  87. Sultani W, Chen C, Shah M (2018) Real-world anomaly detection in surveillance videos. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 6479–6488

  88. Fang Y, Zhan B, Cai W, Gao S, Hu B (2019) Locality-constrained spatial transformer network for video crowd counting. In: 2019 IEEE international conference on multimedia and expo (ICME), IEEE, pp 814–819

  89. Chen K, Loy CC, Gong S, Xiang T (2012) Feature mining for localised crowd counting. Bmvc 1(2):3

    Google Scholar 

  90. Shao J, Kang K, Change Loy C, Wang X (2015) Deeply learned attributes for crowded scene understanding. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4657–4666

  91. Umn. http://mha.cs.umn.edu/Movies/Crowd-Activity-All.avi

  92. Ucf-cc-50. https://www.crcv.ucf.edu/data/UCFCrowdsDataset.zip

  93. Amir Sjarif NN, Shamsuddin SM, Mohd Hashim SZ, Yuhaniz SS (2011) Crowd analysis and its applications. International conference on software engineering and computer systems. Springer, Berlin, pp 687–697

    Chapter  Google Scholar 

  94. Di Stefano L, Neri G, Viarani E (2001) Analysis of pixel-level algorithms for video surveillance applications. In: Proceedings 11th international conference on image analysis and processing, IEEE, pp 541–546

  95. Davies AC, Yin JH, Velastin SA (1995) Crowd monitoring using image processing. Electron Commun Eng J 7(1):37–47

    Article  Google Scholar 

  96. Regazzoni CS, Tesei A, Murino V (1993) A real-time vision system for crowding monitoring. In: Proceedings of IECON'93–19th annual conference of IEEE industrial electronics, IEEE, pp 1860–1864

  97. Cho SY, Chow TW, Leung CT (1999) A neural-based crowd estimation by hybrid global learning algorithm. IEEE Trans Syst Man Cybern B 29(4):535–541

    Article  Google Scholar 

  98. Yang DB, González-Banos HH, Guibas LJ (2003) Counting people in crowds with a real-time network of simple image sensors. ICCV 3:122

    Google Scholar 

  99. Wu X, Liang G, Lee KK, Xu Y (2006) Crowd density estimation using texture analysis and learning. In: 2006 IEEE international conference on robotics and biomimetics, IEEE, pp 214–219

  100. Marana AN, Costa LDF, Lotufo RA, Velastin SA (1998) On the efficacy of texture analysis for crowd monitoring. In: Proceedings SIBGRAPI'98. international symposium on computer graphics, image processing, and vision (cat. no. 98EX237), IEEE, pp 354–361

  101. Rahmalan H, Nixon MS, Carter JN (2006) On crowd density estimation for surveillance. In: IET conference on crime and security, IET, pp 540–545

  102. Chan AB, Liang ZSJ, Vasconcelos N (2008) Privacy preserving crowd monitoring: counting people without people models or tracking. In: 2008 IEEE conference on computer vision and pattern recognition, IEEE, pp 1–7

  103. Lin SF, Chen JY, Chao HX (2001) Estimation of number of people in crowded scenes using perspective transformation. IEEE Trans Syst Man Cybern A 31(6):645–654

    Article  Google Scholar 

  104. Zhao T, Nevatia R (2003) Bayesian human segmentation in crowded situations. In: 2003 IEEE computer society conference on computer vision and pattern recognition, 2003. Proceedings, vol 2, IEEE, pp II-459

  105. Leibe B, Seemann E, Schiele B (2005) Pedestrian detection in crowded scenes. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR'05), vol 1, IEEE, pp 878–885

  106. Rabaud V, Belongie S (2006) Counting crowded moving objects. In: 2006 IEEE computer society conference on computer vision and pattern recognition (CVPR'06), vol 1, IEEE, pp 705–711

  107. Ilyas N, Shahzad A, Kim K (2019) Convolutional-neural network-based image crowd counting: review, categorization, analysis, and performance evaluation. Sensors 20(1):43

    Article  Google Scholar 

  108. Fu M, Xu P, Li X, Liu Q, Ye M, Zhu C (2015) Fast crowd density estimation with convolutional neural networks. Eng Appl Artif Intell 43:81–88

    Article  Google Scholar 

  109. Wang C, Zhang H, Yang L, Liu S, Cao X (2015) Deep people counting in extremely dense crowds. In: Proceedings of the 23rd ACM international conference on Multimedia, pp 1299–1302

  110. Walach E, Wolf L (2016) Learning to count with CNN boosting. In: European conference on computer vision. Springer, Cham, pp 660–676

    Google Scholar 

  111. Liu M, Jiang J, Guo Z, Wang Z, Liu Y (2018) Crowd counting with fully convolutional neural network. In: 2018 25th IEEE international conference on image processing (ICIP), IEEE, pp 953–957

  112. Babu Sam D, Surya S, Venkatesh Babu R (2017) Switching convolutional neural network for crowd counting. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5744–5752

  113. Zhang L, Shi M, Chen Q (2018) Crowd counting via scale-adaptive convolutional neural network. In: 2018 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 1113–1121

  114. Chen X, Bin Y, Sang N, Gao C (2019) Scale pyramid network for crowd counting. In: 2019 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 1941–1950

  115. Boominathan L, Kruthiventi SS, Babu RV (2016) Crowdnet: a deep convolutional network for dense crowd counting. In: Proceedings of the 24th ACM international conference on Multimedia, pp 640–644

  116. Hossain M, Hosseinzadeh M, Chanda O, Wang Y (2019) Crowd counting using scale-aware attention networks. In: 2019 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 1280–1288

  117. Cao X, Wang Z, Zhao Y, Su F (2018) Scale aggregation network for accurate and efficient crowd counting. In: Proceedings of the European conference on computer vision (ECCV), pp 734–750

  118. Kang D, Chan A (2018) Crowd counting by adaptively fusing predictions from an image pyramid. arXiv preprint arXiv:1805.06115

  119. Shang C, Ai H, Bai B (2016) End-to-end crowd counting via joint learning local and global count. In: 2016 IEEE international conference on image processing (ICIP), IEEE, pp 1215–1219

  120. Sheng B, Shen C, Lin G, Li J, Yang W, Sun C (2016) Crowd counting via weighted VLAD on a dense attribute feature map. IEEE Trans Circuits Syst Video Technol 28(8):1788–1797

    Article  Google Scholar 

  121. Li Y, Zhang X, Chen D (2018) Csrnet: dilated convolutional neural networks for understanding the highly congested scenes. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1091–1100

  122. Sindagi VA, Patel VM (2017) Generating high-quality crowd density maps using contextual pyramid cnns. In: Proceedings of the IEEE international conference on computer vision, pp 1861–1870

  123. Zhao Z, Li H, Zhao R, Wang X (2016) Crossing-line crowd counting with two-phase deep neural networks. In: European conference on computer vision. Springer, Cham, pp 712–726

    Google Scholar 

  124. [link]. http://cvlab.uestc.edu.cn/CDECNN/DatasetChunxiRoad.zip

  125. Ma W, Huang L, Liu C (2008) Crowd estimation using multi-scale local texture analysis and confidence-based soft classification. In: 2008 second international symposium on intelligent information technology application, vol 1, IEEE, pp 142–146

  126. Ma W, Huang L, Liu C (2010) Crowd density analysis using co-occurrence texture features. In: 5th international conference on computer sciences and convergence information technology, IEEE, pp 170–175

  127. Zhang C, Li H, Wang X, Yang X (2015) Cross-scene crowd counting via deep convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 833–841

  128. Zeng X, Wu Y, Hu S, Wang R, Ye Y (2020) DSPNet: deep scale purifier network for dense crowd counting. Expert Syst Appl 141:112977

    Article  Google Scholar 

  129. Ranjan V, Le H, Hoai M (2018) Iterative crowd counting. In: Proceedings of the European conference on computer vision (ECCV), pp 270–285

  130. Gao J, Wang Q, Li X (2019) Pcc net: perspective crowd counting via spatial convolutional network. IEEE Trans Circuits Syst Video Technol 30(10):3486–3498

    Article  Google Scholar 

  131. Guo D, Li K, Zha ZJ, Wang M (2019) Dadnet: dilated-attention-deformable convnet for crowd counting. In: Proceedings of the 27th ACM international conference on multimedia, pp 1823–1832

  132. Dai F, Liu H, Ma Y, Zhang X, Zhao Q (2021) Dense scale network for crowd counting. In: Proceedings of the 2021 international conference on multimedia retrieval, pp 64–72

  133. Cenggoro TW, Aslamiah AH, Yunanto A (2019) Feature pyramid networks for crowd counting. Procedia Comput Sci 157:175–182

    Article  Google Scholar 

  134. Zhang Y, Zhou C, Chang F, Kot AC (2019) Multi-resolution attention convolutional neural network for crowd counting. Neurocomputing 329:144–152

    Article  Google Scholar 

  135. Sindagi VA, Patel VM (2019) Ha-ccn: hierarchical attention-based crowd counting network. IEEE Trans Image Process 29:323–335

    Article  MathSciNet  MATH  Google Scholar 

  136. Liu L, Lu H, Zou H, Xiong H, Cao Z, Shen C (2020) Weighing counts: sequential crowd counting by reinforcement learning. In: European conference on computer vision. Springer, Cham, pp 164–181

    Google Scholar 

  137. Shi X, Li X, Wu C, Kong S, Yang J, He L (2020) A real-time deep network for crowd counting. In: ICASSP 2020–2020 IEEE international conference on acoustics, speech and signal processing (ICASSP), IEEE, pp 2328–2332

  138. Luo A, Yang F, Li X, Nie D, Jiao Z, Zhou S, Cheng H (2020) Hybrid graph neural networks for crowd counting. Proc AAAI Conf Artif Intell 34(7):11693–11700

    Google Scholar 

  139. Zhou JT, Zhang L, Jiawei D, Peng X, Fang Z, Xiao Z, Zhu H (2021) Locality-aware crowd counting. IEEE Trans Pattern Anal Mach Intell. https://doi.org/10.1109/TPAMI.2021.3056518

    Article  Google Scholar 

  140. Xiong F, Shi X, Yeung DY (2017) Spatiotemporal modeling for crowd counting in videos. In: Proceedings of the IEEE international conference on computer vision, pp 5151–5159

  141. Gao J, Yuan Y, Wang Q (2020) Feature-aware adaptation and density alignment for crowd counting in video surveillance. IEEE Trans Cybern 51(10):4822–4833

    Article  Google Scholar 

  142. [link]. https://www.crcv.ucf.edu/data/ucf-qnrf/

  143. Idrees H, Tayyab M, Athrey K, Zhang D, Al-Maadeed S, Rajpoot N, Shah M (2018) Composition loss for counting, density map estimation and localization in dense crowds. In: Proceedings of the European conference on computer vision (ECCV), pp 532–546

  144. Wang P, Gao C, Wang Y, Li H, Gao Y (2020) MobileCount: an efficient encoder-decoder framework for real-time crowd counting. Neurocomputing 407:292–299

    Article  Google Scholar 

  145. Huang G, Liu Z, Van Der Maaten L, Weinberger KQ (2017) Densely connected convolutional networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4700–4708

  146. Yilmaz A, Javed O, Shah M (2006) Object tracking: a survey. Acm Comput Surv (CSUR) 38(4):13-es

    Article  Google Scholar 

  147. Huang Y, Huang TS, Niemann H (2002) A region-based method for model-free object tracking. In: Object recognition supported by user interaction for service robots, vol 1, IEEE, pp 592–595

  148. Schmaltz C, Rosenhahn B, Brox T, Cremers D, Weickert J, Wietzke L, Sommer G (2007). Region-based pose tracking. In: Iberian conference on pattern recognition and image analysis, Springer, Berlin, pp 56–63

  149. Lee H, Kim D (2018) Salient region-based online object tracking. In: 2018 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 1170–1177

  150. Hang Y, Derong C, Jiulu G (2018) Object tracking using both a kernel and a non-parametric active contour model. Neurocomputing 295:108–117

    Article  Google Scholar 

  151. Sun W, Dong E (2019) Kullback-Leibler distance and graph cuts based active contour model for local segmentation. Biomed Signal Process Control 52:120–127

    Article  Google Scholar 

  152. Tang F, Tao H (2005) Object tracking with dynamic feature graph. In: 2005 IEEE international workshop on visual surveillance and performance evaluation of tracking and surveillance, IEEE, pp 25–32

  153. Yuan D, Zhang X, Liu J, Li D (2019) A multiple feature fused model for visual object tracking via correlation filters. Multimed Tools Appl 78(19):27271–27290

    Article  Google Scholar 

  154. Tsai YT, Shih HC, Huang CL (2006) Multiple human objects tracking in crowded scenes. In: 18th international conference on pattern recognition (ICPR'06), vol 3, IEEE, pp 51–54

  155. Chun J, Park S (2018) RGB-D model based human detection and tracking using 3D CCTV. In: 2018 9th IEEE annual ubiquitous computing, electronics & mobile communication conference (UEMCON), IEEE, pp 758–762

  156. Soleimanitaleb Z, Keyvanrad MA, Jafari A (2019) Object tracking methods: a review. In: 2019 9th international conference on computer and knowledge engineering (ICCKE), IEEE, pp 282–288

  157. Bu F, Cai Y, Yang Y (2016) Multiple object tracking based on faster-RCNN detector and KCF tracker. [Online]: http://www-personal.umich.edu/~yeeyoung/publication/MOT.pdf

  158. Chahyati D, Fanany MI, Arymurthy AM (2017) Tracking people by detection using CNN features. Procedia Comput Sci 124:167–172

    Article  Google Scholar 

  159. Agarwal A, Suryavanshi S (2017) Real-time* multiple object tracking (MOT) for autonomous navigation. Tech. rep

  160. Nam H, Han B (2016) Learning multi-domain convolutional neural networks for visual tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4293–4302

  161. Han B, Sim J, Adam H (2017) Branchout: regularization for online ensemble tracking with convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3356–3365

  162. Chen K, Tao W (2017) Once for all: a two-flow convolutional neural network for visual tracking. IEEE Trans Circuits Syst Video Technol 28(12):3377–3386

    Article  Google Scholar 

  163. Wang X, Li C, Luo B, Tang J (2018). Sint++: robust visual tracking via adversarial positive instance generation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4864–4873

  164. Tao R, Gavves E, Smeulders AW (2016) Siamese instance search for tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1420–1429

  165. Held D, Thrun S, Savarese S (2016) Learning to track at 100 fps with deep regression networks. In: European conference on computer vision. Springer, Cham, pp 749–765

    Google Scholar 

  166. Wang T, Ling H (2017) Gracker: a graph-based planar object tracker. IEEE Trans Pattern Anal Mach Intell 40(6):1494–1501

    Article  Google Scholar 

  167. Nam H, Baek M, Han B (2016) Modeling and propagating cnns in a tree structure for visual tracking. arXiv preprint arXiv:1608.07242

  168. Gomila C, Meyer F (2003) Graph-based object tracking. In: Proceedings 2003 international conference on image processing (cat. no. 03CH37429), vol 2, IEEE, pp II-41

  169. Chen L, Peng X, Ren M (2018) Recurrent metric networks and batch multiple hypothesis for multi-object tracking. IEEE Access 7:3093–3105

    Article  Google Scholar 

  170. Maksai A, Fua P (2018) Eliminating exposure bias and loss-evaluation mismatch in multiple object tracking. arXiv preprint arXiv:1811.10984

  171. Son J, Baek M, Cho M, Han B (2017). Multi-object tracking with quadruplet convolutional neural networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5620–5629

  172. Wang B, Wang L, Shuai B, Zuo Z, Liu T, Luk Chan K, Wang G (2016) Joint learning of convolutional neural networks and temporally constrained metrics for tracklet association. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 1–8

  173. Leal-Taixé L, Canton-Ferrer C, Schindler K (2016) Learning by tracking: siamese CNN for robust target association. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 33–40

  174. Chu P, Fan H, Tan CC, Ling H (2019) Online multi-object tracking with instance-aware tracker and dynamic model refreshment. In: 2019 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 161–170

  175. Xiang J, Zhang G, Hou J (2019) Online multi-object tracking based on feature representation and Bayesian filtering within a deep learning architecture. IEEE Access 7:27923–27935

    Article  Google Scholar 

  176. Yoon K, Kim DY, Yoon YC, Jeon M (2019) Data association for multi-object tracking via deep neural networks. Sensors 19(3):559

    Article  Google Scholar 

  177. Jiang Y, Shin H, Ko H (2018) Precise regression for bounding box correction for improved tracking based on deep reinforcement learning. In: 2018 IEEE international conference on acoustics, speech and signal processing (ICASSP), IEEE, pp 1643–1647

  178. Ren L, Lu J, Wang Z, Tian Q, Zhou J (2018) Collaborative deep reinforcement learning for multi-object tracking. In: Proceedings of the European conference on computer vision (ECCV), pp 586–602

  179. Fang K, Xiang Y, Li X, Savarese S (2018) Recurrent autoregressive networks for online multi-object tracking. In: 2018 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 466–475

  180. Zhou H, Ouyang W, Cheng J, Wang X, Li H (2018) Deep continuous conditional random fields with asymmetric inter-object constraints for online multi-object tracking. IEEE Trans Circuits Syst Video Technol 29(4):1011–1022

    Article  Google Scholar 

  181. Chen L, Ai H, Shang C, Zhuang Z, Bai B (2017) Online multi-object tracking with convolutional neural networks. In: 2017 IEEE international conference on image processing (ICIP), IEEE, pp 645–649

  182. Sadeghian A, Alahi A, Savarese S (2017) Tracking the untrackable: learning to track multiple cues with long-term dependencies. In: Proceedings of the IEEE international conference on computer vision, pp 300–311

  183. Yang M, Wu Y, Jia Y (2017) A hybrid data association framework for robust online multi-object tracking. IEEE Trans Image Process 26(12):5667–5679

    Article  MathSciNet  MATH  Google Scholar 

  184. Chu Q, Ouyang W, Li H, Wang X, Liu B, Yu N (2017) Online multi-object tracking using CNN-based single object tracker with spatial-temporal attention mechanism. In: Proceedings of the IEEE international conference on computer vision, pp 4836–4845

  185. Bae SH, Yoon KJ (2017) Confidence-based data association and discriminative deep appearance learning for robust online multi-object tracking. IEEE Trans Pattern Anal Mach Intell 40(3):595–610

    Article  Google Scholar 

  186. Wang L, Xu L, Kim MY, Rigazico L, Yang MH (2017) Online multiple object tracking via flow and convolutional features. In: 2017 IEEE international conference on image processing (ICIP), IEEE, pp 3630–3634

  187. Milan A, Rezatofighi SH, Dick A, Reid I, Schindler K (2017) Online multi-target tracking using recurrent neural networks. In: Thirty-first AAAI conference on artificial intelligence

  188. Kim C, Li F, Ciptadi A, Rehg JM (2015) Multiple hypothesis tracking revisited. In: Proceedings of the IEEE international conference on computer vision, pp 4696–4704

  189. Kim C, Li F, Rehg JM (2018) Multi-object tracking with neural gating using bilinear lstm. In: Proceedings of the European conference on computer vision (ECCV), pp 200–215

  190. Babaee M, Li Z, Rigoll G (2018) Occlusion handling in tracking multiple people using RNN. In: 2018 25th IEEE international conference on image processing (ICIP), IEEE, pp 2715–2719

  191. Sheng H, Zhang Y, Chen J, Xiong Z, Zhang J (2018) Heterogeneous association graph fusion for target association in multiple object tracking. IEEE Trans Circuits Syst Video Technol 29(11):3269–3280

    Article  Google Scholar 

  192. Wen L, Du D, Li S, Bian X, Lyu S (2019) Learning non-uniform hypergraph for multi-object tracking. Proc AAAI Conf Artif Intell 33(1):8981–8988

    Google Scholar 

  193. Henschel R, Leal-Taixé L, Cremers D, Rosenhahn B (2018) Fusion of head and full-body detectors for multi-object tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 1428–1437

  194. Ma C, Yang C, Yang F, Zhuang Y, Zhang Z, Jia H, Xie X (2018) Trajectory factory: tracklet cleaving and re-connection by deep siamese bi-gru for multiple object tracking. In: 2018 IEEE international conference on multimedia and expo (ICME), IEEE, pp 1–6

  195. Ma L, Tang S, Black MJ, Van Gool L (2018) Customized multi-person tracker. In: Asian conference on computer vision. Springer, Cham, pp 612–628

    Google Scholar 

  196. Chen J, Sheng H, Zhang Y, Xiong Z (2017) Enhancing detection model for multiple hypothesis tracking. In: Proceedings of the IEEE conference on computer vision and pattern recognition workshops, pp 18–27

  197. Tang S, Andriluka M, Andres B, Schiele B (2017) Multiple people tracking by lifted multicut and person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3539–3548

  198. Gan W, Wang S, Lei X, Lee MS, Kuo CCJ (2018) Online CNN-based multiple object tracking with enhanced model updates and identity association. Signal Process: Image Commun 66:95–102

    Google Scholar 

  199. Zhu J, Yang H, Liu N, Kim M, Zhang W, Yang MH (2018) Online multi-object tracking with dual matching attention networks. In: Proceedings of the European conference on computer vision (ECCV), pp 366–382

  200. Chen L, Ai H, Zhuang Z, Shang C (2018) Real-time multiple people tracking with deeply learned candidate selection and person re-identification. In: 2018 IEEE international conference on multimedia and expo (ICME), IEEE, pp 1–6

  201. Sanchez-Matilla R, Poiesi F, Cavallaro A (2016) Online multi-target tracking with strong and weak detections. In: European conference on computer vision. Springer, Cham, pp 84–99

    Google Scholar 

  202. Kim M, Alletto S, Rigazio L (2016) Similarity mapping with enhanced siamese network for multi-object tracking. arXiv preprint arXiv:1609.09156

  203. Liang Y, Zhou Y (2018) LSTM multiple object tracker combining multiple cues. In: 2018 25th IEEE international conference on image processing (ICIP), IEEE, pp 2351–2355

  204. Lee S, Kim E (2018) Multiple object tracking via feature pyramid siamese networks. IEEE Access 7:8181–8194

    Article  Google Scholar 

  205. Fu Z, Angelini F, Naqvi SM, Chambers JA (2018) Gm-phd filter based online multiple human tracking using deep discriminative correlation matching. In: 2018 IEEE international conference on acoustics, speech and signal processing (ICASSP), IEEE, pp 4299–4303

  206. Ahsan U, Sun C, Hays J, Essa I (2017) Complex event recognition from images with few training examples. In: 2017 IEEE winter conference on applications of computer vision (WACV), IEEE, pp 669–678

  207. Sabokrou M, Fayyaz M, Fathy M, Klette R (2017) Deep-cascade: cascading 3d deep neural networks for fast anomaly detection and localization in crowded scenes. IEEE Trans Image Process 26(4):1992–2004

    Article  MathSciNet  MATH  Google Scholar 

  208. Feng Y, Yuan Y, Lu X (2017) Learning deep event models for crowd anomaly detection. Neurocomputing 219:548–556

    Article  Google Scholar 

  209. Ding T, Sznaier M, Camps OI (2008) Fast track matching and event detection. In: 2008 IEEE conference on computer vision and pattern recognition, IEEE, pp 1–8

  210. Cheriyadat AM, Radke RJ (2008) Detecting dominant motions in dense crowds. IEEE J Sel Top Signal Process 2(4):568–581

    Article  Google Scholar 

  211. Chang MB, Ullman T, Torralba A, Tenenbaum JB (2016) A compositional object-based approach to learning physical dynamics. arXiv preprint arXiv:1612.00341

  212. Wang G, Fu H, Liu Y (2016) Real time abnormal crowd behavior detection based on adjacent flow location estimation. In: 2016 4th international conference on cloud computing and intelligence systems (CCIS), IEEE, pp 476–479

  213. Wang L, Dong M (2012) Real-time detection of abnormal crowd behavior using a matrix approximation-based approach. In: 2012 19th IEEE international conference on image processing, IEEE, pp 2701–2704

  214. Chibloun A, El Fkihi S, Mliki H, Hammami M, Thami ROH (2018) Abnormal crowd behavior detection using speed and direction models. In: 2018 9th international symposium on signal, image, video and communications (ISIVC), IEEE, pp 197–202

  215. Zhang Y, Dong L, Xia Z, Li J (2015) Evolution of crowd behavior in surveillance video. In: 2015 IEEE international symposium on broadband multimedia systems and broadcasting, IEEE, pp 1–4

  216. Andersson M, Rydell J, St-Laurent L, Prévost D, Gustafsson F (2012) Crowd analysis with target tracking, K-means clustering and hidden Markov models. In: 2012 15th international conference on information fusion, IEEE, pp 1903–1910

  217. Dee HM, Caplier A (2010) Crowd behaviour analysis using histograms of motion direction. In: 2010 IEEE international conference on image processing, IEEE, pp 1545–1548

  218. Shi J et al (1994) Good features to track. In: 1994 proceedings of IEEE conference on computer vision and pattern recognition, IEEE, pp 593–600

  219. de-la-Calle-Silos E, González-Díaz I, Díaz-de-María E (2013) Mid-level feature set for specific event and anomaly detection in crowded scenes. In: 2013 IEEE international conference on image processing, IEEE, pp 4001–4005

  220. Yogameena B, Perumal SS, Packiyaraj N, Saravanan P (2012) Ma-Th algorithm for people count in a dense crowd and their behaviour classification. In: 2012 international conference on machine vision and image processing (MVIP), IEEE, pp 17–20

  221. Gibson JJ (1950) The perception of the visual world. Hooughton Mifflin Boston

  222. Kajo I, Malik AS, Kamel N (2016) An evaluation of optical flow algorithms for crowd analytics in surveillance system. In: 2016 6th international conference on intelligent and advanced systems (ICIAS), IEEE, pp 1–6

  223. Kajo I, Malik AS, Kamel N (2015) Motion estimation of crowd flow using optical flow techniques: a review. In: 2015 9th international conference on signal processing and communication systems (ICSPCS), IEEE, pp 1–9

  224. Lahiri S, Jyoti N, Pyati S, Dewan J (2018) Abnormal crowd behavior detection using image processing. In: 2018 fourth international conference on computing communication control and automation (ICCUBEA), IEEE, pp 1–5

  225. Yuan Y, Fang J, Wang Q (2014) Online anomaly detection in crowd scenes via structure analysis. IEEE Trans Cybern 45(3):548–561

    Article  Google Scholar 

  226. Yi S (2016) Pedestrian behavior modeling and understanding in crowds. The Chinese University of Hong Kong, Hong Kong

    Google Scholar 

  227. Cao L, Zhang X, Ren W, Huang K (2015) Large scale crowd analysis based on convolutional neural network. Pattern Recogn 48(10):3016–3024

    Article  Google Scholar 

  228. Kang K, Wang X (2014). Fully convolutional neural networks for crowd segmentation. arXiv preprint arXiv:1411.4464

  229. Shao J, Loy CC, Kang K, Wang X (2016) Slicing convolutional neural network for crowd video understanding. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5620–5628

  230. Burney A, Syed TQ (2016) Crowd video classification using convolutional neural networks. In: 2016 international conference on Frontiers of information technology (FIT), IEEE, pp 247–251

  231. Sabokrou M, Fayyaz M, Fathy M, Klette R (2016) Fully convolutional neural network for fast anomaly detection in crowded scenes. arXiv preprint arXiv:1609.00866

  232. Zhou S, Shen W, Zeng D, Fang M, Wei Y, Zhang Z (2016) Spatial–temporal convolutional neural networks for anomaly detection and localization in crowded scenes. Signal Process: Image Commun 47:358–368

    Google Scholar 

  233. Smeureanu S, Ionescu RT, Popescu M, Alexe B (2017) Deep appearance features for abnormal behavior detection in video. In: International conference on image analysis and processing. Springer, Cham, pp 779–789

    Google Scholar 

  234. Halder R, Chatterjee R (2020) CNN-BiLSTM model for violence detection in smart surveillance. SN Comput Sci 1(4):1–9

    Article  Google Scholar 

  235. Ding C, Fan S, Zhu M, Feng W, Jia B (2014) Violence detection in video by using 3D convolutional neural networks. In: International symposium on visual computing. Springer, Cham, pp 551–558

    Google Scholar 

  236. Li J, Jiang X, Sun T, Xu K (2019) Efficient violence detection using 3d convolutional neural networks. In: 2019 16th IEEE international conference on advanced video and signal based surveillance (AVSS), IEEE, pp 1–8

  237. http://visilab.etsii.uclm.es/personas/oscar/FightDetection/

  238. Demarty CH, Penet C, Soleymani M, Gravier G (2015) VSD, a public dataset for the detection of violent scenes in movies: design, annotation, analysis and evaluation. Multimed Tools Appl 74(17):7379–7404

    Article  Google Scholar 

  239. Li C, Zhu L, Zhu D, Chen J, Pan Z, Li X, Wang B (2018) End-to-end multiplayer violence detection based on deep 3D CNN. In: Proceedings of the 2018 VII international conference on network, communication and computing, pp 227–230

  240. Arandjelovic R, Gronat P, Torii A, Pajdla T, Sivic J (2016) NetVLAD: CNN architecture for weakly supervised place recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5297–5307

  241. Mu G, Cao H, Jin Q (2016) Violent scene detection using convolutional neural networks and deep audio features. In: Chinese conference on pattern recognition. Springer, Singapore, pp 451–463

    Chapter  Google Scholar 

  242. Abdali AMR, Al-Tuma RF (2019) Robust real-time violence detection in video using cnn and lstm. In: 2019 2nd scientific conference of computer sciences (SCCS), IEEE, pp 104–108

  243. Mumtaz A, Sargano AB, Habib Z (2018) Violence detection in surveillance videos with deep network using transfer learning. In: 2018 2nd European conference on electrical engineering and computer science (EECS), IEEE, pp 558–563

  244. Sudhakaran S, Lanz O (2017) Learning to detect violent videos using convolutional long short-term memory. In: 2017 14th IEEE international conference on advanced video and signal based surveillance (AVSS), IEEE, pp 1–6

  245. Hanson A, Pnvr K, Krishnagopal S, Davis L (2018) Bidirectional convolutional lstm for the detection of violence in videos. In: Proceedings of the European conference on computer vision (ECCV) workshops

  246. Meng Z, Yuan J, Li Z (2017) Trajectory-pooled deep convolutional networks for violence detection in videos. In: International conference on computer vision systems. Springer, Cham, pp 437–447

    Chapter  Google Scholar 

  247. Song W, Zhang D, Zhao X, Yu J, Zheng R, Wang A (2019) A novel violent video detection scheme based on modified 3d convolutional neural networks. IEEE Access 7:39172–39179

    Article  Google Scholar 

  248. Ullah FUM, Ullah A, Muhammad K, Haq IU, Baik SW (2019) Violence detection using spatiotemporal features with 3D convolutional neural network. Sensors 19(11):2472

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rajiv Singh.

Ethics declarations

Conflict of interest

There is no conflict of interests regarding the publication of this manuscript.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tyagi, B., Nigam, S. & Singh, R. A Review of Deep Learning Techniques for Crowd Behavior Analysis. Arch Computat Methods Eng 29, 5427–5455 (2022). https://doi.org/10.1007/s11831-022-09772-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11831-022-09772-1