Abstract
This paper proposes an enhanced object tracking architecture based on YOLOv3 and DeepSORT. YOLOv3 is used for object detection, but in this case, we have only selected the human class. For object tracking, the DeepSORT, Kalman filter, and Hungarian algorithm are used. The occlusion issue is solved using the Kalman filter. The parallel approach is used in the suggested architecture to increase the base method's speed. In a parallel approach, the input frame is fed simultaneously to the object detector and the object tracker, enabling the tracker to begin following a person as soon as the object detector recognizes them. Here, occlusion and speed are the two issues that we are concentrating on. The experimental outcomes using specific data demonstrate positive outcomes with faster operation.
References
Voigtlaender P, Krause M, Osep A, Luiten J, Sekar BBG, Geiger A, Leibe B (2019) Mots: multi-object tracking and segmentation. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 7942–7951
Luo W, Xing J, Milan A, Zhang X, Liu W, Kim TK (2021) Multiple object tracking: a literature review. Artif Intell 293:103448
Ciaparrone G, Sánchez FL, Tabik S, Troiano L, Tagliaferri R, Herrera F (2020) Deep learning in video multi-object tracking: a survey. Neurocomputing 381:61–88
Emami P, Pardalos PM, Elefteriadou L, Ranka S (2020) Machine learning methods for data association in multi-object tracking. ACM Comput Surv (CSUR) 53(4):1–34
Jainul Rinosha SM, Augasta G (2021) Review of recent advances in visual tracking techniques. Multimedia Tools Appl 80(16):24185–24203
Jeong Y, Son S, Jeong E, Lee B (2018) An integrated self-diagnosis system for an autonomous vehicle based on an IoT gateway and deep learning. Appl Sci 8(7):1164
Bansal P, Kockelman KM (2018) Are we ready to embrace connected and self-driving vehicles? A case study of Texans. Transportation 45(2):641–675
Lotfi F, Taghirad HD (2021) A framework for 3d tracking of frontal dynamic objects in autonomous cars. Expert Syst Appl 183:115343
Shi X, Chai X, Xie J, Sun T (2022) Mc-gcn: a multi-scale contrastive graph convolutional network for unconstrained face recognition with image sets. IEEE Trans Image Process 31:3046–3055
Neto JBC, Ferrari C, Marana AN, Berretti S, Bimbo AD (2022) Learning streamed attention network from descriptor images for cross-resolution 3d face recognition. ACM Trans Multimedia Comput Commun Appl (TOMM) 19(1):1–20
Rezaei F, Yazdi M (2021) Real-time crowd behavior recognition in surveillance videos based on deep learning methods. J Real-Time Image Proc 18(5):1669–1679
Swathi HY, Shivakumar G (2021) Hybrid feature-assisted neural model for crowd behavior analysis. SN Comput Sci 2(4):1–11
Mekler ED, Hornbæk K (2019) A framework for the experience of meaning in human-computer interaction. In: Proceedings of the 2019 CHI conference on human factors in computing systems, pp 1–15
Kashef M, Visvizi A, Troisi O (2021) Smart city as a smart service system: human-computer interaction and smart city surveillance systems. Comput Hum Behav 124:106923
Siddique A, Medeiros H (2020) Tracking passengers and baggage items using multi-camera systems at security checkpoints. arXiv preprint arXiv:2007.07924
Xu R, Nikouei SY, Chen Y, Polunchenko A, Song S, Deng C, Faughnan TR (2018) Real-time human objects tracking for smart surveillance at the edge. In: 2018 IEEE international conference on communications (ICC). IEEE, pp 1–6
Wang Z, Zheng L, Liu Y, Li Y, Wang S (2020) Towards real-time multi-object tracking. In: European conference on computer vision. Springer, Berlin, pp 107–122
Brunetti A, Buongiorno D, Trotta GF, Bevilacqua V (2018) Computer vision and deep learning techniques for pedestrian detection and tracking: a survey. Neurocomputing 300:17–33
Zuo M, Zhu X, Chen Y, Yu J (2022) Survey of object tracking algorithm based on Siamese network. J Phys Conf Ser 2203:012035
Tong K, Wu Y, Zhou F (2020) Recent advances in small object detection based on deep learning: a review. Image Vis Comput 97:103910
Nguyen HQ, Nguyen TB, Le TA, Le TL, Vu TH, Noe A (2019) Comparative evaluation of human detection and tracking approaches for online tracking applications. In: 2019 international conference on advanced technologies for communications (ATC). IEEE, pp 348–353
Llamazares Á, Molinos EJ, Ocaña M (2018) Detection and tracking of moving obstacles (datmo): a review. Robotica 38(5):761–774
Redmon J, Farhadi A. Yolov3: an incremental improvement. arXiv preprint arXiv:1804.02767
Soleimanitaleb Z, Keyvanrad MA, Jafari A (2019) Object tracking methods: a review. In: 2019 9th international conference on computer and knowledge engineering (ICCKE). IEEE, pp 282–288
Rakai L, Song H, Sun S, Zhang W, Yang Y (2021) Data association in multiple object tracking: a survey of recent techniques. Expert Syst Appl 116300
Sugirtha T, Sridevi MA (2022) Survey on object detection and tracking in a video sequence. In: Proceedings of international conference on computational intelligence. Springer, Berlin, pp 15–29
Tan L, Dong X, Ma Y, Yu C (2018) A multiple object tracking algorithm based on yolo detection. In: 11th international congress on image and signal processing, biomedical engineering and informatics (CISP-BMEI). IEEE, pp 1–5
Sun S, Akhtar N, Song H, Mian A, Shah M (2019) Deep affinity network for multiple object tracking. IEEE Trans Pattern Anal Mach Intell 43(1):104–119
Lee S, Kim E (2018) Multiple object tracking via feature pyramid Siamese networks. IEEE Access 7:8181–8194
Saleh F, Aliakbarian S, Rezatofighi H, Salzmann M, Gould S (2021) Probabilistic tracklet scoring and inpainting for multiple object tracking. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 14329–14339
Peng J, Gu Y, Wang Y, Wang C, Li J, Huang F (2020) Dense scene multiple object tracking with box-plane matching. In: Proceedings of the 28th ACM international conference on multimedia, pp 4615–4619
Zhang J, Zhou S, Chang X, Wan F, Wang J, Wu Y, Huang D (2020) Multiple object tracking by flowing and fusing. arXiv preprint arXiv:2001.11180
Sundararaman R, De Almeida Braga C, Marchand E, Pettre J (2021) Tracking pedestrian heads in dense crowd. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3865–3875
Stadler D, Beyerer J (2021) Multi-pedestrian tracking with clusters. In: 2021 17th IEEE international conference on advanced video and signal based surveillance (AVSS). IEEE, pp 1–10
Wojke N, Bewley A, Paulus D (2017) Simple online and realtime tracking with a deep association metric. In: 2017 IEEE international conference on image processing, pp 3645–3649
https://pjreddie.com/darknet/yolov1. Last accessed 2022/08/13
Redmon J, Farhadi A (2017) YOLO9000: better, faster, stronger. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7263–7271
Wai YJ, Bin Mohd Yussof Z, Bin Salim SI, Chuan LK (2018) Fixed point implementation of Tiny-Yolo-v2 using OpenCL on FPGA. Int J Adv Comput Sci Appl 9(10):506–512
Adarsh P, Rathi P, Kumar M (2020) YOLO v3-tiny: object detection and recognition using one stage improved model. In: 2020 6th international conference on advanced computing and communication systems (ICACCS), Mar 6. IEEE, pp 687–694
Ge Z, Liu S, Wang F, Li Z, Sun J (2021) Yolox: exceeding yolo series in. arXiv preprint arXiv:2107.08430
Xu S, Wang X, Lv W, Chang Q, Cui C, Deng K, Lai B (2022) PP-YOLOE: an evolved version of YOLO. arXiv preprint arXiv:2203.16250
Huang X, Wang X, Lv W, Bai X, Long X, Deng K, Yoshie O (2021) PP-YOLOv2: a practical object detector. arXiv preprint arXiv:2104.10419
Multiple object tracking benchmark, https://motchallenge.net/data/MOT17/. Last accessed: 2022/08/13
Kim C, Li F, Ciptadi A, Rehg JM (2015) Multiple hypothesis tracking revisited. In: Proceedings of the IEEE international conference on computer vision, pp 4696–4704
Henschel R, Leal-Taixé L, Cremers D, Rosenhahn B (2018) Fusion of head and full-body detectors for multi-object tracking. In: Computer vision and pattern recognition workshops (CVPRW)
Keuper M, Tang S, Andres B, Brox T, Schiele B (2018) Motion segmentation and multiple object tracking by correlation co-clustering. IEEE Trans Pattern Anal Mach Intell 42(1):140–153
Chen L, Ai H, Zhuang Z, Shang C (2018) Real-time multiple people tracking with deeply learned candidate selection and person re-identification. ICME
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Tyagi, B., Nigam, S., Singh, R. (2023). Human Detection and Tracking Based on YOLOv3 and DeepSORT. In: Sharma, H., Shrivastava, V., Bharti, K.K., Wang, L. (eds) Communication and Intelligent Systems. ICCIS 2022. Lecture Notes in Networks and Systems, vol 686. Springer, Singapore. https://doi.org/10.1007/978-981-99-2100-3_11
Download citation
DOI: https://doi.org/10.1007/978-981-99-2100-3_11
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-2099-0
Online ISBN: 978-981-99-2100-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)