Abstract
In recent years, multi-target tracking algorithms have developed rapidly. However, mutual occlusion and crossing between targets, as well as the sudden disappearance and reappearance of targets in video, occur easily in multi-target tracking and can result in missed detections, false detections, and incorrect ID switches. To address these problems, the CenterNet attention DeepSORT algorithm (CNA-DeepSORT) proposed in this paper replaces Faster R-CNN in the detection part of the original DeepSORT algorithm with a CenterNet network that has a channel attention mechanism, and designs a multi-scale feature extraction module and a pedestrian recognition network that are combined with the DeepSORT algorithm. Compared to the original DeepSORT algorithm, these improvements yield a 3.7% improvement in MOTA, a 1.6% improvement in MOTP, 238 fewer ID switches, 2627 fewer false positives (FP), and 3943 fewer false negatives (FN), at the cost of a lower run speed, with the frame rate reduced by 4 Hz. The algorithm also handles the occlusion problem in multi-target tracking somewhat better and reduces false and missed detections of targets during ID switching.
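The FP, FN, and ID-switch counts reported above are the three error terms that enter MOTA. As a minimal sketch, assuming the standard CLEAR MOT definition (MOTA = 1 - (FN + FP + IDSW) / GT) rather than anything specific to this paper's evaluation code, the relationship can be written as follows. The function name `mota` and the counts in the usage lines are hypothetical and chosen only for illustration; they are not results from the paper.

```python
def mota(false_negatives: int, false_positives: int,
         id_switches: int, ground_truth: int) -> float:
    """Multiple Object Tracking Accuracy under the standard CLEAR MOT definition.

    All counts are summed over every frame of the evaluated sequences.
    `ground_truth` is the total number of ground-truth object instances.
    """
    if ground_truth <= 0:
        raise ValueError("ground_truth must be a positive count")
    errors = false_negatives + false_positives + id_switches
    return 1.0 - errors / ground_truth


# Hypothetical counts, for illustration only (not taken from the paper).
baseline = mota(false_negatives=300, false_positives=150,
                id_switches=50, ground_truth=1000)
improved = mota(false_negatives=250, false_positives=120,
                id_switches=30, ground_truth=1000)
print(f"baseline MOTA = {baseline:.3f}, improved MOTA = {improved:.3f}")
```

Because all three error counts share the same denominator, a reduction in any of them raises MOTA by the same amount per removed error, which is why the FP, FN, and ID-switch reductions are reported alongside the MOTA gain.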
Data availability
Not applicable.
Author information
Contributions
Formal analysis, Kaili Feng and Wenxiao Huo; Methodology, Kaili Feng and Wenxiao Huo; Supervision, Wenhao Xu and Tianping Li; Writing—original draft, Kaili Feng and Wenxiao Huo; Writing—review & editing, Kaili Feng and Meng Li.
Ethics declarations
Ethical approval
This paper is original research by the authors and has not been submitted to more than one venue. Its content does not involve state secrets, has not been published in any form or in any language at home or abroad, and does not infringe upon the copyright or other rights of others. In the event of multiple submissions, infringement, disclosure, or similar problems, the authors will bear full responsibility.
Consent to participate
All authors consent to participate.
Consent for publication
All authors consent to publication.
Conflicts of interest
The authors declare no conflict of interest: they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
About this article
Cite this article
Feng, K., Huo, W., Xu, W. et al. CNA-DeepSORT algorithm for multi-target tracking. Multimed Tools Appl 83, 4731–4755 (2024). https://doi.org/10.1007/s11042-023-15813-z
DOI: https://doi.org/10.1007/s11042-023-15813-z