CNA-DeepSORT algorithm for multi-target tracking

Kaili Feng¹,
Wenxiao Huo¹,
Wenhao Xu²,
Meng Li¹ &
…
Tianping Li¹

528 Accesses
2 Citations
Explore all metrics

Abstract

In recent years, multi-target tracking algorithms have been developed rapidly. However, in multi-target tracking, mutual occlusion and cross between targets and sudden disappearance and reappearance of targets in videos can easily occur, which could only result in missed detection, false detection, and wrong ID switching. To address the above problems, the CenterNet attention DeepSORT algorithm (CNA-DeepSORT) proposed in this paper incorporates a CenterNet network with channel attention mechanism in the original detection part of the DeepSORT algorithm instead of Faster R-CNN, and designs a multi-scale feature extraction module with the DeepSORT algorithm in the multi-scale feature extraction module and designed a pedestrian recognition network combined with the DeepSORT algorithm. These improvements lead to a 3.7% improvement in MOTA metric, 1.6% improvement in MOTP metric, 238 fewer false ID switches, 2627 fewer FP metrics, 3943 fewer FN metrics, a decrease in run speed, and a 4 Hz reduction in frame rate compared to the original DeepSORT algorithm. improved by 3. 7, and there is some improvement in handling the occlusion problem of multi-target tracking, and the false and missed detection of targets during ID switching is reduced.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic

£29.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Institutional subscriptions

Pedestrian Multi-object Tracking Algorithm Based on Attention Feature Fusion

Multi-object tracking using context-sensitive enhancement via feature fusion

Article 27 July 2023

Pedestrian Multi-object Tracking Based on ResNeXt and FairMOT

Data availability

Not applicable for that section.

References

Arulampalam MS, Maskell S, Gordon N et al (2002) A Tutorial on Particle Filters for Online Nonlinear/Non-Gaussian Bayesian Tracking[J]. IEEE Trans Signal Process 50(2):174–188
Article Google Scholar
Bewley A, Ge Z, Ott L, et al. Simple Online and Realtime Tracking[C]// 2016 IEEE International Conference on Image Processing (ICIP). IEEE, 2016.
Bhatti UA, et al. 2022. Local Similarity-Based Spatia–Spectral Fusion Hyperspectral Image Classification With Deep CNN and Gabor Filtering," in IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1–15, Art no. 5514215. https://doi.org/10.1109/TGRS.2021.3090410.
Bhatti UA et al (2021) Time Series Analysis and Forecasting of Air Pollution Particulate Matter (PM2.5): An SARIMA and Factor Analysis Approach. IEEE Access 9:41019–41031. https://doi.org/10.1109/ACCESS.2021.3060744
Article Google Scholar
Bhatti UA, Huang M, Di Wu, Zhang Yu, Mehmood A, Han H (2019) Recommendation system using feature extraction and pattern recognition in clinical care systems. Ent Inform Syst 13(3):329–351. https://doi.org/10.1080/17517575.2018.1557256
Article Google Scholar
Bhatti UA, Huang M, Hao Wang Yu, Zhang AM, Di Wu (2018) Recommendation system for immunization coverage and monitoring. Hum Vaccin Immunother 14(1):165–171. https://doi.org/10.1080/21645515.2017.1379639
Article Google Scholar
Blackman SS (2004) Multiple hypothesis tracking for multiple target tracking[J]. IEEE Aerosp Electron Syst Mag 19(1):5–18
Article Google Scholar
Bossard L, Guillaumin. 2014. Gool L V . Food-101 – Mining Discriminative Components with Random Forests[J]. Springer International Publishing
Choi W. (2016) Near-Online Multi-target Tracking with Aggregated Local Flow Descriptor. IEEE Int Conf Comp Vis IEEE.
Everingham M, Winn J. (2011) The pascal visual object classes challenge 2012 (voc2012) development kit[J]. Pattern Analysis, Statistical Modelling and Computational Learning, Tech. Rep. 8: 5.
Fukunaga K, Hostetler L (1975) The estimation of the gradient of a density function, with applications in pattern recognition[J]. IEEE Trans Inf Theory 21(1):32–40
Article MathSciNet Google Scholar
Girshick R. 2015. Fast R-CNN[J]. Computer Science
Hu J, Shen L, Sun G. (2018) Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conf Comp Vis Pattern Recog. 7132–7141.
Kuhn HW (2010) The Hungarian method for the assignment problem[J]. Nav Res Logist 52(1–2):7–21
Google Scholar
Lan W. (2020) Target tracking and risk avoidance system for intelligent driving system based on 5G signal anomaly detection[J]. International Journal of Communication Systems.
Leal-Taixé L, Milan A, Reid I, et al. (2015) Motchallenge 2015: Towards a benchmark for multi-target tracking[J]. arXiv preprint arXiv:1504.01942.
Lee B, Erdenee E, Jin S et al (2016) Multi-Class Multi-Object Tracking using Changing Point Detection[J]. Springer, Cham
Book Google Scholar
Liang Z, Zhi B, Sun Y et al (2016) MARS: A Video Benchmark for Large-Scale Person Re-Identification[J]. Springer, Cham
Google Scholar
Lin T Y, Goyal P , Girshick R, et al. (2017). Focal Loss for Dense Object Detection[J]. IEEE Trans Pattern Anal Mach Intell. (99):2999–3007.
Liu W, Anguelov D, Erhan D et al (2016) SSD: Single Shot MultiBox Detector[J]. Springer, Cham
Google Scholar
Luo W, Xing J, Zhang X , et al. Multiple Object Tracking: A Literature Review[J]. Eprint Arxiv, 2015.
Milan A, Leal-Taixe L, Reid I, et al. 2016. MOT16: A Benchmark for Multi-Object Tracking[J].
Redmon J, Divvala S , Girshick R , et al. (2016) You Only Look Once: Unified, Real-Time Object Detection[J]. IEEE.
Redmon J, Farhadi A. (2017). YOLO9000: Better, Faster, Stronger[J]. IEEE Conference on Computer Vision & Pattern Recognition. 6517–6525.
Redmon J , Farhadi A. (2018). YOLOv3: An Incremental Improvement[J]. arXiv e-prints.
Ren S, He K, Girshick R et al (2017) Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks[J]. IEEE Trans Pattern Anal Mach Intell 39(6):1137–1149
Article Google Scholar
Roccetti M, Delnevo G, Casini L, et al. (2019) Is bigger always better? A controversial journey to the center of machine learning design, with uses and misuses of big data for predicting water meter failures[J]. J Big Data. 6(1).
Roecker JA (1994) A class of near optimal JPDA algorithms[J]. IEEE Trans Aerosp Electron Syst 30(2):504–510
Article Google Scholar
Roecker JA (1994) A class of near optimal JPDA algorithms[J]. Aerospace Electr Syst IEEE Trans 30(2):504–510
Article Google Scholar
Rui L A , Bz A , Zhu T A , et al. (2022) An End-to-End Identity Association Network based on Geometry Refinement for Multi-Object Tracking[J]. Pattern Recognition.
Sanchez-Matilla R, Poiesi F, Cavallaro A. (2016) Online Multi-target Tracking with Strong and Weak Detections. Springer Int Pub. Springer International Publishing.
Simonyan K, Zisserman A. (2014) Very deep convolutional networks for large-scale image recognition[J]. arXiv preprint arXiv:1409.1556.
Smiatek J, Jung A, Bluhmki E. (2021) Validation Is Not Verification: Precise Terminology and Scientific Methods in Bioprocess Modeling, Trends in Biotechnology, 39(11): (1117–1119).
Strippoli S. Personal Digital Assistant (PDA)[J]. 2002.
Strippoli S. (2002) Personal Digital Assistant (PDA)[J].
Sun S, Akhtar N, Song H, et al. 2019. Deep Affinity Network for Multiple Object Tracking[J]. IEEE Trans on Pattern Anal Mach Intell. PP(99).
Xiang Y, Alahi A, Savarese S. (2015) Learning to track: Online multi-object tracking by decision making[C]. Proc IEEE Int Conf Comp Vis. 4705–4713.
Yang B, Yan J, Lei Z, et al. 2014. Aggregate channel features for multi-view face detection[C]. IEEE Int Joint Conf Biomet. IEEE. 1–8.
Yu F, Li W, Li Q, et al. (2016) Poi: Multiple object tracking with high performance detection and appearance feature. Eur Conf Comp Vis. Springer, Cham. 36–42.
Yu F, Wang D, Shelhamer E , et al. (2017). Deep Layer Aggregation[J]. arXiv
Zagoruyko S. (2016) Komodakis N . Wide Residual Networks[J].
Zeiler M D, Fergus R. (2014) Visualizing and understanding convolutional networks[C]. Eur Conf Comp Vis. Springer, Cham. 818–833.
Zheng Z, Zheng L, Yang Y. 2017. Pedestrian Alignment Network for Large-scale Person Re-identification[J]. IEEE Trans Circuits Syst Vid Tech. PP(99).
Zhou X, Wang D, Krähenbühl P. 2019. Objects as points[J]. arXiv preprint arXiv:1904.07850.

Download references

Author information

Authors and Affiliations

School of Physics and Electronics, Shandong Normal University, Jinan, 250014, Shandong, China
Kaili Feng, Wenxiao Huo, Meng Li & Tianping Li
School of Physics and Electronics Information, Dezhou University, Dezhou, 253023, Shandong, China
Wenhao Xu

Authors

Kaili Feng
View author publications
You can also search for this author in PubMed Google Scholar
Wenxiao Huo
View author publications
You can also search for this author in PubMed Google Scholar
Wenhao Xu
View author publications
You can also search for this author in PubMed Google Scholar
Meng Li
View author publications
You can also search for this author in PubMed Google Scholar
Tianping Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Formal analysis, Kaili Feng and Wenxiao Huo; Methodology, Kaili Feng and Wenxiao Huo; Supervision, Wenhao Xu and Tianping Li; Writing—original draft, Kaili Feng and Wenxiao Huo; Writing—review & editing, Kaili Feng and Meng Li.

Corresponding author

Correspondence to Tianping Li.

Ethics declarations

Ethical approval

This paper is an original research achievement obtained by the author, and no one draft is submitted more than once; The content of the paper does not involve state secrets; It has not been published in any form in any language at home and abroad; The content of the paper shall not infringe upon the copyright and other rights of others. In case of multiple submissions, infringement, disclosure and other problems, the author of the paper will bear full responsibility.

Consent to participate

All authors consent to participate.

Consent for publication

All authors consent for publication.

Conflicts of interest

The authors declare no conflict of interest. The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Feng, K., Huo, W., Xu, W. et al. CNA-DeepSORT algorithm for multi-target tracking. Multimed Tools Appl 83, 4731–4755 (2024). https://doi.org/10.1007/s11042-023-15813-z

Download citation

Received: 22 April 2022
Revised: 26 March 2023
Accepted: 10 May 2023
Published: 29 May 2023
Issue Date: January 2024
DOI: https://doi.org/10.1007/s11042-023-15813-z

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Pedestrian Multi-object Tracking Algorithm Based on Attention Feature Fusion

Multi-object tracking using context-sensitive enhancement via feature fusion

Pedestrian Multi-object Tracking Based on ResNeXt and FairMOT

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethical approval

Consent to participate

Consent for publication

Conflicts of interest

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Keywords

Subscribe and save

Buy Now

Navigation

CNA-DeepSORT algorithm for multi-target tracking

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Pedestrian Multi-object Tracking Algorithm Based on Attention Feature Fusion

Multi-object tracking using context-sensitive enhancement via feature fusion

Pedestrian Multi-object Tracking Based on ResNeXt and FairMOT

Data availability

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethical approval

Consent to participate

Consent for publication

Conflicts of interest

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now

Search

Navigation