Abstract
To ensure safety, most public spaces now deploy monitoring systems. However, in most scenarios, the tracking operations of these monitoring systems are performed manually. These operations should be automated. This paper proposes using a conditional random field (CRF) to formulate the automatic execution problem as a cost minimization problem. The appearance of pedestrians and the time taken by them to cross the view of a camera are used to solve the automatic execution problem. Crowd psychology is used to define constraints and construct a CRF graph. A Siamese convolutional neural network is employed to recognize pedestrian appearance. The time spent by pedestrians crossing the view of a camera is modeled using a normal distribution. The results of two models are considered as the costs of nodes and edges. The proposed algorithm is applied under constraints to determine matches at the minimum cost. The accuracy of the proposed method is compared with that of other methods by using common datasets and benchmarks. Superior results are obtained when both appearance and spatiotemporal information are employed for solving the automatic execution problem than when using appearance alone.
Similar content being viewed by others
References
Shitrit HB, Berclaz J, Fleuret F, Fua P (2014) Multi-commodity network flow for tracking multiple people. IEEE Trans Pattern Anal Mach Intell 36(8):1614–1627
Chen X, Le A, Bhanu B (2015) Multitarget tracking in nonoverlapping cameras using a reference set. IEEE Sens J 15(5):2692–2704
Chen X, Huang K, Tan T (2014) Object tracking across non-overlapping views by learning inter-camera transfer models. Pattern Recogn 47(3):1126–1137
Chen X, Bhanu B (2017) Integrating social grouping for multitarget tracking across cameras in a CRF model. IEEE Trans Circuits Syst Video Technol 27(11):2382–2394
Helbing D, Molnar P (1995) Social force model for pedestrian dynamics. Phys Rev E 51(5):4282
Aveni AF (1977) The not-so-lonely crowd: Friendship groups in collective behavior. Sociometry, pp 96–99
Moussaïd M, Perozo N, Garnier S, Helbing D, Theraulaz G (2010) The walking behaviour of pedestrian social groups and its impact on crowd dynamics. PLoS ONE 5(4):e10047
Ge W, Collins TR, Ruback RB (2012) Vision-based analysis of small groups in pedestrian crowds. IEEE Tans Pattern Anal Mach Intell 34(5):1003–1016
Javed O, Rasheed Z, Shafique K, Shah M (2008) Tracking across multiple cameras with disjoint views. In: Proceedings of the Ninth IEEE International Conference on Computer Vision, pp 952
Makris D, Ellis T, Black J (2004) Bridging the gaps between cameras. In: Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2004), vol 2
Hadsell R, Chopra S, LeCun Y (2006) Dimensionality reduction by learning an invariant mapping. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR2006), pp 1735–1742
Heili A, Chen C, Odobez J—M (2011) Detection-based multi-human tracking using a CRF model. In IEEE International Conference on Computer Vision Workshops (ICCV Workshops)
Yang B, Nevatia R (2012) An online learned CRF model for multi-target tracking. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR2012)
Reinhard E, Adhikhmin M, Gooch B (2001) Color transfer between images. IEEE Comput Graphics Appl 21(5):34–41
Kolmogorov V, Zabih R (2004) What energy functions can be minimized via graph cut? IEEE Trans Patten Anal Mach Intell 2:147–159
NLPR MCT Dataset. [Online]. http://mct.idealtest.org/Datasets.html
Chen W, Cao L, Chen X, Huang K (2017) An equalized global graph model-based approach for multicamera object tracking. IEEE Trans Circuits Syst Video Technol 27(11):2367–2381
Acknowledgements
This work was supported in part by the "Allied Advanced Intelligent Biomedical Research Center, STUST" from Higher Education Sprout Project, Ministry of Education, Taiwan, and in part by the Ministry of Science and Technology (MOST) of Taiwan under Grant MOST 109-2221-E-218-026.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Cheng, ST., Hsu, CW., Horng, GJ. et al. Across-camera object tracking using a conditional random field model. J Supercomput 77, 14252–14279 (2021). https://doi.org/10.1007/s11227-021-03862-w
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-021-03862-w