Abstract
Multi-target vehicle trajectory prediction holds significant importance in the field of autonomous driving. Accurate trajectory prediction can enhance the safety and efficiency of autonomous vehicles, reducing traffic accidents and congestion. However, existing methods often fall short when dealing with complex traffic scenarios. Traditional approaches typically rely on single-scale spatiotemporal feature extraction, which struggles to fully capture the complex dynamics of traffic across different temporal and spatial scales, especially in high-density traffic environments. To address these challenges, this thesis proposes a multi-target vehicle trajectory prediction method based on a Multi-Scale Graph Convolutional Network (MSGCN). This method integrates high-definition semantic maps and employs a spatiotemporal multi-head attention mechanism alongside an adaptive dynamic weighting module to achieve efficient multi-target vehicle trajectory prediction. Specifically, this thesis constructs a dynamic feature repository using vehicle subgraphs and lane subgraphs to stabilize model weight fluctuations, thereby more accurately reflecting actual traffic conditions. Experimental results on the Argoverse dataset demonstrate the effectiveness of our method. Specifically, our approach reduces the average displacement error (mADE) by 7% and enhances the final displacement error (mFDE) by 19% when compared to existing state-of-the-art models. Our code is made available at https://github.com/Garegreen/EfficientMSGCN.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
Data is provided within the manuscript or supplementary information files. Our code is made available at https://github.com/Garegreen/EfficientMSGCN.
References
Qiao Sun, Xin Huang J, Gu et al (2022) M2i: from factored marginal trajectory prediction to interactive prediction. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 6543–6552
Dooseop Choi KW (2022) Min Hierarchical latent structure for multi-modal vehicle trajectory forecasting. In: European conference on computer vision, pp 129–125
Jinxin Liu Y, Luo H, Xiong et al (2019) An integrated approach to probabilistic vehicle trajectory prediction via driver characteristic and intention estimation. In: International conference on intelligent transportation, pp 3526–3532
Yuan Y, Weng X, Ou Y et al (2021) Agentformer: agent-aware transformers for socio-temporal multi-agent forecasting. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 9813–9823
Wang M, Zhu X, Yu C et al (2022) Ganet: goal area network for motion forecasting. arXiv:abs/2209.09723
Lu Zhang P, Li J, Chen et al (2022) Trajectory prediction with graph-based dual-scale context fusion. In: IEEE/RSJ international conference on intelligent robots and systems (IROS), pp 11374–11381
Jia X, Sun L, Tomizuka M et al (2021) Ide-net: interactive driving event and pattern extraction from human data. In: IEEE robotics and automation letters, pp 3065–3072
Shaoshuai Shi L, Jiang D, Dai et al (2022) Motion transformer with global intention localization and local movement refinement. arXiv:abs/2209.13508
Yulong Cao C, Xiao A, Anandkumar et al (2022) Advdo: realistic adversarial attacks for trajectory prediction. In: European conference on computer vision, pp 36–52
Holger Caesar V, Bankiti, Alex H, Lang et al (2020) nuscenes: a multi-modal dataset for autonomous driving. In: Proc. IEEE Conf. on computer vision and pattern recognition (CVPR), pp 1–3
Chang MF, Lambert J, Sangkloy P et al (2019) Argoverse: 3d tracking and forecasting with rich maps. In: Proc. IEEE conf. on computer vision and pattern recognition (CVPR), pp 1–5
Gao J, Sun C, Zhao H et al (2020) Vectornet: Encoding HD maps and agent dynamics from vectorized representation In: IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp 11522–11530
Liang M, Yang B, Hu R et al (2020) Learning lane graph representations for motion forecasting, pp 543–546
Chen X, Zhang H, Zhao F et al (2022) Intention-aware vehicle trajectory prediction based on spatial-temporal dynamic attention network for Internet of vehicles. IEEE Trans Intell Transp Syst 23:19471–19483
Liang M, Yang B, Hu R et al (2020) Learning lane graph representations for motion forecasting. In: Proc. Of the European conf. on computer vision (ECCV), pp 1–13
Ye M, Cao T, Chen Q (2021) TPCN: Temporal point cloud networks for motion forecasting. In: Proc. IEEE conf. on computer vision and pattern recognition (CVPR), pp 1–14
Thomas Gilles S, Sabatini D, Tsishkou et al (2022) Thomas: trajectory heat map output with learned multi-agent sampling. In: Proc. of the international conf. on learning representations (ICLR), pp 1–7
Ngiam J, Caine B, Vasudevan V et al (2022) Scene transformer: a unified architecture for predicting multiple agent trajectories. In: Proc. of the international conf. on learning representations, pp 1–13
Liu Y, Zhang J, Fang L et al (2021) Multimodal motion prediction with stacked transformers. In: IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp 7573–7582
Tao Lu Y, Watanabe H Takada (2024) Geo-spatial and temporal relation driven vehicle motion prediction: a geographic perspective. IEEE Trans Intell Veh. https://doi.org/10.1109/TIV.2024.3407210
Tao Lu Y, Watanabe H, Takada (2022) A lightweight long-term vehicular motion prediction method leveraging spatial database and kinematic trajectory data. ISPRS Int J Geo-Inf 11(9):463–480
Huang Z, Mo X, Lv C (2021) Multi-modal motion prediction with transformer-based neural network for autonomous driving, In: IEEE international conference on robotics and automation (ICRA), pp 2605–2609
Zhang Y, Han JH, Kwon YW et al (2020) A new architecture of feature pyramid network for object detection. In: IEEE 6th international conference on computer and communications (ICCC), pp 1224–1228
Ba JL, Kiros JR, Hinton GE (2016) Layer normalization. arxiv:1607.06450
He K, Zhang X, Ren S et al (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Liu D, Yu J (2009) Otsu method and K-means. In: 2009 Ninth International conference on hybrid intelligent systems. IEEE, pp 344–349
Girgis R, Golemo F, Codevilla F et al (2021) Latent variable sequential set transformers for joint multi-agent motion prediction. arxiv:2104.00563
Gu J, Sun C, Zhao H (2021) Densetnt: End-to-end trajectory prediction from dense goal sets. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 15303–15312
Zhou Z, Ye L, Wang J et al (2022) Hivt: hierarchical vector transformer for multi-agent motion prediction. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 8823–8833
Da F, Zhang Y (2022) Path-aware graph attention for HD maps in motion prediction. In: 2022 International conference on robotics and automation (ICRA), pp 6430–6436
Maosheng Ye T, Cao Q, Chen (2021) TPCN: temporal point cloud networks for motion forecasting. arXiv:abs/2103.03067
Funding
Not applicable.
Author information
Authors and Affiliations
Contributions
Wang and Gu wrote the main manuscript text and prepared figures. Cheng finished running and writing the code. Li and Huang reviewed the manuscript and refined it further.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Gu, X., Wang, J., Cheng, D. et al. Efficient multi-target vehicle trajectory prediction based on multi-scale graph convolution. Pattern Anal Applic 28, 12 (2025). https://doi.org/10.1007/s10044-024-01396-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10044-024-01396-4