[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ Skip to main content
Log in

Efficient multi-target vehicle trajectory prediction based on multi-scale graph convolution

  • Original Article
  • Published:
Pattern Analysis and Applications Aims and scope Submit manuscript

Abstract

Multi-target vehicle trajectory prediction holds significant importance in the field of autonomous driving. Accurate trajectory prediction can enhance the safety and efficiency of autonomous vehicles, reducing traffic accidents and congestion. However, existing methods often fall short when dealing with complex traffic scenarios. Traditional approaches typically rely on single-scale spatiotemporal feature extraction, which struggles to fully capture the complex dynamics of traffic across different temporal and spatial scales, especially in high-density traffic environments. To address these challenges, this thesis proposes a multi-target vehicle trajectory prediction method based on a Multi-Scale Graph Convolutional Network (MSGCN). This method integrates high-definition semantic maps and employs a spatiotemporal multi-head attention mechanism alongside an adaptive dynamic weighting module to achieve efficient multi-target vehicle trajectory prediction. Specifically, this thesis constructs a dynamic feature repository using vehicle subgraphs and lane subgraphs to stabilize model weight fluctuations, thereby more accurately reflecting actual traffic conditions. Experimental results on the Argoverse dataset demonstrate the effectiveness of our method. Specifically, our approach reduces the average displacement error (mADE) by 7% and enhances the final displacement error (mFDE) by 19% when compared to existing state-of-the-art models. Our code is made available at https://github.com/Garegreen/EfficientMSGCN.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
£29.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price includes VAT (United Kingdom)

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability

Data is provided within the manuscript or supplementary information files. Our code is made available at https://github.com/Garegreen/EfficientMSGCN.

References

  1. Qiao Sun, Xin Huang J, Gu et al (2022) M2i: from factored marginal trajectory prediction to interactive prediction. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 6543–6552

  2. Dooseop Choi KW (2022) Min Hierarchical latent structure for multi-modal vehicle trajectory forecasting. In: European conference on computer vision, pp 129–125

  3. Jinxin Liu Y, Luo H, Xiong et al (2019) An integrated approach to probabilistic vehicle trajectory prediction via driver characteristic and intention estimation. In: International conference on intelligent transportation, pp 3526–3532

  4. Yuan Y, Weng X, Ou Y et al (2021) Agentformer: agent-aware transformers for socio-temporal multi-agent forecasting. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 9813–9823

  5. Wang M, Zhu X, Yu C et al (2022) Ganet: goal area network for motion forecasting. arXiv:abs/2209.09723

  6. Lu Zhang P, Li J, Chen et al (2022) Trajectory prediction with graph-based dual-scale context fusion. In: IEEE/RSJ international conference on intelligent robots and systems (IROS), pp 11374–11381

  7. Jia X, Sun L, Tomizuka M et al (2021) Ide-net: interactive driving event and pattern extraction from human data. In: IEEE robotics and automation letters, pp 3065–3072

  8. Shaoshuai Shi L, Jiang D, Dai et al (2022) Motion transformer with global intention localization and local movement refinement. arXiv:abs/2209.13508

  9. Yulong Cao C, Xiao A, Anandkumar et al (2022) Advdo: realistic adversarial attacks for trajectory prediction. In: European conference on computer vision, pp 36–52

  10. Holger Caesar V, Bankiti, Alex H, Lang et al (2020) nuscenes: a multi-modal dataset for autonomous driving. In: Proc. IEEE Conf. on computer vision and pattern recognition (CVPR), pp 1–3

  11. Chang MF, Lambert J, Sangkloy P et al (2019) Argoverse: 3d tracking and forecasting with rich maps. In: Proc. IEEE conf. on computer vision and pattern recognition (CVPR), pp 1–5

  12. Gao J, Sun C, Zhao H et al (2020) Vectornet: Encoding HD maps and agent dynamics from vectorized representation In: IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp 11522–11530

  13. Liang M, Yang B, Hu R et al (2020) Learning lane graph representations for motion forecasting, pp 543–546

  14. Chen X, Zhang H, Zhao F et al (2022) Intention-aware vehicle trajectory prediction based on spatial-temporal dynamic attention network for Internet of vehicles. IEEE Trans Intell Transp Syst 23:19471–19483

    Article  Google Scholar 

  15. Liang M, Yang B, Hu R et al (2020) Learning lane graph representations for motion forecasting. In: Proc. Of the European conf. on computer vision (ECCV), pp 1–13

  16. Ye M, Cao T, Chen Q (2021) TPCN: Temporal point cloud networks for motion forecasting. In: Proc. IEEE conf. on computer vision and pattern recognition (CVPR), pp 1–14

  17. Thomas Gilles S, Sabatini D, Tsishkou et al (2022) Thomas: trajectory heat map output with learned multi-agent sampling. In: Proc. of the international conf. on learning representations (ICLR), pp 1–7

  18. Ngiam J, Caine B, Vasudevan V et al (2022) Scene transformer: a unified architecture for predicting multiple agent trajectories. In: Proc. of the international conf. on learning representations, pp 1–13

  19. Liu Y, Zhang J, Fang L et al (2021) Multimodal motion prediction with stacked transformers. In: IEEE/CVF conference on computer vision and pattern recognition (CVPR), pp 7573–7582

  20. Tao Lu Y, Watanabe H Takada (2024) Geo-spatial and temporal relation driven vehicle motion prediction: a geographic perspective. IEEE Trans Intell Veh. https://doi.org/10.1109/TIV.2024.3407210

    Article  MATH  Google Scholar 

  21. Tao Lu Y, Watanabe H, Takada (2022) A lightweight long-term vehicular motion prediction method leveraging spatial database and kinematic trajectory data. ISPRS Int J Geo-Inf 11(9):463–480

    Article  MATH  Google Scholar 

  22. Huang Z, Mo X, Lv C (2021) Multi-modal motion prediction with transformer-based neural network for autonomous driving, In: IEEE international conference on robotics and automation (ICRA), pp 2605–2609

  23. Zhang Y, Han JH, Kwon YW et al (2020) A new architecture of feature pyramid network for object detection. In: IEEE 6th international conference on computer and communications (ICCC), pp 1224–1228

  24. Ba JL, Kiros JR, Hinton GE (2016) Layer normalization. arxiv:1607.06450

  25. He K, Zhang X, Ren S et al (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778

  26. Liu D, Yu J (2009) Otsu method and K-means. In: 2009 Ninth International conference on hybrid intelligent systems. IEEE, pp 344–349

  27. Girgis R, Golemo F, Codevilla F et al (2021) Latent variable sequential set transformers for joint multi-agent motion prediction. arxiv:2104.00563

  28. Gu J, Sun C, Zhao H (2021) Densetnt: End-to-end trajectory prediction from dense goal sets. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 15303–15312

  29. Zhou Z, Ye L, Wang J et al (2022) Hivt: hierarchical vector transformer for multi-agent motion prediction. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 8823–8833

  30. Da F, Zhang Y (2022) Path-aware graph attention for HD maps in motion prediction. In: 2022 International conference on robotics and automation (ICRA), pp 6430–6436

  31. Maosheng Ye T, Cao Q, Chen (2021) TPCN: temporal point cloud networks for motion forecasting. arXiv:abs/2103.03067

Download references

Funding

Not applicable.

Author information

Authors and Affiliations

Authors

Contributions

Wang and Gu wrote the main manuscript text and prepared figures. Cheng finished running and writing the code. Li and Huang reviewed the manuscript and refined it further.

Corresponding author

Correspondence to Dengyang Cheng.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gu, X., Wang, J., Cheng, D. et al. Efficient multi-target vehicle trajectory prediction based on multi-scale graph convolution. Pattern Anal Applic 28, 12 (2025). https://doi.org/10.1007/s10044-024-01396-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10044-024-01396-4

Keywords

Navigation