Parameter-Efficient Vehicle Trajectory Prediction Based on Attention-Enhanced Liquid Structural Neural Model
<p>Overall framework of the proposed Attn-LTC model based on Liquid Time-Constant (LTC) networks for trajectory prediction. The proposed algorithm is composed of data preprocessing and vectorization, Temporal Attention-enhanced LTC Encoder, and Spatial Attention-enhanced LTC Decoder modules.</p> "> Figure 2
<p>The algorithmic flow of the Temporal Attention-enhanced LTC Encoder module in the proposed Attn-LTC model.</p> "> Figure 3
<p>Illustration of the Liquid Time-Constant (LTC) network structure.</p> "> Figure 4
<p>The algorithmic flow of the Spatial Attention-enhanced LTC Decoder module in the proposed Attn-LTC model.</p> "> Figure 5
<p>Visualization of training loss using various numbers of neurons from 8 to 64. (<b>a</b>) Training loss of proposed Attn-LTC models with 8, 16, 24, 32, or 64 neurons. (<b>b</b>) Training loss of Attn-LTC and Attn-LSTM models with 8, 16, or 24 neurons.</p> "> Figure 6
<p>Comparison of RMSE values for the proposed Attn-LTC model and LSTM counterparts. The number of neurons vary from 8 to 32. The prediction horizon ranges from 1 s to 5 s.</p> "> Figure 7
<p>Comparison of number of parameters in the proposed Attn-LTC model and Attn-LSTM baselines using various numbers of neurons from 8 to 32.</p> "> Figure 8
<p>Inference latency of the proposed Attn-LTC models using various numbers of neurons from 8 to 32.</p> "> Figure 9
<p>Comparison of RMSE values using the proposed Attn-LTC model under different lanes.</p> "> Figure 10
<p>Comparison of RMSE values using the proposed Attn-LTC model under different roadmaps and time.</p> "> Figure 11
<p>Comparison of RMSE values using the proposed Attn-LTC model for different numbers of neighboring vehicles. The number of neurons is 24.</p> "> Figure 12
<p>Visualization of the average temporal attention weights across different traffic lanes.</p> "> Figure 13
<p>Visualization of wiring patterns for LTC neurons. (<b>a</b>) Attn-LTC with 8 motor neurons and sensory neurons. (<b>b</b>) Attn-LTC with 16 motor neurons and sensory neurons. (<b>c</b>) Attn-LTC with 24 motor neurons and sensory neurons.</p> ">
Abstract
:1. Introduction
- A Temporal Attention-enhanced LTC Encoder is developed to effectively capture long-term temporal dependencies and dynamic behaviors from historical trajectory data.
- A Spatial Attention-enhanced LTC Decoder is introduced, emphasizing the influence of neighboring vehicles and spatial interactions to improve prediction accuracy.
- The model demonstrates significant computational efficiency, achieving superior prediction accuracy with a much smaller parameter size compared to traditional LSTM-based models [10,11,13]. Extensive experiments on the NGSIM dataset [17] validate the effectiveness of the Attn-LTC model, showcasing its suitability for real-time deployment in resource-constrained environments.
- Section 2 reviews the relevant literature on trajectory prediction, highlighting advancements in deep learning methods, attention mechanisms, and dependency modeling.
- Section 3 introduces the proposed Attn-LTC model, detailing its framework, representation techniques, encoding and decoding modules, and training methodology.
- Section 4 presents the experimental setup, evaluation metrics, baselines, and the results of performance comparisons and ablation studies.
- Section 5 concludes the paper, summarizing the findings and outlining potential directions for future research.
2. Related Literature Study
2.1. Deep Learning Methods for Trajectory Prediction
2.2. Dependency Modeling in Trajectory Prediction
2.2.1. Long Short-Term Memory (LSTM)
2.2.2. Attention Mechanism
3. Proposed Attn-LTC Model for Trajectory Prediction
3.1. Overall Framework
- First, the data preprocessing and vectorization module transforms the historical spatial grid information into a structured input format.
- Second, the Temporal Attention-enhanced LTC Encoder encodes the temporal dynamics of the target and neighboring vehicles by applying LTC cells enhanced with temporal attention to capture temporal dependencies.
- Finally, the Spatial Attention-enhanced LTC Decoder decodes the encoded states, using spatial attention to emphasize the influence of neighboring vehicles and predict the future trajectory of the target vehicle.
3.2. Spatial and Temporal Representation for Target and Neighbor Vehicles
3.3. Temporal Attention-Enhanced LTC Encoder for Trajectory Fusion
3.4. Spatial Attention-Enhanced LTC Decoder for Trajectory Prediction
3.5. Training
- Bounded Dynamics: Normalized weights inside the LTC cells are used to guarantees of numerical stability during inference and training. This mechanism ensures that the hidden states remain within finite ranges even for long sequences.
- Gradient Clipping: The recursive calculation of BPTT leads to explosive gradients, leading to instability in training. By capping gradients at a specified threshold, it ensures stable and efficient optimization, especially for long sequences.
4. Experiments
4.1. Dataset Specifications
4.2. Evaluation Metrics
4.3. Experimental Environment and Training Configurations
4.4. Baselines
- 1.
- Constant velocity (CV): The model uses a vehicle’s constant speed for trajectory prediction.
- 2.
- LSTM with fully connected social pooling (S-LSTM) [11]: The model incorporates a social pooling layer to capture interactions among individuals and predict future trajectories in crowded spaces.
- 3.
- LSTM with convolutional social pooling (CS-LSTM) [12]: An LSTM encoder–decoder model with a convolutional social pooling layer to improve interaction modeling between vehicles, combined with maneuver-based trajectory prediction for robust and multi-modal future predictions.
- 4.
- Multi-head attention LSTM (MHA-LSTM) [22]: The model leverages multi-head attention to capture higher-order interactions among vehicles and predict multi-modal trajectories, enabling long-range dependency modeling and accurate motion forecasting.
- 5.
- Dynamic and static context-aware attention network (DSCAN) [14]: The algorithm models inter-vehicle interactions using attention mechanisms and incorporates static environmental constraints for improved trajectory prediction accuracy.
- 6.
- Spatial interaction-aware Transformer (SIT) [15]: The model integrates temporal dependencies and spatial interactions through multi-head self-attention modules for precise long-term trajectory predictions.
- 7.
- Dual learning model (DLM) [29]: The model uses Occupancy Maps and Risk Maps in an encoder–decoder structure to capture inter-vehicle interactions and risk-based spatial relationships for accurate and efficient trajectory predictions.
- 8.
- Spatial–temporal attentive LSTM (STAM-LSTM) [13]: The model employs spatial and temporal attention mechanisms to extract critical features from historical trajectories for enhanced vehicle trajectory prediction.
4.5. Ablation Study
4.5.1. Ablation Experiments on Model Size
4.5.2. Ablation Experiments on LSTM and LTC
4.6. Comparison Results
4.7. Visualization and Qualitative Analysis
5. Discussion and Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Nayakanti, N.; Al-Rfou, R.; Zhou, A.; Goel, K.; Refaat, K.S.; Sapp, B. Wayformer: Motion forecasting via simple & efficient attention networks. In Proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA), London, UK, 29 May–2 June 2023; IEEE: New York, NY, USA, 2023; pp. 2980–2987. [Google Scholar]
- Li, H.; Xing, W.; Jiao, H.; Yuen, K.F.; Gao, R.; Li, Y.; Matthews, C.; Yang, Z. Bi-directional information fusion-driven deep network for ship trajectory prediction in intelligent transportation systems. Transp. Res. Part E Logist. Transp. Rev. 2024, 192, 103770. [Google Scholar] [CrossRef]
- Song, H.; Ding, W.; Chen, Y.; Shen, S.; Wang, M.Y.; Chen, Q. Pip: Planning-informed trajectory prediction for autonomous driving. In Proceedings of the Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, 23–28 August 2020; Proceedings, Part XXI 16. Springer: New York, NY, USA, 2020; pp. 598–614. [Google Scholar]
- Xie, X.; Zhang, C.; Zhu, Y.; Wu, Y.N.; Zhu, S.C. Congestion-aware multi-agent trajectory prediction for collision avoidance. In Proceedings of the 2021 IEEE International Conference on Robotics and Automation (ICRA), Xi’an, China, 30 May–5 June 2021; IEEE: New York, NY, USA, 2021; pp. 13693–13700. [Google Scholar]
- Biktairov, Y.; Stebelev, M.; Rudenko, I.; Shliazhko, O.; Yangel, B. PRANK: Motion Prediction based on RANKing. In Proceedings of the Advances in Neural Information Processing Systems, Vancouver, BC, Canada, 6–12 December 2020. [Google Scholar]
- Casas, S.; Luo, W.; Urtasun, R. IntentNet: Learning to Predict Intention from Raw Sensor Data. In Proceedings of the 2nd Conference on Robot Learning, Zürich, Switzerland, 29–31 October 2018. [Google Scholar]
- Cui, H.; Radosavljevic, V.; Chou, F.C.; Lin, T.H.; Nguyen, T.; Huang, T.K.; Schneider, J.; Djuric, N. Multimodal trajectory predictions for autonomous driving using deep convolutional networks. In Proceedings of the 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada, 20–24 May 2019; IEEE: New York, NY, USA, 2019; pp. 2090–2096. [Google Scholar]
- Djuric, N.; Radosavljevic, V.; Cui, H.; Nguyen, T.; Chou, F.C.; Lin, T.H.; Singh, N.; Schneider, J. Uncertainty-aware Short-term Motion Prediction of Traffic Actors for Autonomous Driving. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Snowmass, CO, USA, 1–5 March 2020. [Google Scholar]
- Zeng, W.; Liang, M.; Liao, R.; Urtasun, R. Lanercnn: Distributed representations for graph-centric motion forecasting. In Proceedings of the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, Czech Republic, 27 September–1 October 2021; IEEE: New York, NY, USA, 2021; pp. 532–539. [Google Scholar]
- Altché, F.; de La Fortelle, A. An LSTM network for highway trajectory prediction. In Proceedings of the 2017 IEEE 20th International Conference on Intelligent Transportation Systems (ITSC), Yokohama, Japan, 16–19 October 2017; IEEE: New York, NY, USA, 2017; pp. 353–359. [Google Scholar]
- Alahi, A.; Goel, K.; Ramanathan, V.; Robicquet, A.; Fei-Fei, L.; Savarese, S. Social lstm: Human trajectory prediction in crowded spaces. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 961–971. [Google Scholar]
- Deo, N.; Trivedi, M.M. Convolutional social pooling for vehicle trajectory prediction. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, Salt Lake City, UT, USA, 18–23 June 2018; pp. 1468–1476. [Google Scholar]
- Jiang, R.; Xu, H.; Gong, G.; Kuang, Y.; Liu, Z. Spatial-temporal attentive LSTM for vehicle-trajectory prediction. ISPRS Int. J. Geo-Inf. 2022, 11, 354. [Google Scholar] [CrossRef]
- Yu, J.; Zhou, M.; Wang, X.; Pu, G.; Cheng, C.; Chen, B. A dynamic and static context-aware attention network for trajectory prediction. ISPRS Int. J. Geo-Inf. 2021, 10, 336. [Google Scholar] [CrossRef]
- Li, X.; Xia, J.; Chen, X.; Tan, Y.; Chen, J. SIT: A spatial interaction-aware transformer-based model for freeway trajectory prediction. ISPRS Int. J. Geo-Inf. 2022, 11, 79. [Google Scholar] [CrossRef]
- Hasani, R.; Lechner, M.; Amini, A.; Rus, D.; Grosu, R. Liquid time-constant networks. In Proceedings of the AAAI Conference on Artificial Intelligence, Vancouver, BC, Canada, 2–9 February 2021; Volume 35, pp. 7657–7666. [Google Scholar]
- Department of Transportation Federal Highway Administration. Next Generation Simulation (NGSIM) Vehicle Trajectories and Supporting Data. Available online: https://ops.fhwa.dot.gov/trafficanalysistools/ngsim.htm (accessed on 15 November 2024).
- Dong, H.; Zhuang, W.; Chen, B.; Yin, G.; Wang, Y. Enhanced eco-approach control of connected electric vehicles at signalized intersection with queue discharge prediction. IEEE Trans. Veh. Technol. 2021, 70, 5457–5469. [Google Scholar] [CrossRef]
- Dong, H.; Hu, Q.; Li, D.; Li, Z.; Song, Z. Predictive Battery Thermal and Energy Management for Connected and Automated Electric Vehicles. IEEE Trans. Intell. Transp. Syst. 2024, 1–13. [Google Scholar] [CrossRef]
- Dong, H.; Wang, Q.; Zhuang, W.; Yin, G.; Gao, K.; Li, Z.; Song, Z. Flexible eco-cruising strategy for connected and automated vehicles with efficient driving lane planning and speed optimization. IEEE Trans. Transp. Electrif. 2024, 10, 1530–1540. [Google Scholar] [CrossRef]
- Lin, L.; Li, W.; Bi, H.; Qin, L. Vehicle trajectory prediction using LSTMs with spatial–temporal attention mechanisms. IEEE Intell. Transp. Syst. Mag. 2021, 14, 197–208. [Google Scholar] [CrossRef]
- Messaoud, K.; Yahiaoui, I.; Verroust-Blondet, A.; Nashashibi, F. Attention based vehicle trajectory prediction. IEEE Trans. Intell. Veh. 2020, 6, 175–185. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, Ł.; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30, 6000–6010. [Google Scholar]
- Aydemir, G.; Akan, A.K.; Güney, F. Adapt: Efficient multi-agent trajectory prediction with adaptation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, Paris, France, 1–6 October 2023; pp. 8295–8305. [Google Scholar]
- Wang, M.; Zhu, X.; Yu, C.; Li, W.; Ma, Y.; Jin, R.; Ren, X.; Ren, D.; Wang, M.; Yang, W. Ganet: Goal area network for motion forecasting. In Proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA), London, UK, 29 May–2 June 2023; IEEE: New York, NY, USA, 2023; pp. 1609–1615. [Google Scholar]
- Ngiam, J.; Vasudevan, V.; Caine, B.; Zhang, Z.; Chiang, H.T.L.; Ling, J.; Roelofs, R.; Bewley, A.; Liu, C.; Venugopal, A.; et al. Scene transformer: A unified architecture for predicting future trajectories of multiple agents. In Proceedings of the International Conference on Learning Representations, Virtual, 3–7 May 2021. [Google Scholar]
- Zhou, Z.; Ye, L.; Wang, J.; Wu, K.; Lu, K. Hivt: Hierarchical vector transformer for multi-agent motion prediction. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, New Orleans, LA, USA, 18–24 June 2022; pp. 8823–8833. [Google Scholar]
- Da, F.; Zhang, Y. Path-aware graph attention for hd maps in motion prediction. In Proceedings of the 2022 International Conference on Robotics and Automation (ICRA), Philadelphia, PA, USA, 23–27 May 2022; IEEE: New York, NY, USA, 2022; pp. 6430–6436. [Google Scholar]
- Khakzar, M.; Rakotonirainy, A.; Bond, A.; Dehkordi, S.G. A dual learning model for vehicle trajectory prediction. IEEE Access 2020, 8, 21897–21908. [Google Scholar] [CrossRef]
# of LTC Neurons | Prediction Horizon | ||||
---|---|---|---|---|---|
1 s | 2 s | 3 s | 4 s | 5 s | |
Attn-LTC-8 | 0.64 | 1.20 | 1.86 | 2.73 | 3.81 |
Attn-LTC-16 | 0.49 | 0.99 | 1.60 | 2.38 | 3.32 |
Attn-LTC-24 | 0.49 | 0.98 | 1.57 | 2.33 | 3.24 |
Attn-LTC-32 | 0.48 | 0.99 | 1.60 | 2.35 | 3.24 |
Model | Prediction Horizon | ||||
---|---|---|---|---|---|
1 s | 2 s | 3 s | 4 s | 5 s | |
CV | 0.73 | 1.78 | 3.13 | 4.78 | 6.68 |
S-LSTM [11] | 0.65 | 1.31 | 2.16 | 3.25 | 4.55 |
CS-LSTM [12] | 0.61 | 1.27 | 2.09 | 3.10 | 4.37 |
MHA-LSTM [22] | 0.56 | 1.22 | 2.01 | 3.00 | 4.25 |
DSCAN [14] | 0.58 | 1.26 | 2.03 | 2.98 | 4.13 |
SIT [15] | 0.58 | 1.23 | 1.99 | 2.96 | 4.05 |
DLM [29] | 0.41 | 0.95 | 1.72 | 2.64 | 3.87 |
STAM-LSTM [13] | 0.43 | 0.96 | 1.60 | 2.37 | 3.24 |
Attn-LTC (This Work) | 0.49 | 0.98 | 1.57 | 2.33 | 3.24 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Published by MDPI on behalf of the World Electric Vehicle Association. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, R.; Chen, Y.; Ding, R.; Ye, Q. Parameter-Efficient Vehicle Trajectory Prediction Based on Attention-Enhanced Liquid Structural Neural Model. World Electr. Veh. J. 2025, 16, 19. https://doi.org/10.3390/wevj16010019
Wang R, Chen Y, Ding R, Ye Q. Parameter-Efficient Vehicle Trajectory Prediction Based on Attention-Enhanced Liquid Structural Neural Model. World Electric Vehicle Journal. 2025; 16(1):19. https://doi.org/10.3390/wevj16010019
Chicago/Turabian StyleWang, Ruochen, Yue Chen, Renkai Ding, and Qing Ye. 2025. "Parameter-Efficient Vehicle Trajectory Prediction Based on Attention-Enhanced Liquid Structural Neural Model" World Electric Vehicle Journal 16, no. 1: 19. https://doi.org/10.3390/wevj16010019
APA StyleWang, R., Chen, Y., Ding, R., & Ye, Q. (2025). Parameter-Efficient Vehicle Trajectory Prediction Based on Attention-Enhanced Liquid Structural Neural Model. World Electric Vehicle Journal, 16(1), 19. https://doi.org/10.3390/wevj16010019