[go: up one dir, main page]
More Web Proxy on the site http://driver.im/ skip to main content
research-article

CSR: : Cascade Conditional Variational Auto Encoder with Socially-aware Regression for Pedestrian Trajectory Prediction

Published: 01 January 2023 Publication History

Highlights

The proposed trajectory prediction method consists of a cascaded CVAE module and a socially aware regression module.
The cascaded CVAE module decouples and balances the loss function with respect to time steps and minimizes the losses at every time steps independently.
The socially aware regression module corrects the predictions by checking the compatibility between the interaction coding and the crude predicted trajectories.

Abstract

Pedestrian trajectory prediction is a key technology in many real applications such as video surveillance, social robot navigation, and autonomous driving, and significant progress has been made in this research topic. However, there remain two limitations of previous studies. First, the losses of the last time steps are heavier weighted than that of the beginning time steps in the objective function at the learning stage, causing the prediction errors generated at the beginning to accumulate to large errors at the last time steps at the inference stage. Second, the prediction results of multiple pedestrians in the prediction horizon might be socially incompatible with the interactions modeled by past trajectories. To overcome these limitations, this work proposes a novel trajectory prediction method called CSR, which consists of a cascaded conditional variational autoencoder (CVAE) module and a socially-aware regression module. The CVAE module estimates the future trajectories in a cascaded sequential manner. Specifically, each CVAE concatenates the past trajectories and the predicted location points so far as the input and predicts the adjacent location at the following time step. The socially-aware regression module generates offsets from the estimated future trajectories to produce the corrected predictions, which are more reasonable and accurate than the estimated trajectories. Experiments results demonstrate that the proposed method exhibits significant improvements over state-of-the-art methods on the Stanford Drone Dataset (SDD) and the ETH/UCY dataset of approximately 38.0% and 22.2%, respectively. The code is available at https://github.com/zhouhao94/CSR.

References

[1]
J. Hong, B. Sapp, J. Philbin, Rules of the road: Predicting driving behavior with a convolutional model of semantic interactions, The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
[2]
P. Raksincharoensak, T. Hasegawa, M. Nagai, Motion planning and control of autonomous driving intelligence system based on risk potential optimization framework, Int. J. Automot. Eng. 7 (AVEC14) (2016) 53–60.
[3]
Y. Luo, P. Cai, A. Bera, D. Hsu, W.S. Lee, D. Manocha, Porca: modeling and planning for autonomous driving among many pedestrians, IEEE Rob. Autom. Lett. 3 (4) (2018) 3418–3425.
[4]
H. Xue, D. Huynh, M. Reynolds, Location-velocity attention for pedestrian trajectory prediction, 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), 2019, pp. 2038–2047,.
[5]
K. Mangalam, H. Girase, S. Agarwal, K.-H. Lee, E. Adeli, J. Malik, A. Gaidon, It is not the journey but the destination: Endpoint conditioned trajectory prediction, European Conference on Computer Vision, Springer, 2020, pp. 759–776.
[6]
A. Robicquet, A. Sadeghian, A. Alahi, S. Savarese, Learning social etiquette: Human trajectory understanding in crowded scenes, European conference on computer vision, Springer, 2016, pp. 549–565.
[7]
S. Pellegrini, A. Ess, K. Schindler, L. Van Gool, You’ll never walk alone: Modeling social behavior for multi-target tracking, 2009 IEEE 12th International Conference on Computer Vision, IEEE, 2009, pp. 261–268.
[8]
A. Lerner, Y. Chrysanthou, D. Lischinski, Crowds by example, Computer graphics forum, volume 26, Wiley Online Library, 2007, pp. 655–664.
[9]
S. Lefèvre, C. Laugier, J. Ibañez Guzmán, Exploiting map information for driver intention estimation at road intersections, 2011 ieee intelligent vehicles symposium (iv), IEEE, 2011, pp. 583–588.
[10]
C.E. Ramussen, C. Williams, Gaussian processes for machine learning (adaptive computation and machine learning), 2006, (????).
[11]
R. Toledo-Moreo, M.A. Zamora-Izquierdo, Imm-based lane-change prediction in highways with low-cost gps/ins, IEEE Trans. Intell. Transp. Syst. 10 (1) (2009) 180–185.
[12]
X. Li, X. Ying, M.C. Chuah, Grip: Graph-based interaction-aware trajectory prediction, 2019 IEEE Intelligent Transportation Systems Conference (ITSC), IEEE, 2019, pp. 3960–3966.
[13]
J. Gao, C. Sun, H. Zhao, Y. Shen, D. Anguelov, C. Li, C. Schmid, Vectornet: Encoding hd maps and agent dynamics from vectorized representation, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 11525–11533.
[14]
A. Mohamed, K. Qian, M. Elhoseiny, C. Claudel, Social-stgcnn: A social spatio-temporal graph convolutional neural network for human trajectory prediction, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020, pp. 14424–14432.
[15]
H. Zhou, D. Ren, H. Xia, M. Fan, X. Yang, H. Huang, Ast-gnn: an attention-based spatio-temporal graph neural network for interaction-aware pedestrian trajectory prediction, Neurocomputing 445 (2021) 298–308.
[16]
S. Zamboni, Z.T. Kefato, S. Girdzijauskas, C. Norén, L. Dal Col, Pedestrian trajectory prediction with convolutional neural networks, Pattern Recognit. 121 (2022) 108252,.
[17]
A. Sadeghian, V. Kosaraju, A. Gupta, S. Savarese, A. Alahi, Trajnet: towards a benchmark for human trajectory prediction, arXiv preprint (2018).
[18]
S. Hochreiter, J. Schmidhuber, Long short-term memory, Neural Comput. 9 (8) (1997) 1735–1780.
[19]
J. Chung, C. Gulcehre, K. Cho, Y. Bengio, Empirical evaluation of gated recurrent neural networks on sequence modeling, NIPS 2014 Workshop on Deep Learning, December 2014, 2014.
[20]
A. Alahi, K. Goel, V. Ramanathan, A. Robicquet, L. Fei-Fei, S. Savarese, Social lstm: Human trajectory prediction in crowded spaces, Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 961–971.
[21]
Y. Xu, J. Yang, S. Du, Cf-lstm: Cascaded feature-based long short-term networks for predicting pedestrian trajectory, Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, 2020, pp. 12541–12548.
[22]
A. Gupta, J. Johnson, L. Fei-Fei, S. Savarese, A. Alahi, Social gan: Socially acceptable trajectories with generative adversarial networks, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 2255–2264.
[23]
A. Sadeghian, V. Kosaraju, A. Sadeghian, N. Hirose, H. Rezatofighi, S. Savarese, Sophie: An attentive gan for predicting paths compliant to social and physical constraints, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 1349–1358.
[24]
J. Liang, L. Jiang, J.C. Niebles, A.G. Hauptmann, L. Fei-Fei, Peeking into the future: Predicting future person activities and locations in videos, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, pp. 5725–5734.
[25]
P. Zhang, J. Xue, P. Zhang, N. Zheng, W. Ouyang, Social-aware pedestrian trajectory prediction via states refinement LSTM, IEEE Trans. Pattern Anal. Mach. Intell. (2020),.
[26]
Z. Pei, X. Qi, Y. Zhang, M. Ma, Y.-H. Yang, Human trajectory prediction in crowded scene using social-affinity long short-term memory, Pattern Recognit. 93 (2019) 273–282.
[27]
Z. Huang, J. Wang, L. Pi, X. Song, L. Yang, Lstm based trajectory prediction model for cyclist utilizing multiple interactions with environment, Pattern Recognit. 112 (2021) 107800,.
[28]
T. Salzmann, B. Ivanovic, P. Chakravarty, M. Pavone, Trajectron++: Dynamically-feasible trajectory forecasting with heterogeneous data, Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XVIII 16, Springer, 2020, pp. 683–700.
[29]
J. Li, F. Yang, M. Tomizuka, C. Choi, Evolvegraph: multi-agent trajectory prediction with dynamic relational reasoning, Adv. Neural Inf. Process. Syst. 33 (2020) 19783–19794.
[30]
H. Zhao, J. Gao, T. Lan, C. Sun, B. Sapp, B. Varadarajan, Y. Shen, Y. Shen, Y. Chai, C. Schmid, et al., Tnt: target-driven trajectory prediction, arXiv preprint arXiv:2008.08294 (2020).
[31]
K. Mangalam, Y. An, H. Girase, J. Malik, From goals, waypoints & paths to long term human trajectory forecasting, Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 15233–15242.
[32]
H. Girase, H. Gang, S. Malla, J. Li, A. Kanehara, K. Mangalam, C. Choi, Loki: Long term and key intentions for trajectory prediction, Proceedings of the IEEE/CVF International Conference on Computer Vision, 2021, pp. 9803–9812.
[33]
C. Wong, B. Xia, Q. Peng, X. You, Msn: multi-style network for trajectory prediction, arXiv preprint arXiv:2107.00932 (2021).
[34]
N. Deo, M.M. Trivedi, Trajectory forecasts in unknown environments conditioned on grid-based plans, arXiv preprint arXiv:2001.00735 (2020).
[35]
V. Kosaraju, A. Sadeghian, R. Martín-Martín, I. Reid, H. Rezatofighi, S. Savarese, Social-biGAT: multimodal trajectory forecasting using bicycle-GAN and graph attention networks, Adv. Neural Inf. Process. Syst. 32 (2019) 137–146.
[36]
A. Paszke, S. Gross, F. Massa, A. Lerer, J. Bradbury, G. Chanan, T. Killeen, Z. Lin, N. Gimelshein, L. Antiga, et al., Pytorch: an imperative style, high-performance deep learning library, Adv. Neural Inf. Process. Syst. 32 (2019) 8026–8037.
[37]
D.P. Kingma, J. Ba, Adam: A method for stochastic optimization, International Conference on Learning Representations (ICLR), 2015.
[38]
N. Lee, W. Choi, P. Vernaza, C.B. Choy, P.H.S. Torr, M. Chandraker, Desire: Distant future prediction in dynamic scenes with interacting agents, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2017, pp. 336–345.
[39]
J. Liang, L. Jiang, A. Hauptmann, Simaug: Learning robust representations from simulation for trajectory prediction, European Conference on Computer Vision, Springer, 2020, pp. 275–292.
[40]
A. Bhattacharyya, M. Hanselmann, M. Fritz, B. Schiele, C.-N. Straehle, Conditional flow variational autoencoders for structured sequence prediction, 4th Workshop on Bayesian Deep Learning, bayesiandeeplearning. org, 2019.
[41]
J. Li, H. Ma, M. Tomizuka, Conditional generative neural system for probabilistic trajectory prediction, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), IEEE, 2019, pp. 6150–6156.

Cited By

View all
  • (2024)SocialCVAEProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i6.28439(6216-6224)Online publication date: 20-Feb-2024

Index Terms

  1. CSR: Cascade Conditional Variational Auto Encoder with Socially-aware Regression for Pedestrian Trajectory Prediction
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image Pattern Recognition
      Pattern Recognition  Volume 133, Issue C
      Jan 2023
      828 pages

      Publisher

      Elsevier Science Inc.

      United States

      Publication History

      Published: 01 January 2023

      Author Tags

      1. Pedestrian trajectory prediction
      2. Socially-aware model
      3. Conditional variational autoencoder (CVAE)

      Author Tags

      1. 11-01
      2. 99-00

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 20 Jan 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)SocialCVAEProceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence and Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence and Fourteenth Symposium on Educational Advances in Artificial Intelligence10.1609/aaai.v38i6.28439(6216-6224)Online publication date: 20-Feb-2024

      View Options

      View options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media