
WAG-NAT: Window Attention and Generator Based Non-Autoregressive Transformer for Time Series Forecasting

Published: 26 September 2023

Abstract

Time series forecasting plays a crucial part in many real-world applications. Recent studies have demonstrated the power of Transformers to model long-range dependencies in time series forecasting tasks. Nevertheless, the quadratic computational complexity of self-attention remains the major obstacle to their application. Previous studies have focused on structural adjustments to the attention mechanism to achieve more efficient computation. In contrast, local attention outperforms full attention in both feature extraction and computational cost, owing to the sparsity it introduces into the attention mechanism. Moreover, inference speed is a key practical concern. In response, we develop WAG-NAT, a novel non-autoregressive Transformer based on window attention and a generator. The generator enables inference in a single forward pass. The window attention module combines a window self-attention layer, which captures local patterns, with a window interaction layer, which fuses information across windows. Experimental results show that WAG-NAT delivers a distinct improvement in prediction accuracy over RNNs, CNNs, and previous Transformer-based models across various benchmarks. Our implementation is available at https://github.com/cybisolated/WAG-NAT.
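
To make the two ideas in the abstract concrete, the sketch below shows how a window attention block (local self-attention inside fixed-size windows plus a cross-window interaction layer) and a non-autoregressive generator head might be composed in PyTorch. The class names, the mean-pooling used to summarize each window, the linear projection head, and the assumption that the sequence length divides evenly by the window size are all illustrative choices on our part, not the paper's exact design; see the linked repository for the authors' implementation.

```python
import torch
import torch.nn as nn


class WindowAttentionBlock(nn.Module):
    """Sketch of a window attention module: self-attention within fixed-size
    windows, followed by an interaction layer that fuses information across
    windows (hypothetical structure, not the paper's exact layers)."""

    def __init__(self, d_model: int, n_heads: int, window_size: int):
        super().__init__()
        self.window_size = window_size
        # Attention applied independently inside each window (local patterns).
        self.window_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        # Attention over per-window summaries (cross-window information fusion).
        self.interaction_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, d_model); seq_len assumed divisible by window_size.
        b, t, d = x.shape
        w = self.window_size
        n_win = t // w

        # 1) Window self-attention: fold windows into the batch dimension so
        #    attention never crosses a window boundary (hence the sparsity).
        xw = x.reshape(b * n_win, w, d)
        local, _ = self.window_attn(xw, xw, xw)
        x = self.norm1(x + local.reshape(b, t, d))

        # 2) Window interaction: mean-pool each window into one summary token,
        #    attend across windows, then broadcast the result back.
        summary = x.reshape(b, n_win, w, d).mean(dim=2)      # (b, n_win, d)
        fused, _ = self.interaction_attn(summary, summary, summary)
        x = x + fused.repeat_interleave(w, dim=1)            # one token per window
        return self.norm2(x)


class NonAutoregressiveGenerator(nn.Module):
    """Sketch of a generator head that emits the whole forecast horizon in one
    forward pass, instead of decoding one step at a time."""

    def __init__(self, d_model: int, horizon: int):
        super().__init__()
        self.proj = nn.Linear(d_model, horizon)

    def forward(self, h: torch.Tensor) -> torch.Tensor:
        # h: (batch, seq_len, d_model) -> pool, then predict all steps at once.
        return self.proj(h.mean(dim=1))                      # (batch, horizon)


block = WindowAttentionBlock(d_model=64, n_heads=4, window_size=12)
head = NonAutoregressiveGenerator(d_model=64, horizon=24)
x = torch.randn(8, 96, 64)           # 8 series, 96 input steps, 64 features
forecast = head(block(x))            # (8, 24): full horizon in a single pass
```

The non-autoregressive head is what the abstract's "single forward pass" claim refers to: because the generator maps the encoded history directly to all future steps, inference cost does not grow with the forecast horizon the way step-by-step autoregressive decoding does.
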

Published In

Artificial Neural Networks and Machine Learning – ICANN 2023: 32nd International Conference on Artificial Neural Networks, Heraklion, Crete, Greece, September 26–29, 2023, Proceedings, Part VI
Sep 2023, 620 pages
ISBN: 978-3-031-44222-3
DOI: 10.1007/978-3-031-44223-0
Editors: Lazaros Iliadis, Antonios Papaleonidas, Plamen Angelov, Chrisina Jayne

Publisher

Springer-Verlag, Berlin, Heidelberg

Publication History

Published: 26 September 2023

Author Tags

  1. Time series forecasting
  2. Non-autoregressive Transformer
  3. Window attention
  4. Deep learning
